Proceedings of TDWG : Conference Abstract
|
Corresponding author: Birgitta König-Ries (birgitta.koenig-ries@uni-jena.de), Dagmar Triebel (triebel@bsm.mwn.de), Falko Glöckler (falko.gloeckler@mfn-berlin.de)
Received: 11 Aug 2017 | Published: 11 Aug 2017
© 2017 Birgitta König-Ries, Dagmar Triebel, Robert Huber, Falko Glöckler, Anton Güntsch, Janine Felden, Felicitas Löffler, Jana Hoffmann
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation: König-Ries B, Triebel D, Huber R, Glöckler F, Güntsch A, Felden J, Löffler F, Hoffmann J (2017) Setting up an Interdisciplinary Data Infrastructure: Why Cooperation between Domain Experts and Computer Scientists Matters - An Experience Report from the GFBio Project. Proceedings of TDWG 1: e20198. https://doi.org/10.3897/tdwgproceedings.1.20198
|
The German Federation for Biological Data (GFBio;
GFBio is currently in its second funding phase. Essential services, required for the operation of the future infrastructure, have been successfully implemented. The realized technologies and tools use globally accepted standards as well as innovative concepts e.g., for data visualisation or semantic integration.
A portal (https://www.gfbio.org) provides a common point of access to all GFBio services: data submission, data discovery, data visualisation and analysis, a terminology service, and a help desk. In addition, archived research data is shared with international information infrastructures such as the Global Biodiversity Information Facility (GBIF) and the Biological Collection Access Service (BioCASE).
As the data centers use different systems and thus internally build upon different data structures (
Since the aim was to provide an integrated, faceted search, it was necessary to agree on common fields that can be used to feed the facets. Therefore, the GFBio data centers agreed on using ABCD 2.06 (Access to Biological Collection Data) as a common standard and specified thirty elements for data exchange. Here, it was essential to bring together (1) domain experts for defining which facets they consider useful for an effective search, (2) computer scientists for providing the implementation based on Elasticsearch (
With that, primary collection and research data are available with metadata and data units according to the ABCD community standard and are ready to be reused following the FAIR data principles (
data curation, data management, research data, data archiving, FAIR principles, data portal, data standards, Access to Biological Collection Data, GFBio
Felicitas Löffler, Jana Hoffmann