BACKGROUND:Despite wide acceptance in medical research, implementation of the FAIR (findability, accessibility, interoperability, and reusability) principles in certain health domains and interoperability across data ...
详细信息
BACKGROUND:Despite wide acceptance in medical research, implementation of the FAIR (findability, accessibility, interoperability, and reusability) principles in certain health domains and interoperability across data sources remain a challenge. While clinical trial registries collect metadata about clinical studies, numerous epidemiological and public health studies remain unregistered or lack detailed information about relevant study documents. Making valuable data from these studies available to the research community could improve our understanding of various diseases and their risk factors. The National Research Data Infrastructure for Personal Health Data (NFDI4Health) seeks to optimize data sharing among the clinical, epidemiological, and public health research communities while preserving privacy and ethical regulations.
OBJECTIVE:We aimed to develop a tailored metadata schema (MDS) to support the standardized publication of health studies' metadata in NFDI4Health services and beyond. This study describes the development, structure, and implementation of this MDS designed to improve the FAIRness of metadata from clinical, epidemiological, and public health research while maintaining compatibility with metadata models of other resources to ease interoperability.
METHODS:Based on the models of DataCite, ***, and other data models and international standards, the first MDS version was developed by the NFDI4Health Task Force COVID-19. It was later extended in a modular fashion, combining generic and NFDI4Health use case-specific metadata items relevant to domains of nutritional epidemiology, chronic diseases, and record linkage. Mappings to schemas of clinical trial registries and international and local initiatives were performed to enable interfacing with external resources. The MDS is represented in Microsoft Excel spreadsheets. A transformation into an improved and interactive machine-readable format was completed using the ART-DECOR (Advanced
The BioCreative National Library of Medicine (NLM)-Chem track calls for a community effort to fine-tune automated recognition of chemical names in the biomedical literature. Chemicals are one of the most searched biom...
The BioCreative National Library of Medicine (NLM)-Chem track calls for a community effort to fine-tune automated recognition of chemical names in the biomedical literature. Chemicals are one of the most searched biomedical entities in PubMed, and - as highlighted during the coronavirus disease 2019 pandemic - their identification may significantly advance research in multiple biomedical subfields. While previous community challenges focused on identifying chemical names mentioned in titles and abstracts, the full text contains valuable additional detail. We, therefore, organized the BioCreative NLM-Chem track as a community effort to address automated chemical entity recognition in full-text articles. The track consisted of two tasks: (i) chemical identification and (ii) chemical indexing. The chemical identification task required predicting all chemicals mentioned in recently published full-text articles, both span [i.e. named entity recognition (NER)] and normalization (i.e. entity linking), using Medical Subject Headings (MeSH). The chemical indexing task required identifying which chemicals reflect topics for each article and should therefore appear in the listing of MeSH terms for the document in the MEDLINE article indexing. This manuscript summarizes the BioCreative NLM-Chem track and post-challenge experiments. We received a total of 85 submissions from 17 teams worldwide. The highest performance achieved for the chemical identification task was 0.8672 F-score (0.8759 precision and 0.8587 recall) for strict NER performance and 0.8136 F-score (0.8621 precision and 0.7702 recall) for strict normalization performance. The highest performance achieved for the chemical indexing task was 0.6073 F-score (0.7417 precision and 0.5141 recall). This community challenge demonstrated that (i) the current substantial achievements in deep learning technologies can be utilized to improve automated prediction accuracy further and (ii) the chemical indexing task is substanti
SABIO-RK ( http://***/ ) is a web-accessible, manually curated database that has been established as a resource for biochemical reactions and their kinetic properties with a focus on supporting the computational model...
详细信息
SABIO-RK ( http://***/ ) is a web-accessible, manually curated database that has been established as a resource for biochemical reactions and their kinetic properties with a focus on supporting the computational modeling to create models of biochemical reaction networks. SABIO-RK data are mainly extracted from literature but also directly submitted from lab experiments. In most cases the information in the literature is distributed across the whole publication, insufficiently structured and often described without standard terminology. Therefore the manual extraction of knowledge from the literature requires biological experts to understand the paper and interpret the data. The database offers the literature data in a structured format including annotations to controlled vocabularies, ontologies and external databases which supports modellers, as well as experimentalists, in the very time consuming process of collecting information from different publications. Here we describe the data extraction and curation efforts needed for SABIO-RK and give recommendations for publishing kinetic data in a complete and structured manner.
Systems biology aims at providing a dynamic view of the living cell,tissssue, or organism, by combining experimental and theoretical methods. Advances in experimentalmethods in biochemictry are making plssible an incr...
Systems biology aims at providing a dynamic view of the living cell,tissssue, or organism, by combining experimental and theoretical methods. Advances in experimentalmethods in biochemictry are making plssible an increasingly deailed analysis of cellular ***, inmany cases the data resulting from these experiments cannot be mechanistically explainedwith the existing knowledge because of the complexity of the biochemical networks. thereforecomputer-assisted modelling, simulagion and lther mathemagical methods are used for aninterpretation of experimental data.
This paper briefly describes the SABIO-RK database model for the storage of reaction kinetics information and the guidelines followed within the SABIO-RK project to annotate the kinetic data. Such annotations support ...
详细信息
This paper briefly describes the SABIO-RK database model for the storage of reaction kinetics information and the guidelines followed within the SABIO-RK project to annotate the kinetic data. Such annotations support the definition of cross links to other related databases and augment the semantics of the data stored in the database.
暂无评论