Quality assurance (QA) is an important task in manufacturing to assess whether products meet their specifications. However, QA might be expensive, time-consuming, or incomplete. This paper presents a solution for pred...
详细信息
Quality assurance (QA) is an important task in manufacturing to assess whether products meet their specifications. However, QA might be expensive, time-consuming, or incomplete. This paper presents a solution for predictive analytics in QA based on machine sensor values during production while employing specialized machine-learning models for classification in a controlled environment. Furthermore, we present lessons learned while implementing this model, which helps to reduce complexity in further industrial applications. The paper’s outcome proves that the developed model was able to predict product quality, as well as to identify the correlation between machine-status and faulty product occurrence.
PyKEEN is a framework, which integrates several approaches to compute knowledge graph embeddings (KGEs). We demonstrate the usage of PyKEEN in an biomedical use case, i.e. we trained and evaluated several KGE models o...
详细信息
PyKEEN is a framework, which integrates several approaches to compute knowledge graph embeddings (KGEs). We demonstrate the usage of PyKEEN in an biomedical use case, i.e. we trained and evaluated several KGE models on a biological knowledge graph containing genes’ annotations to pathways and pathway hierarchies from well-known databases. We used the best performing model to predict new links and present an evaluation in collaboration with a domain expert*. Copyright 2019 for this paper by its authors.
Biological and medical researchers explore the mechanisms of living organisms and tend to gain a better understanding of underlying fundamental biological processes of life. To tackle such complex tasks they constantl...
详细信息
Relevance feedback for document retrieval systems is a technique where user feedback is used to improve a query response. In this work we propose a system that uses multiple clusterings and a semi-supervised heuristic...
详细信息
This volume of LNCSE is a collection of the papers from the proceedings of the third workshop on sparse grids and applications. Sparse grids are a popular approach for the numerical treatment of high-dimensional probl...
ISBN:
(纸本)9783319803098
This volume of LNCSE is a collection of the papers from the proceedings of the third workshop on sparse grids and applications. Sparse grids are a popular approach for the numerical treatment of high-dimensional problems. Where classical numerical discretization schemes fail in more than three or four dimensions, sparse grids, in their different guises, are frequently the method of choice, be it spatially adaptive in the hierarchical basis or via the dimensionally adaptive combination technique. Demonstrating once again the importance of this numerical discretization scheme, the selected articles present recent advances on the numerical analysis of sparse grids as well as efficient data structures. The book also discusses a range of applications, including uncertainty quantification and plasma physics.
The formulation of transport network problems is represented as a translation between two domain specific languages: From a network description language, used by network simulation community, to a problem description ...
详细信息
Background Data collected in cohort studies lay the groundwork for a plethora of Alzheimer’s disease (AD) research endeavours. While there exist numerous cohort datasets in our field, few dedicated efforts have focus...
Background Data collected in cohort studies lay the groundwork for a plethora of Alzheimer’s disease (AD) research endeavours. While there exist numerous cohort datasets in our field, few dedicated efforts have focused on comparing these datasets to each other. In recent years, large research consortia, such as EMIF and ROADMAP, were formed to investigate and evaluate the AD data landscape. To the best of our knowledge, such previous approaches have solely investigated cohorts at the metadata level without considering the cohort dataset itself. In this work, we explored eight major AD cohort datasets with the aim of 1) characterizing their underlying data, 2) assessing the quantity and availability of data, 3) evaluating the interoperability across distinct cohort datasets, and 4) presenting our findings in an interactive web application. Thus, we allow researchers to investigate the AD data landscape and find the most suited datasets for their research. Method We collected data from multiple major AD cohort studies. Through intensive manual curation of these datasets, we identified which data modalities can be found in the shared data. Using descriptive statistics and visualizations, we highlighted differences and similarities between cohorts focusing on key demographic characteristics and AD biomarkers. Finally, we assessed and compared the longitudinal follow-up data for each study. Result We show that demographics can vary substantially across cohorts (Table 1). The overlap between datasets is limited with regard to assessed measurements (Figure 1). Additionally, distributions of the same variables often differ across cohorts due to differences in processing and normalization; thus, hampering dataset interoperability. By investigating longitudinal follow-up data, we demonstrate that although there is reasonable longitudinal coverage on cognitive assessments, important biomarkers were only collected for a small fraction of study participants over time (Figure 2).
A lot of problems in natural language processing can be interpreted using structures from discrete mathematics. In this paper we will discuss the search query and topic finding problem using a generic context-based ap...
详细信息
A lot of problems in natural language processing can be interpreted using structures from discrete mathematics. In this paper we will discuss the search query and topic finding problem using a generic context-based approach. This problem can be described as a Minimum Set Cover Problem with several constraints. The goal is to find a minimum covering of documents with the given context for a fixed weight function. The aim of this problem reformulation is a deeper understanding of both the hierarchical problem using union and cut as well as the non-hierarchical problem using the union. We thus choose a modeling using bipartite graphs and suggest a novel reformulation using an integer linear program as well as novel graph-theoretic approaches.
Parkinson’s disease (PD) exhibits a variety of symptoms, with approximately 25% of patients experiencing mild cognitive impairment and 45% developing dementia within ten years of diagnosis. Predicting this progressio...
Alkaline methanol oxidation is an electrochemical process, perspective for the design of efficient high energy density fuel cells. The process involves a large number of elementary reactions, forming a complex reactio...
详细信息
暂无评论