In the era of big data, knowledge engineering faces fundamental challenges induced by fragmented knowledge from heterogeneous, autonomous sources with complex and evolving relationships. The knowledge representation, ...
Patient acquisition of carbapenem-resistant bacteria in hospitals is a serious problem that leads to adverse outcomes for infected patients. The most common carbapenem resistance mechanism in US hospitals is a mobile ...
Computer science offers a large set of tools for prototyping, writing, running, testing, validating, sharing and reproducing results; however, computational science lags behind. In the best case, authors may provide th...
Supernova (SN) 2016bdu is an unusual transient resembling SN 2009ip. SN 2009ip-like events are characterized by a long-lasting phase of erratic variability which ends with two luminous outbursts a few weeks apart. The...
Categorical data are ubiquitous in real-world databases. However, due to the lack of an intrinsic proximity measure, many powerful algorithms for numerical data analysis may not work well on their categorical counterp...
ISBN: (print) 9781479975617
Recent developments in gene sequencing technology have greatly accelerated the discovery of mutations that cause various genetic disorders. At the same time, a typical sequencing experiment generates a large number of candidate mutations, so detecting a single or a few causative variants is still a formidable problem. Many computational methods have been proposed to assist this process, a large portion of which employ statistical learning in some form. Consequently, each newly designed algorithm is routinely compared to competing systems in the hope of demonstrating advantageous performance. In this work we review and discuss several issues related to the current practice of evaluating mutation prioritization algorithms and suggest possible directions for improvement.
We study the problem of rank aggregation: given a set of ranked lists, we want to form a consensus ranking. Furthermore, we consider the case of extreme lists, i.e., lists in which only the ranks of the best or worst elements are known. We impute missing ranks and generalise Spearman's ρ to extreme ranks. Our main contribution is the derivation of a non-parametric estimator for rank aggregation based on multivariate extensions of Spearman's ρ, which measures correlation between a set of ranked lists. Multivariate Spearman's ρ is defined using copulas, and we show that the geometric mean of normalised ranks maximises multivariate correlation. Motivated by this, we propose a weighted geometric mean approach for learning to rank which has a closed-form least squares solution. When only the best (top-k) or worst (bottom-k) elements of a ranked list are known, we impute the missing ranks by the average value, allowing us to apply Spearman's ρ. We discuss an optimistic and a pessimistic imputation of missing values, which respectively maximise and minimise correlation, and show their effect on aggregating university rankings. Finally, we demonstrate good performance on the rank aggregation benchmarks MQ2007 and MQ2008.
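The two ingredients named in this abstract, average-value imputation for top-k lists and an unweighted geometric mean of normalised ranks, can be illustrated with a small sketch. This is not the authors' estimator: the function names `impute_top_k` and `aggregate` are hypothetical, the normalisation u = (n - r + 1) / (n + 1) is an assumption chosen so that better ranks receive larger scores in (0, 1), and the learned weights of the weighted variant are omitted.

```python
import math


def impute_top_k(ranking, items):
    """Map each item to a rank; items missing from a top-k list
    share the average of the unused ranks k+1..n."""
    n, k = len(items), len(ranking)
    ranks = {item: pos + 1 for pos, item in enumerate(ranking)}
    if k < n:
        avg_missing = (k + 1 + n) / 2  # mean of the ranks k+1, ..., n
        for item in items:
            ranks.setdefault(item, avg_missing)
    return ranks


def aggregate(rankings, items):
    """Consensus ranking by the geometric mean of normalised ranks.

    Assumed normalisation (not taken from the abstract):
    u = (n - r + 1) / (n + 1), so rank 1 gets the largest score.
    """
    n = len(items)
    imputed = [impute_top_k(r, items) for r in rankings]
    scores = {}
    for item in items:
        # Sum logs instead of multiplying to avoid underflow on many lists.
        log_sum = sum(math.log((n - ranks[item] + 1) / (n + 1)) for ranks in imputed)
        scores[item] = math.exp(log_sum / len(imputed))
    consensus = sorted(items, key=lambda it: scores[it], reverse=True)
    return consensus, scores


items = ["a", "b", "c", "d"]
top2_lists = [["a", "b"], ["a", "c"], ["b", "a"]]  # three top-2 lists
consensus, scores = aggregate(top2_lists, items)
print(consensus)
```

With four items and top-2 lists, every unlisted item receives the imputed rank 3.5, the average of ranks 3 and 4. Item "d", which appears in no list, therefore ends up last in the consensus.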
Following the publication of the US National Research Council (NRC) report "Toward Precision Medicine: Building a Knowledge Network for Biomedical Research and a New Taxonomy of Diseases" in 2011 [1], several nations have announced that their national research programs would definitely head in this direction. Now,
The discovery of the first electromagnetic counterpart to a gravitational wave signal has generated follow-up observations by over 50 facilities world-wide, ushering in the new era of multi-messenger astronomy. In thi...