European laws on privacy and data security are not explicit about the storage and processing of genetic data. Especially whole-genome data is identifying and contains a lot of personal information. Is processing of su...
详细信息
To enable discovery in large, heterogenious information networks a tool is needed that allows exploration in changing graph structures and integrates advanced graph mining methods in an interactive visualization frame...
详细信息
Computational neuroscience is a new field of research in which neurodegenerative diseases are studied with the aid of new imaging techniques and computation facilities. Researchers with different expertise collaborate...
详细信息
MicroRNAs can regulate hundreds of target genes and play a pivotal role in a broad range of biological process. However, relatively little is known about how these highly connected miRNAs-target networks are remodelle...
详细信息
MicroRNAs can regulate hundreds of target genes and play a pivotal role in a broad range of biological process. However, relatively little is known about how these highly connected miRNAs-target networks are remodelled in the context of various diseases. Here we examine the dynamic alteration of context-specific miRNA regulation to determine whether modified microRNAs regulation on specific biological processes is a useful information source for predicting cancer prognosis. A new concept, Context-specific miRNA activity (CoMi activity) is introduced to describe the statistical difference between the expression level of a miRNA's target genes and non-targets genes within a given gene set (context).
This paper represents a technique, applying user action patterns in order to distinguish between users and identify them. In this method, users' actions sequences are mapped to numerical sequences and each user...
详细信息
The rapid burgeoning of available data in the form of categorical sequences, such as biological sequences, natural language texts, network and retail transactions, makes the classification of categorical sequences inc...
详细信息
The rapid burgeoning of available data in the form of categorical sequences, such as biological sequences, natural language texts, network and retail transactions, makes the classification of categorical sequences inc...
详细信息
The rapid burgeoning of available data in the form of categorical sequences, such as biological sequences, natural language texts, network and retail transactions, makes the classification of categorical sequences increasingly important. The main challenge is to identify significant features hidden behind the chronological and structural dependencies characterizing their intrinsic properties. Almost all existing algorithms designed to perform this task are based on the matching of patterns in chronological order, but categorical sequences often have similar features in non-chronological order. In addition, these algorithms have serious difficulties in outperforming domain-specific algorithms. In this paper we propose CLASS, a general approach for the classification of categorical sequences. By using an effective matching scheme called SPM for Significant Patterns Matching, CLASS is able to capture the intrinsic properties of categorical sequences. Furthermore, the use of Latent Semantic Analysis allows capturing semantic relations using global information extracted from large number of sequences, rather than comparing merely pairs of sequences. Moreover, CLASS employs a classifier called SNN for Significant Nearest Neighbours, inspired from the K Nearest Neighbours approach with a dynamic estimation of K, which allows the reduction of both false positives and false negatives in the classification. The extensive tests performed on a range of datasets from different fields show that CLASS is oftentimes competitive with domain-specific approaches.
The characteristic framework types of zeolite crystals are routinely determined by calculating coordination sequences and vertex symbols of the 3D crystal structures. This method has limitations and tends to fail when...
详细信息
ISBN:
(纸本)9781601321091
The characteristic framework types of zeolite crystals are routinely determined by calculating coordination sequences and vertex symbols of the 3D crystal structures. This method has limitations and tends to fail when the synthesized crystals are not close to perfect and present some types of crystallographic disorder. A machine learning based Zeolite-Structure-Predictor (ZSP) model is developed to predict framework types for both near perfect and moderately disordered zeolite crystals. The ZSP uses various attributes, including topological descriptors based on a computational geometry approach and relevant physical, chemical properties of the crystals. Trained with 41 framework types, the ZSP can correctly classify zeolite crystals with over 98% accuracy. Additionally, it is shown that the ZSP model is able to predict the framework types for strongly disordered zeolite crystals with reliable success rate.
A machine learning approach is applied to classify zeolite crystals according to their framework type. The Zeolite-Structure-Predictor is introduced based on the Random Forest algorithm. Zeolites structural data from ...
详细信息
ISBN:
(纸本)1601320620
A machine learning approach is applied to classify zeolite crystals according to their framework type. The Zeolite-Structure-Predictor is introduced based on the Random Forest algorithm. Zeolites structural data from the Inorganic Crystal Structure database (ICSD) are used to train the model. The ZSP uses sixteen attributes including topological descriptors obtained with statistical geometry and physical and chemical properties of individual zeolites. Trained with 40 framework types containing at least 5 instances per class, the ZSP can correctly classify zeolites with over 95% accuracy. The performance is shown to improve when more zeolite instances per class are available.
暂无评论