Chemical reactions occur in cells for survival and adaptation to various conditions. After these chemical reactions, the reactants and products are often sequentially modified through metabolic pathways. In this study...
Chemical reactions occur in cells for survival and adaptation to various conditions. After these chemical reactions, the reactants and products are often sequentially modified through metabolic pathways. In this study, we defined new features to evaluate the possibility of such inferred metabolic pathways. We focused on the main chain structure of a compound as a non-directional graph, and developed a method to define the similarity between these main chain structure graphs. In this study, we defined four features: 1) the number of main chain graph nodes, 2) the graphical density of the main chain graph, 3) the chemical density of the main chain, and 4) the graph centrality of the reaction group in the main chain graph. We defined the main chain structures of about 16,000 chemical compounds, and calculated the values of the four features by the defined equations for each compound. Finally, we calculated the correlation coefficients between all chemical compound pairs from the four defined features. A comparison of the similarities of the main chain graphs between known chemical reactions revealed that our defined features are suitable for detecting the possible reactions.
作者:
Murtagh, FionnBig Data Lab
Department of Computer Science and Mathematics University of Derby DerbyDE22 1GB United Kingdom Department of Computing
Goldsmiths University of London LondonSW14 6NW United Kingdom
As a platform for unsupervised data mining and pattern recognition, we use Correspondence Analysis on Twitter content from May to December 2015. The following data characteristics are well addressed: exponentially dis...
详细信息
As a platform for unsupervised data mining and pattern recognition, we use Correspondence Analysis on Twitter content from May to December 2015. The following data characteristics are well addressed: exponentially distributed data properties, and major imbalance between categories. Contextualization is supported. To both focus on informative resolution scale in one's data, and to handle large data sets, the granularity of point clouds offers benefits.
The CL-SciSumm 2016 Shared Task is the first medium-scale shared task on scientific document summarization in the computational linguistics (CL) domain. The task built off of the experience and training data set creat...
详细信息
The CL-SciSumm 2016 Shared Task is the first medium-scale shared task on scientific document summarization in the computational linguistics (CL) domain. The task built off of the experience and training data set created in its namesake pilot task, which was conducted in 2014 by the same organizing committee. The track included three tasks involving: (1A) identifying relationships between citing documents and the referred document, (1B) classifying the discourse facets, and (2) generating the abstractive summary. The dataset comprised 30 annotated sets of citing and reference papers from the open access research papers in the CL domain. This overview paper describes the participation and the official results of the second CL-SciSumm Shared Task, organized as a part of the Joint Workshop onBibliometric-enhanced Information Retrieval and Natural Language Processing for Digital Libraries (BIRNDL 2016), held in New Jersey,USA in June, 2016. The annotated dataset used for this shared task and the scripts used for evaluation can be accessed and used by the community at: https://***/WING-NUS/scisumm-corpus.
Event stream dissemination dominates the workloads in large-scale Online Social Network (OSN) systems. Based on the de facto per-user view data storage, event stream dissemination raises a large amount of inter-server...
详细信息
ISBN:
(纸本)9781509032822
Event stream dissemination dominates the workloads in large-scale Online Social Network (OSN) systems. Based on the de facto per-user view data storage, event stream dissemination raises a large amount of inter-server traffics due to the complex interconnection among OSN users. The state-of-the-art schemes mainly explore the structure features of social graphs to reduce the inter-server messages for event stream dissemination. Different sub-graph structures are exploited for achieving the approximated optimal assignment. However, such schemes incur high costs of computation or communication. In this work, we follow a different design philosophy by using a game theoretic approach, which decomposes the high complex graph computation problem into individuals' rational strategy selection of each node. Specifically, we propose a novel social piggyback game to achieve a more efficient solution. We mathematically prove the existing of the Nash Equilibrium of the social piggyback game. Moreover, we propose an efficient best response dynamic algorithm to achieve the Nash Equilibrium, which quickly converges in a small number of iterations for large-scale OSNs. We further show that the communication cost of this design achieves a 1.5-approximation of the theoretical social optimal. We conduct comprehensive experiments to evaluate the performance of this design using large-scale real-world traces from popular OSN systems. Results show that the social piggyback game achieves a significant 302× improvement in system efficiency compared to existing schemes.
This paper comes up with a SDN based On-Demand Routing Protocol, SVAO, which separates data forwarding layer and network control layer, as in SDN, to enhance the data transmission efficiency within VANETs. The Roadsid...
详细信息
ISBN:
(纸本)9781509056972
This paper comes up with a SDN based On-Demand Routing Protocol, SVAO, which separates data forwarding layer and network control layer, as in SDN, to enhance the data transmission efficiency within VANETs. The Roadside Service Unit plays the role of Local Controller and is in charge of selecting vehicles to forward packet within a road segment. All the vehicles state in the road. Correspondingly, a two-level design is used. Global Level is distributed and adopts a ranked query scheme to collect vehicle information and determine the road segments along which a message should be forwarded. And the Local Level is in charge of selecting forwarding vehicles in each road segment determined by the global level. We compare SVAO with popular ad-hoc network routing protocols, including OLSR, DSR, DSDV, and DB via simulations. We consider the impact of vehicle density, speed on data transmission rate and average packet delay. The simulation results show that in case of large network scales or high vehicle speed, SVAO performs better than the others.
The frequent directions (FD) technique is a deterministic approach for online sketching that has many applications in machine learning. The conventional FD is a heuristic procedure that often outputs rank deficient ma...
详细信息
User information sharing is an important behavior in online social networks. Understanding such behavior could help in various applications such as user modeling, information cascade analysis, viral marketing, etc. In...
详细信息
ABSTRACTIntroductionClusters of subclonal mutations attributed to subclonal expansions have been observed across most cancer types. To date it is not clear what fraction of these expansions can be ascribed to selectiv...
ABSTRACTIntroductionClusters of subclonal mutations attributed to subclonal expansions have been observed across most cancer types. To date it is not clear what fraction of these expansions can be ascribed to selective sweeps or genetic drifts. The widely held theory states that tumour expansion arises from a series of selective sweeps. However, Williams et al. (Nat. Genet. 48:238–224, 2016) recently claimed to have identified neutral tumour evolution in one third of 904 samples from The Cancer Genome Atlas (TCGA). Material and methodsFirst, using the same equations we simulated data of tumour undergoing selective sweeps. Second, we simulated data using branching processes, i.e. more realistic models. Third, we analysed the mutations from ~1600 exomes and ~1500 whole genomes of real tumours from TCGA and the International Cancer Genome Consortium, respectively. We grouped them into neutral and non-neutral tumours according to the rationale described by Williams et al. We then applied dN/dS, an orthogonal widely used approach to detect selection in non-synonymous mutations of coding regions. Finally, we ran dN/dS on subclonal mutations within individual cancer types. Results and discussionsApplication of the method described by Williams et al. led to over-calling of neutrality and led to random predictions (area under the curve~=50%) in simulations based on the deterministic or stochastic models, respectively. dN/dS analyses revealed significant and widespread positive selection in clonal and subclonal mutations within both neutral and non-neutral groups. Despite power limits, the subclonal mutations of many individual cancer types displayed dN/dS>1, evidence for positive selection. ConclusionAltogether, our results point to the lack of evidence for neutral tumour evolution and uncovered strong positive selection within subclonal mutations across cancers.
The watermarking technique can be used to protect the ownership of relational databases by hiding some ownership information into the relational databases. In this paper, a novel robust reversible watermarking scheme ...
详细信息
暂无评论