We present a vision of genetic improvement firmly embedded in, and supported by, programming languages. Genetic improvement has already been envisioned as the next compiler, which would take human written programs as ...
详细信息
ISBN:
(纸本)9781450349390
We present a vision of genetic improvement firmly embedded in, and supported by, programming languages. Genetic improvement has already been envisioned as the next compiler, which would take human written programs as input and return versions optimised with respect to various objectives. As an intermediate stage, or perhaps to complement the fully automated vision, we imagine genetic improvement processes that are hinted at and directed by humans but understood and undertaken by programming languages and their runtimes, via interactions through the source code. We examine existing similar ideas and examine the benefits of embedding them within programming languages.
Gene regulatory networks (GRNs) have been widely used as a fundamental tool to reveal the genomic mechanisms that underlie the individual's response to environmental and developmental cues. Standard approaches inf...
详细信息
Gene regulatory networks (GRNs) have been widely used as a fundamental tool to reveal the genomic mechanisms that underlie the individual's response to environmental and developmental cues. Standard approaches infer GRNs as holistic graphs of gene coexpression, but such graphs cannot quantify how gene-gene interactions vary among individuals and how they alter structurally across spatiotemporal gradients. Here, we develop a general framework for inferring informative, dynamic, omnidirectional, and personalized networks (idopNetworks) from routine transcriptional experiments. This framework is constructed by a system of quasi-dynamic ordinary differential equations (qd0DEs) derived from the combination of ecological and evolutionary theories. We reconstruct idopNetworks using genomic data from a surgical experiment and illustrate how network structure is associated with surgical response to infrainguinal vein bypass grafting and the outcome of grafting. idopNetworks may shed light on genotype-phenotype relationships and provide valuable information for personalized medicine.
Unprecedented technological advances in single-cell RNA-sequencing (scRNA-seq) technology have now made it possible to profile genome-wide expression in single cells at low cost and high throughput. There is substanti...
详细信息
Unprecedented technological advances in single-cell RNA-sequencing (scRNA-seq) technology have now made it possible to profile genome-wide expression in single cells at low cost and high throughput. There is substantial ongoing effort to use scRNA-seq measurements to identify the “cell types” that form components of a complex tissue, akin to taxonomizing species in ecology. Cell type classification from scRNA-seq data involves the application of computational tools rooted in dimensionality reduction and clustering, and statistical analysis to identify molecular signatures that are unique to each type. As datasets continue to grow in size and complexity, computational challenges abound, requiring analytical methods to be scalable, flexible, and robust. Moreover, careful consideration needs to be paid to experimental biases and statistical challenges that are unique to these measurements to avoid artifacts. This chapter introduces these topics in the context of cell-type identification, and outlines an instructive step-by-step example bioinformatic pipeline for researchers entering this field. less
Prolog was a modern, promising and a very popular programming language thirty years ago and Logic programming acquired a firm place within Computer Science as well as in ICT programs curricula especially at European a...
详细信息
ISBN:
(纸本)9781538677117
Prolog was a modern, promising and a very popular programming language thirty years ago and Logic programming acquired a firm place within Computer Science as well as in ICT programs curricula especially at European and Asian Universities. However, the rapid development of new programming languages moved Prolog far away from the beginning of the list of the most widely utilized programming languages. Therefore the logical question is whether there are still some good and justified reasons to keep Logic programming in the present ICT specialists curricula. This paper outlines some general reasons for maintaining it on the list of the subjects taught and it brings out some basic examples showing the usefulness of the logic programming paradigm especially when explaining and teaching the efficient utilization of recursion.
Although the number of sequenced insect genomes numbers in the hundreds, little is known about gene regulatory sequences in any species other than the well-studied Drosophila melanogaster. We provide here a detailed p...
详细信息
Although the number of sequenced insect genomes numbers in the hundreds, little is known about gene regulatory sequences in any species other than the well-studied Drosophila melanogaster. We provide here a detailed protocol for using SCRMshaw, a computational method for predicting cis-regulatory modules (CRMs, also “enhancers”) in sequenced insect genomes. SCRMshaw is effective for CRM discovery throughout the range of holometabolous insects and potentially in even more diverged species, with true-positive prediction rates of 75% or better. Minimal requirements for using SCRMshaw are a genome sequence and training data in the form of known Drosophila CRMs; a comprehensive set of the latter can be obtained from the SCRMshaw download site. For basic applications, a user with only modest computational know-how can run SCRMshaw on a desktop computer. SCRMshaw can be run with a single, narrow set of training data to predict CRMs regulating a specific pattern of gene expression, or with multiple sets of training data covering a broad range of CRM activities to provide an initial rough regulatory annotation of a complete, newly-sequenced genome. less
Gene expression can be monitored in living cells via the binding of fluorescently tagged proteins to RNA repeats engineered into a reporter transcript. This approach makes it possible to trace temporal changes of RNA ...
详细信息
Gene expression can be monitored in living cells via the binding of fluorescently tagged proteins to RNA repeats engineered into a reporter transcript. This approach makes it possible to trace temporal changes of RNA production in real time in living cells to dissect transcription regulation. For a mechanistic analysis of the underlying activation process, it is essential to induce gene expression with high accuracy. Here, we describe how this can be accomplished with an optogenetic approach termed blue light-induced chromatin recruitment (BLInCR). It employs the recruitment of an activator protein to a target promoter via the interaction between the PHR and CIBN plant protein domains. This process occurs within seconds after setting the light trigger and is reversible. Protocols for continuous activation as well as pulsed activation and reactivation with imaging either by laser scanning confocal microscopy or automated widefield microscopy are provided. For the semiautomated quantification of the resulting image series, an approach has been implemented in a set of scripts in the R programming language. Thus, the complete workflow of the BLInCR method is described for mechanistic studies of the transcription activation process as well as the persistence and memory of the activated state. less
ML-Rules is a rule-based language for multi-level modeling and simulation. ML-Rules supports dynamic nesting of entities and applying arbitrary functions on entity attributes and content, as well as for defining kinet...
详细信息
ML-Rules is a rule-based language for multi-level modeling and simulation. ML-Rules supports dynamic nesting of entities and applying arbitrary functions on entity attributes and content, as well as for defining kinetics of reactions. This allows describing and simulating complex cellular dynamics operating at different organizational levels, e.g., to combine intra-, inter-, and cellular dynamics, like the proliferation of cells, or to include compartmental dynamics like merging and splitting of mitochondria or endocytosis. The expressiveness of the language is bought with additional efforts in executing ML-Rules models. Therefore, various simulators have been developed from which the user and automatic procedures can select. The experiment specification language SESSL facilitates design, execution, and reuse of simulation experiments. The chapter illuminates the specific features of ML-Rules as a rule-based modeling language, the implications for an efficient execution, and shows ML-Rules at work. less
Graphics Processing Units (GPUs) perform the majority of computations in state-of-the-art supercomputers. programming these GPUs is often assisted using a programming model such as (amongst others) the directive-drive...
详细信息
ISBN:
(纸本)9783319699530;9783319699523
Graphics Processing Units (GPUs) perform the majority of computations in state-of-the-art supercomputers. programming these GPUs is often assisted using a programming model such as (amongst others) the directive-driven OpenACC. Unfortunately, OpenACC (and other similar models) are incapable of automatically targeting and distributing work across several GPUs, which decreases productivity and forces needless manual labor upon programmers. We propose a method that enables OpenACC applications to target multi-GPU. Workload distribution, data transfer and inter-GPU communication (including modern GPU-to-GPU links) are automatically and transparently handled by our compiler with no user intervention and no changes to the program code. Our method leverages existing OpenMP and OpenACC backends, ensuring easy integration into existing HPC infrastructure. Empirically we quantify performance gains and losses in our data coherence method compared to similar approaches, and also show that our approach can compete with the performance of hand-written MPI code.
Untargeted metabolite profiling based upon LC-MS methodology can be used to identify unique metabolic phenotypes associated with stress, disease or environmental exposure of cells using mathematical clustering. Here, ...
详细信息
Untargeted metabolite profiling based upon LC-MS methodology can be used to identify unique metabolic phenotypes associated with stress, disease or environmental exposure of cells using mathematical clustering. Here, we show how unsupervised data analysis is a powerful tool for both quality control and answering simple biological questions. We will demonstrate how to format untargeted mass spectrometry data for import into R, a programming language and software environment for statistical computing (R Development Core Team. R: A language and environment for statistical computing, reference index version 2.15. R Foundation for Statistical Computing, Vienna, 2012). Using R, we transform untargeted metabolite data using hierarchical clustering and principal component analysis (PCA) to create visual representations of change between biological samples and explore how these can be used predictively, in determining environmental stress, health and metabolic insight. less
The subcellular localization of proteins is a posttranslational modification of paramount importance. The ability to study subcellular and organelle proteomes improves our understanding of cellular homeostasis and cel...
详细信息
The subcellular localization of proteins is a posttranslational modification of paramount importance. The ability to study subcellular and organelle proteomes improves our understanding of cellular homeostasis and cellular dynamics. In this chapter, we describe a protocol for the unbiased and high-throughput study of protein subcellular localization in the yeast Saccharomyces cerevisiae: hyperplexed localization of organelle proteins by isotope tagging (hyperLOPIT), which involves biochemical fractionation of Saccharomyces cerevisiae and high resolution mass spectrometry-based protein quantitation using TMT 10-plex isobaric tags. This protocol enables the determination of the subcellular localizations of thousands of proteins in parallel in a single experiment and thereby deep sampling and high-resolution mapping of the spatial proteome. less
暂无评论