检索结果-内蒙古大学图书馆

71 Genomic regions flanking E-box binding sites influence DNA binding specificity of bHLH transcription factors through DNA shape

引用

Journal of Biomolecular Structure and Dynamics 2013年第SUPP 1期31卷 45-46页

作者： Raluca Gordan[a][b][*] Ning Shen[c] Iris Dror[d] Tianyin Zhou[d] John Horton[b] Remo Rohs[d] & Martha L. Bulyk[e][f] [a] Departments of Biostatistics and Bioinformatics Computer Science and Molecular Genetics and Microbiology Duke University Durham NC USA [b] Institute for Genome Sciences and Policy Duke University Durham NC 27708 USA Phone: Phone: +1 (919) 667-8673 Fax: Phone: +1 (919) 667-8673 [c] Department of Pharmacology and Cancer Biology Duke University Durham NC 27708 USA [d] Molecular and Computational Biology Program Departments of Biological Sciences Chemistry and Physics and Astronomy University of Southern California 1050 Childs Way Los Angeles CA 90089 USA [e] Division of Genetics Departments of Medicine and Pathology Brigham and Women’s Hospital and Harvard Medical School Boston MA 02115 USA [f] Harvard-MIT Division of Health Sciences and Technology (HST) Harvard Medical School Boston MA 02115 USA

Transcriptional regulation of gene expression is enacted mainly through binding of transcription factors (TFs) to specific, short DNA sites in cis-regulatory regions of genes. Most TFs are members of protein families that share a common DNA-binding domain and thus recognize similar DNA-binding sequences. It is not well understood why paralogous TFs often bind different genomic target sitesin vivoto effect different regulatory programs, despite apparently recognizing the same sequence motifs. Here, we designed custom protein-binding microarrays (PBMs) to analyze the DNA-binding specificities of twoSaccharomyces cerevisiaebasic helix-loop-helix (bHLH) proteins, Tye7 and Cbf1, as a model system. Our data reveal that E-box DNA-binding sequences (CAnnTG), when tested in the context of their native genomic flanking sequences, are bound differently by Cbf1 and Tye7. Computational models of the PBM data indicate that DNA sequence features located in the genomic sequences outside the E-box contribute to DNA-binding specificityin vitro. Our analyses suggest that these flanking regions affect DNA-binding specificity indirectly by influencing the three-dimensional structure of the E-box binding sites. Finally, we show that these subtle differences in intrinsic sequence preferences of Cbf1 and Tye7in vitrohelp to explain their differential DNA-binding preferencesin vivo. Our results provide further evidence that the local shape of DNA-binding sites may be an important feature in distinguishing the DNA-binding preferences among paralogous TFs and thus may play a widespread role in determining how transcriptional regulatory specificity within TF families is achieved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Secondary structure predictions for long RNA sequences based on inversion excursions - Preliminary results 12

Secondary structure predictions for long RNA sequences based...

引用

2012 ACM Conference on bioinformatics, Computational Biology and Biomedicine, BCB 2012

作者： Yehdego, Daniel T. Zhang, Boyu Taufer, Michela Kodimala, Vikram Kumar Reddy Vegesna, Rahulsimham Viswakula, Sameera Johnson, Kyle L. Leung, Ming-Ying Computational Science Program University of Texas El Paso United States Department of Computer and Information Sciences University of Delaware United States Bioinformatics Program Border Biomedical Research Center University of Texas El Paso United States Department of Mathematical Sciences University of Texas El Paso United States Department of Biological Sciences University of Texas El Paso United States Department of Mathematics and Statistics Old Dominion University Norfolk VA United States University of Texas M. D. Anderson Cancer Center Houston TX United States

ISBN: (纸本)9781450316705

The tremendous demand on computer memory and computing time for prediction of complex secondary structures limits the applicability of most RNA secondary structure prediction programs available to short RNA sequences. We propose to approach this problem by segmenting a long RNA sequence into shorter non-overlapping chunks, predicting the secondary structures of each chunk individually, and then assembling the prediction results to give the structure of the original sequence. The selection of cutting points is a crucial component of the approach. Noting that stem-loops and pseudoknots always contain an inversion, we developed two cutting methods, the centered and optimized methods, for segmenting long RNA sequences based on inversion excursions. For the majority of the sequences in a dataset of 50 RNAs from the RFAM database, the prediction algorithm PKnotsRG used with these cutting methods produces more accurate secondary structures than those predicted for the whole sequence without segmentation. Both the centered and optimized cutting methods outperform the naïve regular segmentation. These results support our claim that cutting is a promising approach for the prediction of long RNA sequences, and choosing the cutting points intelligently by considering sequence features such as inversion excursions can further enhance prediction accuracy.

关键词： Forecasting

来源：评论

学校读者我要写书评

暂无评论

Corrigendum to “Weight loss intervention for young adults using mobile technology: Design and rationale of a randomized controlled trial — Cell phone Intervention for You (CITY)” [Contemp Clin Trials 37/2 (2014) 333–341]

引用

Contemporary Clinical Trials 2014年第2期39卷 351-351页

作者： Bryan C. Batch Crystal Tyson Jacqueline Bagwell Leonor Corsino Stephen Intille Pao-Hwa Lin Tony Lazenka Gary Bennett Hayden B. Bosworth Corrine Voils Steven Grambow Aziza Sutton Rachel Bordogna Matthew Pangborn Jenifer Schwager Kate Pilewski Carla Caccia Jasmine Burroughs Laura P. Svetkey Department of Medicine Division of Endocrinology Duke University Medical Center DUMC Box 3921 Durham NC 27710 USA Sarah W. Stedman Nutrition and Metabolism Center 3475 Erwin Road Duke University Medical Center Durham NC 27710 USA Department of Medicine Division of Nephrology Duke University Medical Center DUMC Box 103105 Durham NC 27710 USA College of Computer and Information Science Northeastern University 202 West Village H Office 450 360 Huntington Avenue Boston MA 02115 USA Bouvé College of Health Sciences Northeastern University 202 West Village H Office 450 360 Huntington Avenue Boston MA 02115 USA Department of Psychology & Neuroscience Duke University Medical Center Box 90086 417 Chapel Drive Duke University Durham NC 27708-0086 USA Duke Obesity Prevention Program Duke University Medical Center Durham NC USA Duke Global Health Institute Duke University Medical Center 310 Trent Drive Durham NC 27710 USA Center for Health Services Research in Primary Care Durham Veterans Affairs Medical Center 508 Fulton Street Durham NC 27705 USA Department of Medicine Division of General Internal Medicine Duke University Medical Center Box 3240 Durham NC 27710 USA Department of Psychiatry Duke University Medical Center 2301 Erwin Road Durham NC 27710 USA Duke University School of Nursing 307 Trent Drive DUMC 3322 Durham NC 27710 USA Department of Biostatistics and Bioinformatics Duke University Medical Center DUMC Box 2721 Durham NC 27710 USA

来源：评论

学校读者我要写书评

暂无评论

Mapping short sequencing reads to distant relatives 11

Mapping short sequencing reads to distant relatives

引用

2011 ACM Conference on bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011

作者： Reynoso, Vinicio Putonti, Catherine Department of Biology Bioinformatics Program Loyola University Chicago 1032 W Sheridan Rd. Chicago IL 60660 United States Department of Computer Science Bioinformatics Program Loyola University Chicago 1032 W Sheridan Rd. Chicago IL 60660 United States

ISBN: (纸本)9781450307963

Numerous different algorithmic approaches have been developed to map the short-reads produced by next-generation sequencing technologies onto reference genome sequences. When sufficiently close reference genomes do not exist, less rigorous approaches must be taken, as is the case for analysis of diverse environmental samples. We have developed a new suite of data structures and algorithms specifically for the mapping of reads from environmental sequencing projects. A pipeline was developed which can rigorously map reads to genomes with many mismatches between the two. Using 50+ million reads generated from soil samples, we present the results of our performance analysis of our approach. Copyright © 2011 ACM.

关键词： Mapping

来源：评论

学校读者我要写书评

暂无评论

Active clustering of biological sequences

The Journal of Machine Learning Research

引用

The Journal of Machine Learning Research 2012年第1期13卷

作者： Konstantin Voevodski Maria-Florina Balcan Heiko Röglin Shang-Hua Teng Yu Xia Google New York NY College of Computing Georgia Institute of Technology Atlanta GA Department of Computer Science University of Bonn Bonn Germany Computer Science Department University of Southern California Los Angeles CA Bioinformatics Program and Department of Chemistry Boston University Boston MA

Given a point set S and an unknown metric d on S, we study the problem of efficiently partitioning S into k clusters while querying few distances between the points. In our model we assume that we have access to one versus all queries that given a point s ∈ S return the distances between s and all other points. We show that given a natural assumption about the structure of the instance, we can efficiently find an accurate clustering using only O(k) distance queries. Our algorithm uses an active selection strategy to choose a small set of points that we call landmarks, and considers only the distances between landmarks and other points to produce a clustering. We use our procedure to cluster proteins by sequence similarity. This setting nicely fits our model because we can use a fast sequence database search program to query a sequence against an entire data set. We conduct an empirical study that shows that even though we query a small fraction of the distances between the points, we produce clusterings that are close to a desired clustering given by manual classification.

关键词： active clustering approximation algorithms approximation stability clustering clustering accuracy k-median protein sequences

来源：评论

学校读者我要写书评

暂无评论

Evaluating the quality of conformation sampling methods using experimental residual dipolar coupling data 11

Evaluating the quality of conformation sampling methods usin...

引用

2011 ACM Conference on bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011

作者： Lin, Tu-Liang Vammi, Santhosh Kumar Song, Guang Computer Science Department Iowa State University Ames IA 50010 United States Program of Bioinformatics and Computational Biology Iowa State University Ames IA 50010 United States

ISBN: (纸本)9781450307963

Many computational approaches have been developed and used for sampling protein conformations near the native state. However, it has been difficult to evaluate the quality of the conformations sampled or to compare them among the various sampling schemes. In this work, we develop a novel method for evaluating the quality of conformation ensembles and apply it to evaluate ubiquitin conformations generated from four widely-used conformation sampling approaches, namely, MD simulation, Elastic Network Model (ENM), CONCOORD, and *** choose ubiquitin because there exists abundant experimental residual dipolar coupling (RDC) data on this protein. RDC data contains rich ensemble-averaged information about a given protein and thus provide tight constraints that can be used for probing what conformations should make up the protein ensemble. Our results demonstrate that the conformations generated by MD simulations are the best among all sampling methods. Specifically, MD simulation performs significantly better than the other methods in capturing the side chain motions. The backbone flexibility modeled and sampled by tCONCOORD comes quite close, with CONCOORD and ENM trailing behind. Copyright © 2011 ACM.

关键词： Conformations

来源：评论

学校读者我要写书评

暂无评论

Modularity detection in protein-protein interaction networks

引用

BMC Research Notes 2011年第1期4卷 1-6页

作者： Narayanan, Tejaswini Gersten, Merril Subramaniam, Shankar Grama, Ananth Department of Electrical and Computer Engineering University of California San Diego United States Graduate Program in Bioinformatics and Systems Biology University of California San Diego United States Department of Bioengineering University of California San Diego United States Department of Computer Science Purdue University West Lafayette IN United States

Background: Many recent studies have investigated modularity in biological networks, and its role in functional and structural characterization of constituent biomolecules. A technique that has shown considerable promise in the domain of modularity detection is the Newman and Girvan (NG) algorithm, which relies on the number of shortest-paths across pairs of vertices in the network traversing a given edge, referred to as the betweenness of that edge. The edge with the highest betweenness is iteratively eliminated from the network, with the betweenness of the remaining edges recalculated in every iteration. This generates a complete dendrogram, from which modules are extracted by applying a quality metric called modularity denoted by Q. This exhaustive computation can be prohibitively expensive for large networks such as Protein-Protein Interaction Networks. In this paper, we present a novel optimization to the modularity detection algorithm, in terms of an efficient termination criterion based on a target edge betweenness value, using which the process of iterative edge removal may be terminated. Results: We validate the robustness of our approach by applying our algorithm on real-world protein-protein interaction networks of Yeast, *** and Drosophila, and demonstrate that our algorithm consistently has significant computational gains in terms of reduced runtime, when compared to the NG algorithm. Furthermore, our algorithm produces modules comparable to those from the NG algorithm, qualitatively and quantitatively. We illustrate this using comparison metrics such as module distribution, module membership cardinality, modularity Q, and Jaccard Similarity Coefficient. Conclusions: We have presented an optimized approach for efficient modularity detection in networks. The intuition driving our approach is the extraction of holistic measures of centrality from graphs, which are representative of inherent modular structure of the underlying network, and the applic

关键词： Biological Network Jaccard Index Input Network High Betweenness Jaccard Similarity Coefficient

来源：评论

学校读者我要写书评

暂无评论

Prediction of mitochondrial proteins of malaria parasite using improved hybrid method and reduced amino acid alphabet

Prediction of mitochondrial proteins of malaria parasite usi...

引用

2011 4th International Conference on Biomedical Engineering and Informatics, BMEI 2011

作者： Chen, Ying-Li Li, Qian-Zhong Zhang, Li-Qing Laboratory of Theoretical Biophysics School of Physical Science and Technology Inner Mongolia University Hohhot China Department of Computer Science Virginia Tech Blacksburg VA United States Program in Genetics Bioinformatics and Computational Biology Virginia Tech Blacksburg VA United States

ISBN: (纸本)9781424493524

The rate of human death and morbidity due to malaria is increasing in many parts of the developing countries. Thus, there is a great need to understand the critical pathways in malaria parasite in order to develop effective drugs and vaccines. In this work, based on the measure of diversity definition, we introduce the increment of diversity fusion (IDF), an improved hybrid method to predict mitochondrial proteins of malaria parasite. We conduct our experiment on an expanded protein dataset where we require the pairwise identity between two proteins is less than 25%. By choosing amino acids composition as the only input vector, we are able to achieve 65.4% accuracy with 0.32 Mathew's correlation coefficient (MCC) for the jackknife test. Further, incorporting the compositions of the N-terminal and C-terminal regions into the input vector, we show that the prediction results are improved to 82.0% accuracy with 0.64 MCC in the jackknife test. In addition, by combining with the several reduced amino acid alphabet and the hydropathy distribution along protein sequence, we achieve maximum 83.4% accuracy with 0.67 MCC in the jackknife test by using the 64 dipeptide compositions of the reduced amino acid alphabet obtained from Protein Blocks method. © 2011 IEEE.

关键词： Proteins

来源：评论

学校读者我要写书评

暂无评论

Ranking docked models of protein-protein complexes using predicted partner-specific protein-protein interfaces: A preliminary study 11

Ranking docked models of protein-protein complexes using pre...

引用

2011 ACM Conference on bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011

作者： Xue, Li C. Jordan, Rafael A. El-Manzalawy, Yasser Dobbs, Drena Honavar, Vasant Bioinformatics and Computational Biology Program Iowa State University Ames IA 50011 United States Department of Genetics Development and Cell Biology Iowa State University Ames 50011 United States Department of Computer Science Iowa State University Ames IA 50011 United States Department of Computer Science Pontificia Universidad Javeriana Cali Colombia Department of Systems and Computer Engineering AI-Azhar University Cairo Egypt

ISBN: (纸本)9781450307963

Computational protein-protein docking is a valuable tool for determining the conformation of complexes formed by interacting proteins. Selecting near-native conformations from the large number of possible models generated by docking software presents a significant challenge in practice. We introduce a novel method for ranking docked conformations based on the degree of overlap between the interface residues of a docked conformation formed by a pair of proteins with the set of predicted interface residues between them. Our approach relies on a method, called PS-HomPPI, for reliably predicting proteinprotein interface residues by taking into account information derived from both interacting proteins. PS-HomPPI infers the residues of a query protein that are likely to interact with a partner protein based on known interface residues of the homo-interologs of the query-partner protein pair, i.e., pairs of interacting proteins that are homologous to the query protein and partner protein. Our results on Docking Benchmark 3.0 show that the quality of the ranking of docked conformations using our method is consistently superior to that produced using ClusPro cluster-size-based and energy-based criteria for 61 out of the 64 docking complexes for which PS-HomPPI produces interface predictions. An implementation of our method for ranking docked models is freely available at: http://***/DockRank/. Copyright © 2011 ACM.

关键词： Proteins

来源：评论

学校读者我要写书评

暂无评论

Improving protein-RNA interface prediction by combining sequence homology based method with a naive bayes classifier: Preliminary results 11

Improving protein-RNA interface prediction by combining sequ...

引用

2011 ACM Conference on bioinformatics, Computational Biology and Biomedicine, ACM-BCB 2011

作者： Xue, Li C. Walia, Rasna EL-Manzalawy, Yasser Dobbs, Drena Honavar, Vasant Bioinformatics and Computational Biology Program Iowa State University Ames IA 50011 United States Department of Genetics Development and Cell Biology Iowa State University Ames IA 50011 United States Department of Computer Science Iowa State University Ames IA 50011 United States Department of Systems and Computer Engineering Al-Azhar University Cairo Egypt

ISBN: (纸本)9781450307963

Protein-RNA interactions play important roles in cellular processes like protein synthesis, RNA processing, and gene expression regulation. Reliable identification of the interfaces involved in RNA-protein interactions is essential for comprehending the mechanisms and the functional implications of these interactions and provides a valuable guide for rational drug discovery and design. Because the determination of 3D structures of protein-RNA complexes has various technical limitations and is typically costly, reliable in silico interface prediction methods that require only the sequence information are urgently needed. We present HomPRIP, a homologous sequence based method for predicting protein-RNA interfaces, based on our conservation analysis of protein-RNA interfaces. We test Hom-PRIP on a benchmark dataset of 199 proteins and compare it with the state-of-the-art protein-RNA interface prediction methods. Our results show that HomPRIP can reliably identify protein-RNA interface residues in 71% of test proteins with at least one putative sequence homolog passing the similarity thresholds of HomPRIP. Moreover, to facilitate predictions for proteins with no identified homologs, we develop HomPRIP-NB, a method combining the HomPRIP predictor and a Naive Bayes (NB) classifier trained using evolutionary information derived from alignments against the NCBI nr database. Our results suggest that HomPRIP-NB significantly outperforms the state-of-the-art machine learning methods for predicting protein-RNA interface residues. Copyright © 2011 ACM.

关键词： Proteins

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：