Search results - Inner Mongolia University Library

Bayesian multinomial logistic normal models through marginally latent matrix-T processes

The Journal of Machine Learning Research, 2022, vol. 23, no. 1, pp. 255-296

Authors: Justin D. Silverman, Kimberly Roche, Zachary C. Holmes, Lawrence A. David, Sayan Mukherjee. Affiliations: College of Information Science and Technology, Department of Statistics, and Institute for Computational and Data Science, Penn State University, University Park, PA; Program in Computational Biology and Bioinformatics, Duke University, Durham, NC; Department of Molecular Genetics and Microbiology, Duke University, Durham, NC; Department of Molecular Genetics and Microbiology and Center for Genomic and Computational Biology, Duke University, Durham, NC; Departments of Statistical Science, Mathematics, Computer Science, Biostatistics & Bioinformatics, Duke University, Durham, NC

Bayesian multinomial logistic-normal (MLN) models are popular for the analysis of sequence count data (e.g., microbiome or gene expression data) due to their ability to model multivariate count data with complex covariance structure. However, existing implementations of MLN models are limited to small datasets due to the non-conjugacy of the multinomial and logistic-normal distributions. Motivated by the need to develop efficient inference for Bayesian MLN models, we develop two key ideas. First, we develop the class of Marginally Latent Matrix-T Process (Marginally LTP) models. We demonstrate that many popular MLN models, including those with latent linear, non-linear, and dynamic linear structure, are special cases of this class. Second, we develop an efficient inference scheme for Marginally LTP models with specific accelerations for the MLN subclass. Through application to MLN models, we demonstrate that our inference scheme is both highly accurate and often 4-5 orders of magnitude faster than MCMC.

Keywords: Bayesian statistics; multivariate analysis; count data; microbiome; gene expression

Enhanced Performance and Data Privacy in Lung Nodule Classification via Federated Deep Learning Approach

2024 7th International Conference on Healthcare Service Management, ICHSM 2024

Authors: Nguyen, Duc-Khanh; Li, Ai-Hsien Adam; Lai, Yen-Jun; Chiu, Yen-Ling; Phan, Dinh-Van; Chien, Ting-Ying; Chan, Chien-Lung. Affiliations: Department of Information Management, Yuan Ze University, Taoyuan, Taiwan; Department of Statistics and Informatics, University of Economics, The University of Danang, Danang, Viet Nam; Division of Cardiology, Far Eastern Memorial Hospital, New Taipei, Taiwan; Graduate Program in Biomedical Informatics, Yuan Ze University, Taoyuan, Taiwan; Department of Radiology, Far Eastern Memorial Hospital, New Taipei, Taiwan; Graduate Institute of Medicine and Graduate Program in Biomedical Informatics, Yuan Ze University, Taoyuan, Taiwan; Department of Medical Research, Far Eastern Memorial Hospital, New Taipei, Taiwan; Department of Computer Science and Engineering, Yuan Ze University, Taoyuan, Taiwan; ZDT Group-YZU Joint Research and Development Center for Big Data, Taoyuan, Taiwan; Yuan Ze University, Taoyuan, Taiwan

ISBN: (print) 9798400710162

This study explores the feasibility of deep learning for classifying nodule neoplasms, analyzing model performance on two openly available datasets, LUNGx SPIE and LIDC-IDRI. These datasets offer valuable diversity in image quantity, quality, and malignancy annotation methodologies, posing a real-world challenge for effective model development. To handle the disparities between these datasets, we aim to develop a robust classification model trained using federated learning. This decentralized approach allows model training across datasets without sharing patient data, preserving privacy and overcoming data fragmentation. Our experiments demonstrate that federated learning enables the development of a robust and superior deep learning model compared to individual models trained on limited data. This combination of deep learning and federated learning holds immense promise for nodule neoplasm classification and can significantly improve patient care and outcomes. © 2024 Copyright held by the owner/author(s).
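As an illustration of the decentralized training idea described in this abstract, the sketch below shows one common aggregation step, a sample-size-weighted average of per-site model parameters (FedAvg-style). The abstract does not state which aggregation rule the authors used, so this choice, the function name `federated_average`, and the flat-list parameter representation are all assumptions made for illustration only.

```python
from typing import List, Sequence


def federated_average(
    client_weights: Sequence[Sequence[float]],
    client_sizes: Sequence[int],
) -> List[float]:
    """Return the sample-size-weighted average of per-client parameter vectors.

    Hypothetical helper: each client (e.g. one dataset holder) trains locally
    and shares only its parameter vector, never its patient data.
    """
    if not client_weights or len(client_weights) != len(client_sizes):
        raise ValueError("need one non-empty weight vector per client size")
    total = sum(client_sizes)
    if total <= 0:
        raise ValueError("total number of samples must be positive")
    dim = len(client_weights[0])
    averaged = [0.0] * dim
    for weights, size in zip(client_weights, client_sizes):
        if len(weights) != dim:
            raise ValueError("all clients must share the same parameter shape")
        for i, w in enumerate(weights):
            # Clients with more local samples contribute proportionally more.
            averaged[i] += w * size / total
    return averaged
```

For example, two clients holding 1 and 3 samples with parameter vectors `[1.0, 2.0]` and `[3.0, 4.0]` would aggregate to `[2.5, 3.5]`. In a full system this step would be repeated over many rounds, with the averaged parameters sent back to each site for further local training.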

Keywords: Federated learning

MultiCalib4DEB: A toolbox exploiting multimodal optimisation in Dynamic Energy Budget parameters calibration

arXiv, 2023

Authors: Robles, Juan Francisco; Chica, Manuel; Filgueira, Ramón; Agüera, Antonio; Damas, Sergio. Affiliations: Department of Computer Science and Artificial Intelligence, Andalusian Research Institute in Data Science and Computational Intelligence, DaSCI, University of Granada, Granada 18071, Spain; Department of Benthic resources, Norwegian Institute of Marine Research, Bergen NO-5817, Norway; Department of Software Engineering, Andalusian Research Institute in Data Science and Computational Intelligence, DaSCI, University of Granada, Granada 18071, Spain; Marine Affairs Program, Life Sciences Centre, Dalhousie University, 1459 Oxford Street, Halifax, NS B3H 4R2, Canada

1. Calibration is a crucial step for the validation of computational models and a challenging task to accomplish. 2. Dynamic Energy Budget (DEB) theory has experienced an exponential rise in the number of published papers, which in large part has been made possible by the DEBtool toolbox. Multimodal evolutionary optimisation could provide DEBtool with new capabilities, particularly relevant on the provisioning of equally optimal and diverse solutions. 3. In this paper we present MultiCalib4DEB, a MATLAB toolbox directly integrated into the existing DEBtool toolbox, which uses multimodal evolutionary optimisation algorithms to find multiple global and local optimal and diverse calibration solutions for DEB models. 4. MultiCalib4DEB adds powerful calibration mechanisms, statistical analysis, and visualisation methods to the DEBtool toolbox and provides a wide range of outputs, different calibration alternatives, and specific tools to strengthen the DEBtool calibration module and to aid DEBtool users to evaluate the performance of the calibration results. © 2023, CC BY-NC-ND.

Keywords: Calibration

When data disappear: public health pays as US policy strays

The Lancet Digital Health, 2025, p. 100874

Authors: McAndrew, Thomas; Lover, Andrew A; Hoyt, Garrik; Majumder, Maimuna S. Affiliations: Department of Biostatistics and Health Data Science, College of Health, Lehigh University, Bethlehem, PA, United States; Department of Biostatistics and Epidemiology, School of Public Health and Health Sciences, University of Massachusetts Amherst, Amherst, MA, United States; Department of Computer Science and Engineering, PC Rossin College of Engineering and Applied Sciences, Lehigh University, Bethlehem, PA, United States; Department of Pediatrics, Harvard Medical School, Boston, MA, United States; Computational Health Informatics Program, Boston Children's Hospital, Boston, MA, United States

Presidential actions on Jan 20, 2025, by President Donald Trump, including executive orders, have delayed access to or led to the removal of crucial public health data sources in the USA. The continuous collection and maintenance of health data support public health, safety, and security associated with diseases such as seasonal influenza. To show how public health data surveillance enhances public health practice, we analysed data from seven US Government-maintained sources associated with seasonal influenza. We fit two models that forecast the number of national incident influenza hospitalisations in the USA: (1) a data-rich model incorporating data from all seven Government data sources; and (2) a data-poor model built using a single Government hospitalisation data source, representing the minimal required information to produce a forecast of influenza hospitalisations. The data-rich model generated reliable forecasts useful for public health decision making, whereas the predictions using the data-poor model were highly uncertain, rendering them impractical. Thus, health data can serve as a transparent and standardised foundation to improve domestic and global health. Therefore, a plan should be developed to safeguard public health data as a public good. © 2025 The Author(s)

Keywords: Health insurance

Embedding Space Augmentation for Weakly Supervised Learning in Whole-Slide Images

IEEE International Symposium on Biomedical Imaging

Authors: Imaad Zaffar, Guillaume Jaume, Nasir Rajpoot, Faisal Mahmood. Affiliations: Department of Computer Science, University College London, UK; Department of Pathology, Harvard Medical School, Brigham and Women’s Hospital, Boston, MA, USA; Department of Pathology, Harvard Medical School, Massachusetts General Hospital, Boston, MA, USA; Cancer Program, Broad Institute of Harvard and MIT, Cambridge, MA, USA; Data Science Program, Dana-Farber Cancer Institute, Boston, MA, USA; Department of Computer Science, Tissue Image Analytics Centre, University of Warwick, Coventry, UK; Department of Pathology, University Hospitals Coventry and Warwickshire NHS Trust, Coventry, UK; The Alan Turing Institute, London, UK

Multiple Instance Learning (MIL) is a widely employed framework for learning on gigapixel whole-slide images (WSIs) from WSI-level annotations. In most MIL-based analytical pipelines for WSI-level analysis, the WSIs are divided into patches, and deep features for patches (i.e., patch embeddings) are extracted prior to training to reduce the overall computational cost and cope with the GPUs’ limited RAM. Because of this bottleneck, incorporating patch-level data augmentations during training adds an extra computational burden. To overcome this limitation, we present EmbAugmenter, a data augmentation generative adversarial network (DA-GAN) that can synthesize data augmentations in the embedding space rather than in the pixel space, thereby significantly reducing the computational requirements. Experiments on the SICAPv2 dataset show that our approach outperforms MIL without augmentation and is on par with traditional patch-level augmentation for MIL training while being substantially faster.

Hybrid ACO and PSO to minimize makespan in job shop scheduling problems

AIP Conference Proceedings, 2024, vol. 3222, no. 1

Authors: Priljen Habeahan, Erna Budhiarti Nababan, Mahyuddin K. M. Nasution. Affiliations: Informatics Engineering Master Program, Faculty of Computer Science and Information Technology, Universitas Sumatera Utara, Medan, Indonesia; Data Science and Artificial Intelligence, Faculty of Computer Science and Information Technology, Universitas Sumatera Utara, Medan, Indonesia

The search for the lowest completion time in the dataset used by E. Taillard employs a heuristic approach based on tabu search to obtain the predicted solution. Glover’s study gives a broad description of tabu search, which is commonly applied to Taillard’s job shop scheduling problems and to the flow shop sequencing problems of Widmer et al. Although tabu search is relatively simple to use and typically yields excellent results, it takes a long time to complete. In this research, a hybrid of ACO and PSO is used to minimize makespan in the Job Shop Scheduling Problem. The benchmark data are secondary data taken from E. Taillard’s “Benchmarks for basic scheduling problems”, consisting of job shop matrices (job × machine) of size 4 × 4, 5 × 5, 7 × 7, 10 × 10, 15 × 15, 20 × 20, 30 × 15, 30 × 20, 50 × 15 and 50 × 20. The hybrid first calculates the Pbest value, namely the processing position of each job on a machine that gives the best solution, using the PSO algorithm. It then calculates the Gbest (global best) value, the best position of each job across all machines, also using the PSO algorithm, and initializes the ACO parameters from the Pbest and Gbest values. On datasets of size 10 × 10, 15 × 15, 20 × 20, 30 × 15, 30 × 20, 50 × 15 and 50 × 20, the method produces a smaller makespan compared to the lower bound on the dataset, with an average minimum makespan improvement value of 1.184.
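The Pbest and Gbest bookkeeping mentioned in this abstract can be sketched with a generic particle swarm loop. This is a minimal, continuous-valued illustration only: the abstract does not give the authors' job-to-machine encoding, their parameter values, or how the ACO stage is seeded, so the function name `pso_minimize`, the coefficients `w`, `c1`, `c2`, and the search range are all assumptions.

```python
import random
from typing import Callable, List, Sequence, Tuple


def pso_minimize(
    cost: Callable[[Sequence[float]], float],
    dim: int,
    n_particles: int = 20,
    n_iters: int = 100,
    seed: int = 0,
) -> Tuple[List[float], float]:
    """Minimise `cost` with a basic particle swarm; returns (gbest, gbest_cost)."""
    rng = random.Random(seed)
    w, c1, c2 = 0.7, 1.5, 1.5  # inertia, cognitive and social weights (assumed)
    pos = [[rng.uniform(-5.0, 5.0) for _ in range(dim)] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]

    # Pbest: the best position each particle has visited so far.
    pbest = [p[:] for p in pos]
    pbest_cost = [cost(p) for p in pos]
    # Gbest: the best position found by any particle so far.
    g = min(range(n_particles), key=pbest_cost.__getitem__)
    gbest, gbest_cost = pbest[g][:], pbest_cost[g]

    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Pull each particle toward its own best and the swarm's best.
                vel[i][d] = (
                    w * vel[i][d]
                    + c1 * r1 * (pbest[i][d] - pos[i][d])
                    + c2 * r2 * (gbest[d] - pos[i][d])
                )
                pos[i][d] += vel[i][d]
            c = cost(pos[i])
            if c < pbest_cost[i]:
                pbest[i], pbest_cost[i] = pos[i][:], c
                if c < gbest_cost:
                    gbest, gbest_cost = pos[i][:], c
    return gbest, gbest_cost
```

For a scheduling problem, `cost` would be replaced by a function that decodes a particle into a schedule and returns its makespan. The resulting Pbest and Gbest values could then be passed to a later stage, as the abstract describes for the ACO initialization.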

Generative AI for Unmanned Vehicle Swarms: Challenges, Applications and Opportunities

arXiv, 2024

Authors: Liu, Guangyuan; Van Huynh, Nguyen; Du, Hongyang; Hoang, Dinh Thai; Niyato, Dusit; Zhu, Kun; Kang, Jiawen; Xiong, Zehui; Jamalipour, Abbas; In Kim, Dong. Affiliations: The School of Computer Science and Engineering, The Energy Research Institute @ NTU, Interdisciplinary Graduate Program, Nanyang Technological University, Singapore; The Department of Electrical Engineering and Electronics, University of Liverpool, Liverpool L69 3GJ, United Kingdom; The School of Electrical and Data Engineering, University of Technology Sydney, Australia; The School of Computer Science and Engineering, Nanyang Technological University, Singapore; College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China; The School of Automation, Guangdong University of Technology, China; The Pillar of Information Systems Technology and Design, Singapore University of Technology and Design, Singapore; The School of Electrical and Information Engineering, University of Sydney, Australia; The College of Information and Communication Engineering, Sungkyunkwan University, Korea, Republic of

With recent advances in artificial intelligence (AI) and robotics, unmanned vehicle swarms have received great attention from both academia and industry due to their potential to provide services that are difficult and dangerous for humans to perform. However, learning and coordinating movements and actions for a large number of unmanned vehicles in complex and dynamic environments introduces significant challenges to conventional AI methods. Generative AI (GAI), with its capabilities in complex data feature extraction, transformation, and enhancement, offers great potential in solving these challenges of unmanned vehicle swarms. For that, this paper aims to provide a comprehensive survey on applications, challenges, and opportunities of GAI in unmanned vehicle swarms. Specifically, we first present an overview of unmanned vehicles and unmanned vehicle swarms as well as their use cases and existing issues. Then, an in-depth background of various GAI techniques, together with their capabilities in enhancing unmanned vehicle swarms, is provided. After that, we present a comprehensive review on the applications and challenges of GAI in unmanned vehicle swarms with various insights and discussions. Finally, we highlight open issues of GAI in unmanned vehicle swarms and discuss potential research directions. © 2024, CC BY.

Keywords: Metadata

Supervised Contrastive Learning Enhances MHC-II Peptide Binding Affinity Prediction

SSRN, 2024

Authors: Shen, Long-Chen; Liu, Yan; Liu, Zi; Zhang, Yumeng; Wang, Zhikang; Guo, Yuming; Rossjohn, Jamie; Song, Jiangning; Yu, Dong-Jun. Affiliations: School of Computer Science and Engineering, Nanjing University of Science and Technology, 200 Xiaolingwei, Nanjing 210094, China; Department of Computer Science, Yangzhou University, Yangzhou 225100, China; School of Information Engineering, Jingdezhen Ceramic University, Jingdezhen 333403, China; Monash Biomedicine Discovery Institute, Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia; Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia; School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, Shanghai 200240, China; Department of Epidemiology and Preventive Medicine, Xiangya Hospital, China; Infection and Immunity Program, Netherlands

Accurate prediction of major histocompatibility complex (MHC)-peptide binding affinity can improve our understanding of cellular immune responses and guide personalized immunotherapies. Nevertheless, the existing deep learning-based approaches for predicting MHC-II peptide interactions fall short of satisfactory performance and offer restricted model interpretability. In this study, we propose a novel deep neural network, termed ConBoTNet, to address the above issues by introducing the designed supervised contrastive learning and bottleneck transformer extractors. Specifically, the supervised contrastive learning pre-training enhances the model’s representative and generalizable capabilities on MHC-II peptides by pulling positive pairs closer and pushing negative pairs further apart in the feature space, while the bottleneck transformer module focuses on MHC-II peptide interactions to precisely identify binding cores and anchor positions in an unsupervised manner. Extensive experiments on benchmark datasets under 5-fold cross-validation, leave-one-molecule-out validation, independent testing, and binding core prediction settings highlighted the superiority of our proposed ConBoTNet over current state-of-the-art methods. Data distribution analysis in the latent feature space demonstrated that supervised contrastive learning can aggregate MHC-II-peptide samples with similar affinity labels and learn common features of similar affinity. Additionally, we interpreted the trained neural network by associating the attention weights with peptides and found both well-established and potential peptide motifs. This work not only introduces an innovative tool for accurately predicting MHC-II peptide affinity, but also provides new insights into a new paradigm for modeling essential biological interactions, advancing data-driven discovery in biomedicine. © 2024, The Authors. All rights reserved.

Keywords: Peptides

Statistical analysis and data visualization of Indonesia and Malaysia SARS CoV-2 metadata

AIP Conference Proceedings, 2023, vol. 2594, no. 1

Authors: D. Sudigyo, A. Budiarto, B. Pardamean. Affiliations: Bioinformatics & Data Science Research Center, Bina Nusantara University, Jakarta, Indonesia 11480; Computer Science Department, School of Computer Science, Nusantara University, Jakarta, Indonesia 11480; Computer Science Department, BINUS Graduate Program - Master of Computer Science Program, Bina Nusantara University, Jakarta, Indonesia 11480

SARS CoV-2 is a fascinating topic to investigate, especially in Indonesia and Malaysia, which share similar racial demographics. However, statistical analysis of information on SARS CoV-2 from a database, especially GISAID, does not contain specific customizations related to virus comparisons between selected countries. Therefore, the researchers conducted statistical analysis and data visualization using the Python programming language to describe and investigate SARS CoV-2 in Indonesia and Malaysia from the GISAID database. SARS CoV-2 metadata from Indonesia (N=117) and Malaysia (N=250), which were gathered during 2020, were compared. This comparison aimed to investigate the discrepancies of COVID-19 cases in closely related populations. Firstly, data visualization was conducted using the Python Matplotlib library to create bar charts for clades and mutation comparison. Additionally, a series of boxplots were generated to show age discrepancies stratified by gender. Furthermore, the statistical tests showed that only the dominant Malaysian (G and O) clades were found to be significantly different compared to Indonesian cases (p-value=0.016). The proportions of two major mutations (G614D and NSP12 P323L) were also significantly different in the two countries, caused by the dominant clade differences (p-value=0.007). Lastly, the differences in the age distribution of COVID-19 cases between the two countries were significant only in the male group (p-value=0.017).

Parameter Optimization of Support Vector Regression Using Harris Hawks Optimization

Procedia Computer Science, 2021, vol. 179, pp. 17-24

Authors: I Nyoman Setiawan, Robert Kurniawan, Budi Yuniarto, Rezzy Eko Caraka, Bens Pardamean. Affiliations: Computational Statistics Department, Polytechnic of Statistics - STIS, Jakarta 13330, Indonesia; Department of Information Management, College of Informatics, Chaoyang University of Technology, Taiwan (ROC); Bioinformatics and Data Science Research Center, Bina Nusantara University, Indonesia 11530; Computer Science Department, BINUS Graduate Program - Master of Computer Science Program, Bina Nusantara University, Jakarta, Indonesia 11530

Support Vector Regression (SVR) is often used in forecasting. Adjustment of parameters in the SVR affects the results of forecasting. This study aims to analyze the SVR method that is optimized using Harris Hawks Optimization (HHO), hereinafter referred to as HHO-SVR. The HHO-SVR was evaluated using five benchmark datasets to determine the performance of this method. The HHO process is also compared based on the type of kernel and other metaheuristic algorithms. The results showed that the HHO-SVR has almost the same performance as other methods but is less efficient in terms of time. In addition, the type of kernel also affects the process and results.

Keywords: SVR; Harris hawks optimization; parameter optimization; kernel; forecasting
