Despite their prevalence in deep-learning communities, over-parameterized models demand high computational costs for proper training. This work studies the fine-grained, module-level learning dynamics of over-parameterized models to attain a more efficient and fruitful training strategy. Empirical evidence reveals that when scaling down to network modules, such as heads in self-attention models, we can observe varying learning patterns implicitly associated with each module's trainability. To describe such module-level learning capabilities, we introduce a novel concept dubbed modular neural tangent kernel (mNTK), and we demonstrate that the quality of a module's learning is tightly associated with its mNTK's principal eigenvalue λmax. A large λmax indicates that the module learns features with better convergence, while a small one may harm generalization. Inspired by this discovery, we propose a novel training strategy termed Modular Adaptive Training (MAT) that selectively updates only those modules whose λmax exceeds a dynamic threshold, concentrating the model on learning common features and ignoring superfluous ones. Unlike most existing training schemes, which run a complete BP cycle across all network modules, MAT saves significant computation through its partial-update strategy and can further improve performance. Experiments show that MAT nearly halves the computational cost of model training while outperforming baselines in accuracy.
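The abstract does not detail how λmax is estimated or how the dynamic threshold is set. As a rough illustration only, here is a minimal Python sketch under assumed choices (per-sample gradients as a cheap mNTK surrogate, a quantile-based threshold); the paper's actual estimator and update policy may differ.

```python
# Hypothetical sketch of module-level lambda_max gating; not the authors' code.
import torch

def module_lambda_max(per_sample_grads):
    # per_sample_grads: (B, P) matrix J whose rows are per-sample gradients
    # taken w.r.t. one module's parameters on a mini-batch of size B.
    # A cheap mNTK surrogate is K = J @ J.T (B x B); lambda_max is its top eigenvalue.
    K = per_sample_grads @ per_sample_grads.T
    return torch.linalg.eigvalsh(K)[-1].item()  # eigvalsh returns ascending order

def mat_update_mask(lambda_maxes, quantile=0.5):
    # Dynamic threshold (assumed policy): update only modules whose lambda_max
    # exceeds the current quantile across all modules.
    thresh = torch.tensor(lambda_maxes).quantile(quantile).item()
    return [lam > thresh for lam in lambda_maxes]
```

In a training loop, modules masked out would simply be excluded from the backward pass for that step, which is where the claimed compute savings would come from.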
ISBN (digital): 9798350391244
ISBN (print): 9798350391251
Early identification of lung cancer is vital for improving patient outcomes, since the disease is a major contributor to cancer-related mortality, largely because it is often diagnosed at a late stage. This study addresses the challenge of accurately identifying early-stage lung cancer from MRI scans, where subtle tumors are often overlooked. Traditional diagnostic methods, reliant on manual interpretation and conventional machine learning models, fall short in handling the complexity of MRI data. To address these constraints, we propose a hybrid deep learning model that combines Convolutional Neural Networks (CNNs) for spatial feature extraction with Transformer networks for contextual analysis. This approach significantly enhances the accuracy of early-stage lung cancer detection. Performance evaluation on extensive MRI datasets demonstrates that the hybrid model achieves an accuracy of 95%, a sensitivity of 93%, and a specificity of 96%, outperforming traditional diagnostic methods. The results highlight the potential of this hybrid model to improve early-detection strategies, and ultimately treatment outcomes and survival rates, for lung cancer patients.
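As a hedged illustration of the hybrid architecture described above (not the authors' code), the following PyTorch sketch routes CNN feature maps into a Transformer encoder for classification; the layer sizes and the omission of positional encoding are simplifications.

```python
# Illustrative CNN + Transformer hybrid for single-channel MRI slices.
import torch
import torch.nn as nn

class CNNTransformer(nn.Module):
    def __init__(self, num_classes=2, d_model=128):
        super().__init__()
        self.cnn = nn.Sequential(  # assumed small backbone; real one would be deeper
            nn.Conv2d(1, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(64, d_model, 3, stride=2, padding=1), nn.ReLU(),
        )
        enc = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.transformer = nn.TransformerEncoder(enc, num_layers=2)
        self.head = nn.Linear(d_model, num_classes)

    def forward(self, x):                        # x: (B, 1, H, W)
        f = self.cnn(x)                          # (B, C, H', W') spatial features
        tokens = f.flatten(2).transpose(1, 2)    # (B, H'*W', C) token sequence
        z = self.transformer(tokens).mean(dim=1) # pooled contextual representation
        return self.head(z)
```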
Pre-trained models of source code have gained widespread popularity in many code intelligence tasks. Recently, with the scaling of model and corpus size, large language models have shown the ability of in-context learning (ICL). ICL employs task instructions and a few examples as demonstrations, and then feeds the demonstrations to the language model to make predictions. This new learning paradigm is training-free and has shown impressive performance on various natural language processing and code intelligence tasks. However, the performance of ICL heavily relies on the quality of demonstrations, e.g., the selected examples. It is therefore important to systematically investigate how to construct a good demonstration for code-related tasks. In this paper, we empirically explore the impact of three key factors on the performance of ICL in code intelligence tasks: the selection, order, and number of demonstration examples. We conduct extensive experiments on three code intelligence tasks: code summarization, bug fixing, and program synthesis. Our experimental results demonstrate that all three factors dramatically impact the performance of ICL in code intelligence tasks. Additionally, we summarize our findings and provide takeaway suggestions on how to construct effective demonstrations from these three perspectives. We also show that a carefully designed demonstration based on our findings can lead to substantial improvements over widely used demonstration construction methods, e.g., improving BLEU-4 on code summarization and exact match (EM) on bug fixing and program synthesis by at least 9.90%, 175.96%, and 50.81%, respectively.
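To make the three studied factors concrete, here is a hypothetical sketch of a demonstration builder; the `similarity` function, the task instruction, and the ordering policy are assumptions for illustration, not the paper's prescribed recipe.

```python
# Hypothetical ICL prompt construction exposing the three knobs the paper studies:
# which examples to select, in what order, and how many to include.
def build_icl_prompt(query, pool, similarity, k=4):
    # Selection: keep the k most similar examples from the candidate pool.
    # Order: sort ascending so the most similar example sits closest to the query.
    ranked = sorted(pool, key=lambda ex: similarity(query, ex["input"]))
    demos = ranked[-k:]                       # number: capped at k demonstrations
    parts = ["Summarize the following code."]  # assumed task instruction
    for ex in demos:
        parts.append(f"Code:\n{ex['input']}\nSummary: {ex['output']}")
    parts.append(f"Code:\n{query}\nSummary:")
    return "\n\n".join(parts)
```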
Knowledge transfer (KT) has been regarded as an efficient method in evolutionary multitask optimization (EMTO) by utilizing the information of other tasks to promote the optimization of the current task. Most KT metho...
Robot-assisted minimally invasive surgery benefits from enhancing dynamic scene reconstruction, as it improves surgical outcomes. While Neural Radiance Fields (NeRF) have been effective in scene reconstruction, their ...
Deep hashing is an appealing approach for large-scale image retrieval. Most existing supervised deep hashing methods learn hash functions from pairwise or triplet image similarities in randomly sampled mini-batches, and thus suffer from low training efficiency, insufficient coverage of the data distribution, and pair-imbalance problems. Recently, central similarity quantization (CSQ) attacked these problems by using “hash centers” as a global similarity metric, encouraging the hash codes of similar images to approach their common hash center and to distance themselves from other hash centers. Although it achieves state-of-the-art retrieval performance, CSQ lacks a worst-case guarantee on the minimal distance between its constructed hash centers, i.e., the hash centers can be arbitrarily close. This paper presents an optimization method that finds hash centers subject to a constraint on the minimal distance between any pair of hash centers, which is non-trivial due to the non-convex nature of the problem. More importantly, we adopt the Gilbert-Varshamov bound from coding theory, which helps us obtain a large minimal distance while ensuring the empirical feasibility of our optimization approach. With these clearly separated hash centers, each assigned to one image class, we propose several effective loss functions to train deep hashing networks. Extensive experiments on three image retrieval datasets demonstrate that the proposed method achieves superior retrieval performance over state-of-the-art deep hashing methods.
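The Gilbert-Varshamov argument can be made concrete with a short calculation: for q-bit binary codes, the bound guarantees a code of minimum Hamming distance d with at least 2^q / Σ_{j<d} C(q, j) codewords, so one can find the largest d that still accommodates one center per class. A small Python sketch (illustrative; the paper's exact usage may differ):

```python
# Gilbert-Varshamov feasibility check for hash-center construction.
from math import comb

def gv_lower_bound(n_bits, d):
    # GV bound: a binary code of length n and min distance d exists with at
    # least 2^n / sum_{j=0}^{d-1} C(n, j) codewords.
    return 2 ** n_bits // sum(comb(n_bits, j) for j in range(d))

def max_feasible_distance(n_bits, num_classes):
    # Largest d whose GV guarantee still covers all classes (one center each).
    d = 1
    while d < n_bits and gv_lower_bound(n_bits, d + 1) >= num_classes:
        d += 1
    return d

# e.g. max_feasible_distance(64, 100) yields a safe target distance for 100 classes.
```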
ISBN (digital): 9798331501488
ISBN (print): 9798331501495
WSNs function as key components across environmental monitoring systems, healthcare facilities, and industrial automation networks. Optimizing WSN performance remains an ongoing challenge because these networks must cope with restricted energy supplies, shifting operational environments, and limited data-transfer capacity. Current optimization techniques rely on static hierarchical approaches combined with heuristic methods, which cannot adjust in real time and thus lead to deteriorating network conditions and resource inefficiency. This research puts forward the Self-Attention-based Sparse Graph Convolutional Neural Network (SA-SGCN), a real-time parameter-prediction framework that optimizes WSN performance by automatically adjusting critical metrics including energy usage, packet success ratio, latency, and data-transfer speed. Real-time prediction becomes possible through the self-attention mechanism, which captures distant dependencies across the sensor network, combined with sparse graph convolution, which reduces computational requirements. The Crested Porcupine Optimizer (CPO) tunes the hyperparameters of the SA-SGCN model for better efficiency and accuracy. Experimental evidence shows that the proposed method outperforms current approaches, reaching a prediction accuracy of 99.9% for WSN configurations. The research presents a dependable, energy-efficient, adaptive framework for WSNs that enables high scalability and reliable operation in fast-changing environments.
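A speculative sketch of the core SA-SGCN layer, as read from the abstract: sparse graph convolution over the WSN topology fused with self-attention for long-range node dependencies. The dimensions, head count, and fusion rule are all assumptions.

```python
# Speculative SA-SGCN layer sketch; not the authors' implementation.
import torch
import torch.nn as nn

class SASGCNLayer(nn.Module):
    def __init__(self, dim):
        super().__init__()
        self.lin = nn.Linear(dim, dim)
        self.attn = nn.MultiheadAttention(dim, num_heads=2, batch_first=True)

    def forward(self, x, adj):
        # x: (N, F) per-node features; adj: sparse (N, N) normalized adjacency.
        h = torch.sparse.mm(adj, self.lin(x))              # sparse graph convolution
        a, _ = self.attn(h.unsqueeze(0), h.unsqueeze(0), h.unsqueeze(0))
        return torch.relu(h + a.squeeze(0))                # fuse local + global context
```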
Semi-supervised learning (SSL) aims to leverage massive unlabeled data when labels are expensive to obtain. Unfortunately, in many real-world applications, the collected unlabeled data will inevitably contain unseen-class outliers not belonging to any of the labeled classes. To deal with this challenging open-set SSL task, mainstream methods tend to first detect outliers and then filter them out. However, we observe the surprising fact that such an approach can result in more severe performance degradation when labels are extremely scarce, as the unreliable outlier detector may wrongly exclude a considerable portion of valuable inliers. To tackle this issue, we introduce a novel open-set SSL framework, IOMatch, which can jointly utilize inliers and outliers, even when it is difficult to distinguish exactly between them. Specifically, we propose to employ a multi-binary classifier in combination with the standard closed-set classifier to produce unified open-set classification targets, which regard all outliers as a single new class. By adopting these targets as open-set pseudo-labels, we optimize an open-set classifier with all unlabeled samples, including both inliers and outliers. Extensive experiments show that IOMatch significantly outperforms baseline methods across different benchmark datasets and settings despite its remarkable simplicity. Our code and models are available at https://***/nukezil/IOMatch.
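The unified open-set targets can be sketched as follows. This is one reading of the abstract rather than the released code: the closed-set distribution is fused with per-class multi-binary inlier scores, and the leftover probability mass is assigned to a single (K+1)-th outlier class.

```python
# Rough sketch of unified open-set pseudo-label construction.
import torch

def open_set_targets(closed_probs, ova_inlier_probs):
    # closed_probs: (B, K) softmax over the K seen classes.
    # ova_inlier_probs: (B, K) per-class binary "is an inlier of class k" scores.
    seen = closed_probs * ova_inlier_probs                    # (B, K) seen-class mass
    outlier = (1.0 - seen.sum(dim=1, keepdim=True)).clamp(min=0)
    return torch.cat([seen, outlier], dim=1)                  # (B, K+1) open-set target
```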
ISBN (digital): 9798331529833
ISBN (print): 9798331529840
This paper introduces an automated grading system for mangoes that improves efficiency and accuracy over human-based methods. The system uses the Lion Assisted Firefly Algorithm (LA-FF) to select the best features from multiple candidate highlights, and then applies LA-FF again to fine-tune the convolutional layers of a deep CNN to the specific requirements of mango grading. By integrating recent algorithms, automation, and adaptation, the system delivers an effective and precise grading pipeline suitable for rural agricultural contexts.
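Since the abstract leaves the LA-FF details unspecified, the following is only a loose, generic sketch of such a population-based tuning loop: candidates are scored by a user-supplied `evaluate` function (e.g., validation accuracy of a CNN configuration) and attracted toward the best-scoring one, firefly-style. The search space and step sizes are hypothetical.

```python
# Generic firefly-style hyperparameter tuning sketch (LA-FF specifics assumed).
import random

def laff_tune(evaluate, bounds, pop=6, iters=10, step=0.3):
    # bounds: hypothetical search space, e.g. {"filters": (16, 128), "lr_exp": (-4, -2)}.
    xs = [{k: random.uniform(*b) for k, b in bounds.items()} for _ in range(pop)]
    for _ in range(iters):
        scores = [evaluate(x) for x in xs]
        best = dict(xs[max(range(pop), key=scores.__getitem__)])  # snapshot the best
        for x in xs:  # attraction toward the brightest (best-scoring) candidate
            for k, (lo, hi) in bounds.items():
                x[k] += step * (best[k] - x[k]) + random.gauss(0, 0.02) * (hi - lo)
                x[k] = min(max(x[k], lo), hi)
    return max(xs, key=evaluate)
```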