检索结果-内蒙古大学图书馆

AsyCo: an asymmetric dual-task co-training model for partial-label learning

science China(Information sciences) 2025年第5期68卷 332-347页

作者： Beibei LI Yiyuan ZHENG Beihong JIN Tao XIANG Haobo WANG Lei FENG College of Computer Science Chongqing University State Key Laboratory of Computer Science Institute of Software Chinese Academy of Sciences University of Chinese Academy of Sciences School of Software Technology Zhejiang University School of Computer Science and Engineering Nanyang Technological University

Partial-label learning(PLL) is a typical problem of weakly supervised learning, where each training instance is annotated with a set of candidate labels. Self-training PLL models achieve state-of-the-art performance but suffer from error accumulation problems caused by mistakenly disambiguated instances. Although co-training can alleviate this issue by training two networks simultaneously and allowing them to interact with each other, most existing co-training methods train two structurally identical networks with the same task, i.e., are symmetric, rendering it insufficient for them to correct each other due to their similar limitations. Therefore, in this paper, we propose an asymmetric dual-task co-training PLL model called AsyCo,which forces its two networks, i.e., a disambiguation network and an auxiliary network, to learn from different views explicitly by optimizing distinct tasks. Specifically, the disambiguation network is trained with a self-training PLL task to learn label confidence, while the auxiliary network is trained in a supervised learning paradigm to learn from the noisy pairwise similarity labels that are constructed according to the learned label confidence. Finally, the error accumulation problem is mitigated via information distillation and confidence refinement. Extensive experiments on both uniform and instance-dependent partially labeled datasets demonstrate the effectiveness of AsyCo.

关键词： machine learning weakly supervised learning partial-label learning co-training models candidate label sets

来源：评论

学校读者我要写书评

暂无评论

Predicting early ASD traits of adults and toddlers using machine learning and deep learning with explainable AI and optimization

引用

Neural Computing and Applications 2025年 1-28页

作者： Rahman, Md. Ashiqur Hossain, Md. Mamun Singh, Sondip Poul Sharmin, Nusrat Dhaka1216 Bangladesh Department of Computer Science and Engineering Military Institute of Science and Technology Dhaka Bangladesh

Autism spectrum disorder (ASD) is a complex neurodevelopmental condition characterized by challenges in social interaction, communication difficulties, repetitive behaviors, and a range of strengths and differences in cognitive abilities. Early ASD diagnosis using machine learning and deep learning techniques is crucial for preventing its severity and long-term effects. The articles published in this area have only applied different machine learning algorithms, and a notable gap observed is the absence of an in-depth analysis in terms of hyperparameter tuning and the type of dataset used in this context. This study investigated predictive modeling for ASD traits by leveraging two distinct datasets: (i) a raw CSV dataset with tabular data and (ii) an image dataset with facial expression. This study aims to conduct an in-depth analysis of ASD trait prediction in adults and toddlers by doing hyper optimized and interpreting the result through explainable AI. In the CSV dataset, a comprehensive exploration of machine learning and deep learning algorithms, including decision trees, Naive Bayes, random forests, support vector machines (SVM), k-nearest neighbors (KNN), logistic regression, XGBoost, and ANN, was conducted. XGBoost emerged as the most effective machine learning algorithm, achieving an accuracy of 96.13%. The deep learning ANN model outperformed the traditional machine learning algorithms with an accuracy of 99%. Additionally, an ensemble model combining a decision tree, random forest, SVM, KNN, and logistic regression demonstrated superior performance, yielding an accuracy of 96.67%. The XGBoost model, utilized in hyperparameter optimization for CSV data, exhibited a substantial accuracy increase, reaching 98%. For the image dataset, advanced deep learning models, such as ResNet50, VGG16, Boosting, and Bagging, were employed. The bagging model outperformed the others, achieving an impressive accuracy of 99%. Subsequent hyperparameter optimization was conduct

关键词： Random forests

来源：评论

学校读者我要写书评

暂无评论

Boosting cervical cancer detection with a multi-stage architecture and complementary information fusion

引用

Soft Computing 2025年第2期29卷 1191-1206页

作者： Sahoo, Pranab Saha, Sriparna Sharma, Saksham Kumar Mondal, Samrat Computer Science and Engineering Indian Institute of Technology Patna India Computer Science University of Maryland Baltimore CountyMD United States

Cervical cancer is one of the most fatal and prevalent illnesses affecting women globally. Early detection of cervical cancer is crucial for effective treatment. Pap smear tests are commonly used, but population-based screening is time-consuming, expensive, and requires expert physicians. computer-Aided Diagnosis (CAD) has shown promise in addressing this challenge. However, accurately predicting the disease using a single model can be difficult due to the complex data patterns involved. This research proposes a multi-stage architecture to improve cervical cancer screening. Initially, three pre-trained models are employed for image classification, after which the proposed advanced fusion technique is applied to combine the predictions. Additionally, we introduce a filtering approach in the third stage to refine the predictions. Unlike traditional fusion methods, the proposed architecture considers the confidence score of the base classifiers in making the final predictions on test samples. To enhance the performance of the models, we incorporate advanced augmentation techniques, including CutMix, CutOut, and MixUp. We assessed the performance of the proposed framework using a 5-fold cross-validation technique on two benchmark datasets. We evaluated the performance of the proposed framework through 5-fold cross-validation on two benchmark datasets. Remarkably, our framework achieved a classification accuracy of 97.62% and an F1-score of 97.64% on the SIPaKMeD dataset, demonstrating its effectiveness in accurately categorizing various cell types in the dataset. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2025.

关键词： Cervical cancer computer-aided detection Deep learning Ensemble learning

来源：评论

学校读者我要写书评

暂无评论

Enhancing Predictive Capabilities for Cyber Physical Systems Through Supervised Learning

Informatica (Slovenia)

引用

Informatica (Slovenia) 2025年第16期49卷 77-86页

作者： Dhanalakshmi, B. Tamije Selvy, P. Department of Computer Science and Engineering Dr.N.G. P Institute of technology India Department of Computer Science and Engineering Hindusthan College of Engineering and Technology India

The rapid advancement and proliferation of Cyber-Physical Systems (CPS) have led to an exponential increase in the volume of data generated continuously. Efficient classification of this streaming data is crucial for predicting system behaviors and enabling proactive decision-making. This research aims to extract actionable knowledge from the continuous data streams of CPS and predict their behavior using advanced supervised learning algorithms. The predictions facilitate timely interventions and necessary actions within the interconnected physical network. The background of this work lies in the intersection of CPS, machine learning, and data stream mining. Traditional batch processing methods are inadequate for real-time analysis of CPS data due to their inherent latency and computational inefficiency. This research employs state-of-the-art techniques for real-time data processing, including incremental learning, sliding window models, and ensemble methods tailored for streaming data. Our approach differs from existing works by focusing on a comprehensive framework that integrates real-time data ingestion, preprocessing, feature extraction, and model updating in a seamless pipeline. Unlike previous studies that often rely on static datasets and offline analysis, our method ensures continuous learning and adaptation to evolving data patterns. Comparative analysis with existing techniques demonstrates superior performance in terms of accuracy, latency, and scalability. Specifically, our models achieved an average classification accuracy of 92%, with a precision of 90%, recall of 89%, and an F1 score of 89.5%. These metrics indicate significant improvements over traditional batch processing methods, which typically lag in responsiveness and adaptability. This research provides a robust and efficient solution for the real-time classification of streaming data from CPS, enhancing the system's ability to predict behaviors and take necessary actions promptly. © 2025 Slov

关键词： Self-supervised learning

来源：评论

学校读者我要写书评

暂无评论

Cognitive Transformation in Personal IoT: Pioneering Intelligent Automation

引用

Cyber-Physical Systems 2025年第2期11卷 183-240页

作者： Gulzar, Bisma Ahmad Sofi, Shabir Sholla, Sahil Department of Information Technology National Institute of Technology Srinagar India Department of Computer Science Engineering Islamic University of Science & Technology Srinagar India

In recent years, IoT has transformed personal environments by integrating diverse smart devices. This paper presents an advanced IoT architecture that optimizes network infrastructure, focusing on the adoption of MQTT protocol and introducing Cognitive Smart Objects for managing personal IoT applications. These objects use Neural Networks to predict optimal actions based on user behavior patterns. A Continuous Learning mechanism enables real-time adaptation of the network to evolving user interactions. The study highlights the role of Cognitive Transformation in Personal IoT, driving intelligent automation and enhancing user experience. © 2024 Informa UK Limited, trading as Taylor & Francis Group.

关键词： Internet of Things (IoT) Personal Internet of Things (PIoT) Social Internet of Things (SIoT) IoT architecture network infrastructure

来源：评论

学校读者我要写书评

暂无评论

Enhancing Real-Time Object Detection with Optical Flow-Guided Streaming Perception

引用

IEEE Transactions on Circuits and Systems for Video technology 2025年第5期35卷 4816-4830页

作者： Wang, Tongbo Zhu, Lin Huang, Hua Beijing Institute of Technology School of Computer Science and Technology Beijing100081 China Beijing Institute of Technology School of Computer Science Beijing100081 China Beijing Normal University School of Artificial Intelligence Beijing100875 China

Real-time object detection in Unmanned Aerial Vehicle (UAV) videos remains a significant challenge due to the fast motion and small scale of objects. Existing streaming perception models struggle to accurately capture fine-grained motion cues between consecutive frames, leading to suboptimal performance in dynamic UAV scenarios. To address these challenges, Stream-Flow is proposed to integrate optical flow information and enhance real-time object detection in UAV videos. StreamFlow incorporates Flow-Guided Dynamic Prediction (FGDP) to refine position predictions using local optical flow information and Optical Flow Guided Optimization (OFGO) to optimize model parameters considering both localization loss and optical flow reliability. Central to OFGO is the Adaptive Flow Weighting (AFW) module, which focuses on reliable flow samples during training. The proposed integration of optical flow and adaptive weighting scheme significantly enhances the ability of streaming perception models to handle fast-moving objects in dynamic UAV environments. Extensive experiments on four challenging UAV video datasets demonstrate the superior performance of StreamFlow compared to state-of-the-art methods in terms of accuracy. © 1991-2012 IEEE.

关键词： Stream flow

来源：评论

学校读者我要写书评

暂无评论

DRMSpell:dynamically reweighting multimodality for Chinese spelling correction

引用

Frontiers of Information technology & Electronic Engineering 2025年第3期26卷 354-366页

作者： Yinghao LI Heyan HUANG Baojun WANG Yang GAO School of Computer Science and Technology Beijing Institute of TechnologyBeijing 100081China Southeast Academy of Information Technology Beijing Institute of TechnologyPutian 351100China Huawei Noahs Ark Lab Shenzhen 518129China

Chinese spelling correction(CSC)is a task that aims to detect and correct the spelling errors that may occur in Chinese ***,the Chinese language exhibits a high degree of complexity,characterized by the presence of multiple phonetic representations known as pinyin,which possess distinct tonal variations that can correspond to various *** the complexity inherent in the Chinese language,the CSC task becomes imperative for ensuring the accuracy and clarity of written *** research has included external knowledge into the model using phonological and visual ***,these methods do not effectively target the utilization of modality information to address the different types of *** this paper,we propose a multimodal pretrained language model called DRMSpell for CSC,which takes into consideration the interaction between the modalities.A dynamically reweighting multimodality(DRM)module is introduced to reweight various modalities for obtaining more multimodal *** fully use the multimodal information obtained and to further strengthen the model,an independent-modality masking strategy(IMS)is proposed to independently mask three modalities of a token in the pretraining *** method achieves state-of-the-art performance on most metrics constituting widely used *** findings of the experiments demonstrate that our method is capable of modeling the interactive information between modalities and is also robust to incorrect modal information.

关键词： Chinese spelling correction Multimodality Masking strategy

来源：评论

学校读者我要写书评

暂无评论

Research Progress in Solar Flare Prediction Methods

引用

Research in Astronomy and Astrophysics 2025年第3期25卷 280-309页

作者： Ke Han Zhen Liu Xian-Yi Zhao Yi-Fei Li De-Quan Zheng Jie Wan School of Computer and Information Engineering Harbin University of Commerce Faculty of Computing Harbin Institute of Technology School of Energy Science and Engineering Harbin Institute of Technology

Solar flares are one of the strongest outbursts of solar activity,posing a serious threat to Earth’s critical infrastructure,such as communications,navigation,power,and ***,it is essential to accurately predict solar flares in order to ensure the safety of human ***,the research focuses on two directions:first,identifying predictors with more physical information and higher prediction accuracy,and second,building flare prediction models that can effectively handle complex observational *** terms of flare observability and predictability,this paper analyses multiple dimensions of solar flare observability and evaluates the potential of observational parameters in *** flare prediction models,the paper focuses on data-driven models and physical models,with an emphasis on the advantages of deep learning techniques in dealing with complex and high-dimensional *** reviewing existing traditional machine learning,deep learning,and fusion methods,the key roles of these techniques in improving prediction accuracy and efficiency are *** prevailing challenges,this study discusses the main challenges currently faced in solar flare prediction,such as the complexity of flare samples,the multimodality of observational data,and the interpretability of *** conclusion summarizes these findings and proposes future research directions and potential technology advancement.

关键词： (Sun:) sunspots magnetohydrodynamics (MHD) Sun: activity Sun: flares Sun: magnetic fields

来源：评论

学校读者我要写书评

暂无评论

Exploring image data association: A hybrid mining approach

引用

Multimedia Tools and Applications 2025年第9期84卷 5725-5740页

作者： Parashar, Nishtha Tiwari, Akhilesh Gupta, Rajendra Kumar Department of Computer Science and Engineering Madhav Institute of Technology & Science M.P. Gwalior India Department of Information Technology Madhav Institute of Technology & Science M.P. Gwalior India

In this paper, a new approach for mining image association rules is presented, which involves the fine-tuned CNN model, as well as the proposed FIAR and OFIAR algorithms. Initially, the image transactional database is generated using feature vectors obtained from the fine-tuned CNN architecture. The proposed FIAR algorithm is used to generate hash-indexed image association rules, which are further optimized using the proposed OFIAR algorithm. This methodology combines the strengths of the CNN model to extract histogram features from images, the FIAR algorithm to efficiently mine frequent image itemsets, and the OFIAR algorithm to optimize image association rules. The proposed methodology can be used to discover hidden relationships among images, leading to new insights in image processing and analysis. Efficient results were obtained with a minimum support of 0.50 and a minimum confidence of 0.50. Experiments were performed on the fruits image dataset consisting of 2618 images from six different classes, and the results show that image mining is feasible and can produce strong optimized image association rules that can be further used for classification purposes. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2024.

关键词： Association rules

来源：评论

学校读者我要写书评

暂无评论

Halo reduction multi-exposure image fusion technique

引用

Multimedia Tools and Applications 2025年第13期84卷 12347-12370页

作者： Sharif, Rizwan Amin, Benish Sukhia, Komal Nain Department of Electrical Engineering Institute of Space Technology Islamabad Pakistan Department of Computer Science Institute of Space Technology Islamabad Pakistan

Multi-exposure image fusion (MEF) involves combining images captured at different exposure levels to create a single, well-exposed fused image. MEF has a wide range of applications, including low light, low contrast, night photography, medical imaging, and remote sensing. However, MEF methods often face issues like artifacts, halos around edges, color inconsistencies, noise amplification, and difficulty in preserving fine details. Moreover, assessing the quality of fused images objectively is complex due to the subjective nature of human perception. Solving the challenges is essential to developing efficient MEF techniques that produce high-quality results across various scenarios. The proposed technique introduces an approach to handling halo artifacts and implementing MEF. The Dense Scale-Invariant Feature Transform (DSIFT) is used to capture vital information about image brightness, texture, and edges from source images. Three weight maps are computed from the local mean, signal strength, and the global gradient for initial weight estimation. The local mean represents the brightness of specific image areas, signal strength preserves essential details like textures and edges while reducing image noise, and global gradient helps identify regions with significant pixel value shifts across multiple exposure images. The weight maps are then combined using a weighted average and a Gaussian smoothing filter is applied to reduce inherent noise and discontinuities in the original weights. Subsequently, pyramid decomposition is performed to generate a fused image. The efficiency of the proposed method is extensively tested on challenging multi-exposure image sequences. The results of the proposed approach demonstrate its superiority in both subjective evaluation and objective metrics like the MEF-Structural Similarity Index (MEF-SSIM), Natural Image Quality Evaluator (NIQE) and Gradient based performance measure (QAB/f). © The Author(s), under exclusive licence to Springer

关键词： Image fusion

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：