Search results - Inner Mongolia University Library

Iraqi Journal for Computer Science and Mathematics, 2024, Vol. 5, No. 1, pp. 160-167

Authors: Abdalrada, Ahmad Shaker; Neamah, Ali Fahem; Murad, Hayder. Affiliations: Department of Software, Faculty of Computer Science and Information Technology, Wasit University, Iraq; Department of Computer, Faculty of Computer Science and Information Technology, Wasit University, Iraq; College of Medicine, Wasit University, Iraq

Diabetes is prevalent worldwide, and predicting its progression is crucial. Several models have been proposed to predict the disease, but they only determine the disease label, leaving the likelihood of developing the disease unclear. A model for predicting the progression of the disease is therefore essential. This article proposes a logistic regression model to anticipate the likelihood of diabetes incidence. The model exploits the capabilities of logistic regression by using the sigmoid function. Its performance was evaluated on the Pima Indians Diabetes dataset and demonstrated high accuracy, sensitivity, and specificity. The prediction accuracy rate was 77.6%, with a sensitivity of 72.4%, specificity of 79.6%, Type I Error of 27.6%, and Type II Error of 20.4%. Furthermore, the model indicates the feasibility of using laboratory tests, such as Pregnancies, Glucose, Blood Pressure, BMI, and DiabetesPedigreeFunction, to predict disease progress. The proposed model can aid patients and physicians in understanding the disease's progression and implementing timely interventions. © 2024 College of Education, Al-Iraqia University. All rights reserved.
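As a minimal sketch of the idea in this abstract, the snippet below shows how a logistic regression model turns feature values into a likelihood through the sigmoid function, rather than only a label. The coefficient values, the bias, the example patient, and the 0.5 threshold are hypothetical placeholders chosen for illustration; they are not the fitted values from the article.

```python
import math


def sigmoid(z):
    """Map any real-valued score to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


def predict_probability(features, weights, bias):
    """Logistic regression: sigmoid of a weighted sum of the features."""
    z = bias + sum(w * x for w, x in zip(weights, features))
    return sigmoid(z)


# Feature order follows the predictors named in the abstract.
FEATURES = ["Pregnancies", "Glucose", "BloodPressure", "BMI",
            "DiabetesPedigreeFunction"]

# Hypothetical coefficients, NOT taken from the article.
weights = [0.12, 0.035, -0.013, 0.09, 0.9]
bias = -8.0

# Hypothetical patient record, in the same order as FEATURES.
patient = [2, 130, 70, 31.0, 0.45]

probability = predict_probability(patient, weights, bias)
label = int(probability >= 0.5)  # assumed decision threshold
print(f"estimated likelihood: {probability:.3f}, label: {label}")
```

The probability is what distinguishes this approach from a label-only classifier: the same score can be reported directly or thresholded when a yes/no decision is needed.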

Keywords: Forecasting

Channel pruning on frequency response

Science China (Information Sciences), 2025, Vol. 68, No. 1, pp. 159-170

Authors: Hang LIN; Yifan PENG; Lin BIE; Chenggang YAN; Xibin ZHAO; Yue GAO. Affiliations: School of Software, Tsinghua University; School of Automation, Hangzhou Dianzi University

Network pruning plays a significant role in reducing network parameters and accelerating network inference. Some existing methods prune the network based on the frequency of the data and obtain a sub-network with high accuracy. However, according to our experimental analysis, different frequencies of information in the data contribute differently to the accuracy of the model, and using this information directly for pruning without making a selection will lead to incorrect results. We believe that pruning should retain the convolutional kernels in the network that process important information, while kernels that process unimportant information should be removed. In this paper, we first investigate the meaning of the information in each frequency band of the spectrum and its contribution to the prediction accuracy of the network, and according to these results, we propose a new pruning method based on frequency response (PFR). Our PFR finds and removes the convolutional kernels in the network that specialize in processing unimportant information, resulting in a compact neural network model. PFR obtains significant experimental results on different datasets, for example, a 56.0% reduction of floating-point operations (FLOPs) on ResNet-50 with only 0.37% Top-1 accuracy degradation on the ImageNet dataset.
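The abstract relies on separating data into frequency bands before judging how much each band matters. The sketch below illustrates only that first step on a 1-D signal, using a naive discrete Fourier transform from the standard library. It is not the PFR method: the function names, the band boundaries, and the toy signal are assumptions made for illustration, and a real study would work on images and measure model accuracy per band.

```python
import cmath
import math


def dft(signal):
    """Naive O(n^2) discrete Fourier transform."""
    n = len(signal)
    return [sum(signal[t] * cmath.exp(-2j * math.pi * k * t / n)
                for t in range(n))
            for k in range(n)]


def idft(spectrum):
    """Inverse transform; returns the real part of each sample."""
    n = len(spectrum)
    return [sum(spectrum[k] * cmath.exp(2j * math.pi * k * t / n)
                for k in range(n)).real / n
            for t in range(n)]


def keep_band(signal, low, high):
    """Keep only bins whose frequency index lies in [low, high].

    Bin k and bin n - k carry the same frequency for a real signal,
    so the frequency index is min(k, n - k).
    """
    n = len(signal)
    spectrum = dft(signal)
    filtered = [c if low <= min(k, n - k) <= high else 0
                for k, c in enumerate(spectrum)]
    return idft(filtered)


# Toy signal: one slow component (frequency 1) plus one fast one (frequency 8).
n = 32
slow = [math.cos(2 * math.pi * 1 * t / n) for t in range(n)]
fast = [math.cos(2 * math.pi * 8 * t / n) for t in range(n)]
mixed = [s + f for s, f in zip(slow, fast)]

low_only = keep_band(mixed, 0, 3)    # recovers the slow component
high_only = keep_band(mixed, 4, 16)  # recovers the fast component
```

Feeding a model inputs restricted to one band at a time is one simple way to probe how much that band contributes to accuracy, which is the kind of analysis the abstract describes before pruning.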

Keywords: deep learning; model compression; filter pruning; channel pruning; frequency response

Visual Perturbation for Text-Based Person Search

39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025

Authors: Zhang, Pengcheng; Yu, Xiaohan; Bai, Xiao; Zheng, Jin. Affiliations: School of Computer Science and Engineering, State Key Laboratory of Complex & Critical Software Environment, Jiangxi Research Institute, Beihang University, Beijing, China; School of Computing, Macquarie University, Sydney, Australia

ISBN: (print) 157735897X

Text-based person search aims at locating a person described by natural language in uncropped scene images. Recent works for TBPS mainly focus on aligning multi-granularity vision and language representations, neglecting a key discrepancy between training and inference where the former learns to unify vision and language features where the visual side covers all clues described by language, yet the latter matches image-text pairs where the images may capture only part of the described clues due to perturbations such as occlusions, background clutters and misaligned boundaries. To alleviate this issue, we present ViPer: a Visual Perturbation network that learns to match language descriptions with perturbed visual clues. On top of a CLIP-driven baseline, we design three visual perturbation modules: (1) Spatial ViPer that varies person proposals and produces visual features with misaligned boundaries, (2) Attentive ViPer that estimates visual attention on the fly and manipulates attentive visual tokens within a proposal to produce global features under visual perturbations, and (3) Fine-grained ViPer that learns to recover masked visual clues from detailed language descriptions to encourage matching language features with perturbed visual features at the fine granularity. This overall framework thus simulates real-world scenarios at the training stage to minimize the discrepancy and improve the generalization ability of the model. Experimental results demonstrate that the proposed method clearly surpasses previous TBPS methods on the PRW-TBPS and CUHK-SYSU-TBPS datasets. Copyright © 2025, Association for the Advancement of Artificial Intelligence (***). All rights reserved.

AMS Publications Support for Open, Transparent, and Equitable Research

Journal of the Atmospheric Sciences, 2023, Vol. 80, No. 11, pp. 2585-2586

Authors: Schuster, Douglas; Friedman, Michael. Affiliations: Chair of AMS Board on Open Science Data and Software; AMS Publications Associate Director–Publishing Technology

STDNet: Improved lip reading via short-term temporal dependency modeling

Virtual Reality & Intelligent Hardware (虚拟现实与智能硬件(中英文)), 2025, Vol. 7, No. 2, pp. 173-187

Authors: Xiaoer WU; Zhenhua TAN; Ziwei CHENG; Yuran RU. Affiliations: Software College, Northeastern University, Shenyang 110819, China; Faculty of Software College, Northeastern University, Shenyang 110819, China

Background Lip reading uses lip images for visual speech ***-learning-based lip reading has greatly improved performance in current datasets; however, most existing research ignores the significance of short-term temporal dependencies of lip-shape variations between adjacent frames, which leaves space for further improvement in feature *** This article presents a spatiotemporal feature fusion network (STDNet) that compensates for the deficiencies of current lip-reading approaches in short-term temporal dependency ***, to distinguish more similar and intricate content, STDNet adds a temporal feature extraction branch based on a 3D-CNN, which enhances the learning of dynamic lip movements in adjacent frames while not affecting spatial feature *** particular, we designed a local–temporal block, which aggregates interframe differences, strengthening the relationship between various local lip regions through multiscale *** incorporated the squeeze-and-excitation mechanism into the Global-Temporal Block, which processes a single frame as an independent unit to learn temporal variations across the entire lip region more ***, attention pooling was introduced to highlight meaningful frames containing key semantic information for the target *** Experimental results demonstrated STDNet's superior performance on the LRW and LRW-1000, achieving word-level recognition accuracies of 90.2% and 53.56%, *** ablation experiments verified the rationality and effectiveness of its *** The proposed model effectively addresses short-term temporal dependency limitations in lip reading, and improves the temporal robustness of the model against variable-length *** advancements validate the importance of explicit short-term dynamics modeling for practical lip-reading systems.

Keywords: Lip reading; Spatio-temporal feature fusion; Short-term temporal dependency modeling

DOMAINEVAL: An Auto-Constructed Benchmark for Multi-Domain Code Generation

39th Annual AAAI Conference on Artificial Intelligence, AAAI 2025

Authors: Zhu, Qiming; Cao, Jialun; Lu, Yaojie; Lin, Hongyu; Han, Xianpei; Sun, Le; Cheung, Shing-Chi. Affiliations: Chinese Information Processing Laboratory, Institute of Software, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; The Hong Kong University of Science and Technology, Hong Kong

ISBN: (print) 157735897X

Code benchmarks such as HumanEval are widely adopted to evaluate capabilities of Large Language Models (LLMs), providing insights into their strengths and weaknesses. However, current benchmarks primarily exercise LLMs’ capability on common coding tasks (e.g., bubble sort, greatest common divisor), leaving domain-specific coding tasks (e.g., computation, system, cryptography) unexplored. To fill this gap, we propose a multi-domain code benchmark, DOMAINEVAL, designed to evaluate LLMs’ coding capabilities thoroughly. Our pipeline works in a fully automated manner, enabling a push-button pipeline from code repositories into formatted subjects under study. Interesting findings are observed by evaluating 12 representative LLMs against DOMAINEVAL. We notice that LLMs are generally good at computation tasks while falling short on cryptography and system coding tasks. The performance gap can be as much as 68.94% (80.94% - 12.0%) in some LLMs. We also observe that generating more samples can increase the overall performance of LLMs, while the domain bias may even increase. The contributions of this study include a code generation benchmark dataset DOMAINEVAL, encompassing six popular domains, a fully automated pipeline for constructing code benchmarks, and an identification of the limitations of LLMs in code generation tasks based on their performance on DOMAINEVAL, providing directions for future research improvements. © 2025, Association for the Advancement of Artificial Intelligence (***). All rights reserved.

Keywords: Multiprogramming

Sentence-Guided Comment Tree Fusion for Fake News Detection

4th International Conference on Artificial Intelligence, Robotics, and Communication, ICAIRC 2024

Authors: Xiong, Shiyong; Huang, Tingting; Xie, Huajian; Shen, Jun. Affiliations: Chongqing University of Posts and Telecommunications, School of Software Engineering, Chongqing, China; Chongqing Education Science Research Institute, School of Software Engineering, Chongqing, China

ISBN: (print) 9798331531225

The rapid spread of fake news significantly impacts social cognition and media credibility, making the effective detection of fake news a critical issue. This paper proposes a fake news detection method based on a comment tree structure. By constructing a hierarchical comment tree and integrating cosine similarity filtering with a cross-attention mechanism, the method optimizes the comment selection process. Finally, a gated mechanism is employed to fuse the features of news sentences and comment information, thereby improving the accuracy of fake news detection. Experiments conducted on two public datasets, FakeNewsNet and BuzzFace, demonstrate that the proposed method achieves significant performance improvements in fake news detection compared to baseline models. © 2024 IEEE.
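Two of the building blocks named in this abstract, cosine similarity filtering and gated fusion, can be sketched in a few lines. The version below is an illustration under stated assumptions, not the authors' implementation: the vectors stand in for learned embeddings, the 0.5 threshold is arbitrary, and the gate is taken as given rather than produced by a trained layer. The comment tree and cross-attention steps are not modelled.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def filter_comments(sentence_vec, comment_vecs, threshold=0.5):
    """Return indices of comments similar enough to the news sentence."""
    return [i for i, vec in enumerate(comment_vecs)
            if cosine_similarity(sentence_vec, vec) >= threshold]


def gated_fusion(news_vec, comment_vec, gate_logits):
    """Blend two feature vectors element-wise.

    Each gate g = sigmoid(logit) weights the news feature, and
    1 - g weights the comment feature (an assumed, common gate form).
    """
    gates = [1.0 / (1.0 + math.exp(-g)) for g in gate_logits]
    return [g * n + (1.0 - g) * c
            for g, n, c in zip(gates, news_vec, comment_vec)]


# Hypothetical embeddings for one news sentence and three comments.
sentence = [1.0, 0.0]
comments = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]

kept = filter_comments(sentence, comments, threshold=0.5)
fused = gated_fusion([1.0, 0.0], [0.0, 1.0], gate_logits=[0.0, 0.0])
```

With gate logits of zero, every gate equals 0.5, so the fused vector is the plain average of the two inputs; a trained gate would shift that balance per feature.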

Keywords: Fake detection

A New End-to-End Image Contrastive Clustering Based on Mining Latent Positive Samples

7th International Conference on Machine Learning and Natural Language Processing, MLNLP 2024

Authors: Feng, Bo; Yan, Xi; Yan, Li; Luo, Bing; Pei, Zheng. Affiliations: School of Science, Xihua University, Chengdu, China; School of Computer Science and Software Engineering, Xihua University, Chengdu, China

ISBN: (print) 9798350354973

The integration of the contrastive learning paradigm into deep clustering has led to enhanced performance in image clustering. However, in existing research, samples in the same class as the target may still be treated as negative samples; this means that more semantic sample pairs cannot be constructed and the intra-class compactness may be compromised. In this paper, we propose a new contrastive clustering method based on mining latent positive samples (CCLPS), which utilizes the nearest neighbor relationship between samples to mine semantic positive samples in the same class. In more detail, by mining latent positive samples in both the feature and cluster spaces, CCLPS can obtain more cluster-friendly feature representations of samples to enhance clustering performance. Samples mined from the feature space fuel the cluster-level contrastive learning task, while samples mined from the cluster space fuel the instance-level task. Three image datasets, CIFAR-10, CIFAR-100 and Tiny-ImageNet, are employed to experimentally demonstrate the usefulness and effectiveness of the proposed method in image clustering. Transfer learning based on mining latent positive samples is also discussed, and comparisons are made with 19 representative clustering methods on the datasets. Experimental results indicate that the proposed method can serve as a pre-training model for transfer learning and exhibits commendable feature modeling capabilities. © 2024 IEEE.
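The nearest neighbor mining step mentioned in this abstract can be illustrated with a small sketch. It assumes cosine similarity as the neighbor measure and a fixed neighbor count k, neither of which is stated in the abstract, and the function name is hypothetical. It covers only the selection of candidate positives in one space, not the contrastive losses.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def mine_latent_positives(features, k):
    """For each sample, return the indices of its k nearest neighbours.

    Neighbours are ranked by cosine similarity and the sample itself is
    excluded. These neighbours are treated as candidate positives
    instead of negatives.
    """
    positives = []
    for i, anchor in enumerate(features):
        scored = [(cosine_similarity(anchor, other), j)
                  for j, other in enumerate(features) if j != i]
        # Highest similarity first; ties resolved by lower index.
        scored.sort(key=lambda pair: (-pair[0], pair[1]))
        positives.append([j for _, j in scored[:k]])
    return positives


# Hypothetical 2-D features forming two loose groups.
features = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
neighbours = mine_latent_positives(features, k=1)
```

Applying the same routine to cluster-assignment vectors instead of feature vectors would give the second set of mined samples the abstract refers to.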

Keywords: Contrastive Learning

AMS Publications Support for Open, Transparent, and Equitable Research

Monthly Weather Review, 2023, Vol. 151, No. 11, pp. 2849-2850

Authors: Schuster, Douglas; Friedman, Michael. Affiliations: Chair of AMS Board on Open Science Data and Software; AMS Publications Associate Director–Publishing Technology

Knowledge-guided Network Pruning for EEG-based Emotion Recognition

2023 IEEE International Conference on Bioinformatics and Biomedicine, BIBM 2023

Authors: Rao, Wenjie; Zhong, Sheng-Hua; Zhang, Zhi; Liu, Yan. Affiliations: Shenzhen University, Guangdong Laboratory of Artificial Intelligence and Digital Economy, College of Computer Science and Software, Shenzhen, China; Hong Kong Polytechnic University, Department of Computer Science, Hong Kong

ISBN: (print) 9798350337488

With the development of deep learning in EEG-related tasks, the complexity of learning models has gradually increased. These complex models often result in long inference times, high energy consumption, and an increased risk of overfitting. Therefore, model compression has become an important consideration. Although some EEG models have used lightweight techniques, such as separable convolution, no existing work has directly attempted to compress EEG models to reduce their complexity. In this paper, we integrate neuroscience knowledge into EEG model pruning recovery, and innovatively propose two loss functions in the learning process, the knowledge-guided region-wise loss that enforces the classification evidence consistent with the importance of the prefrontal lobe, and the knowledge-guided sample-wise loss that constrains the learning process by distinguishing the importance of different samples. © 2023 IEEE.

Keywords: EEG-based emotion recognition knowledge-guided network pruning model compression
