Search results - Inner Mongolia University Library

PromptFusion: Harmonized Semantic Prompt Learning for Infrared and Visible Image Fusion

IEEE/CAA Journal of Automatica Sinica, 2025, Vol. 12, No. 3, pp. 502-515

Authors: Jinyuan Liu, Xingyuan Li, Zirui Wang, Zhiying Jiang, Wei Zhong, Wei Fan, Bin Xu. Affiliations: IEEE; the School of Software Technology, Dalian University of Technology; the School of Mechanical Engineering, Beijing Institute of Technology

The goal of infrared and visible image fusion (IVIF) is to integrate the unique advantages of both modalities to achieve a more comprehensive understanding of a scene. However, existing methods struggle to effectively handle modal disparities, resulting in visual degradation of the details and prominent targets of the fused images. To address these challenges, we introduce PromptFusion, a prompt-based approach that harmoniously combines multi-modality images under the guidance of semantic prompts. First, to better characterize the features of different modalities, a contourlet autoencoder is designed to separate and extract the high-/low-frequency components of different modalities, thereby improving the extraction of fine details and textures. We also introduce a prompt learning mechanism using positive and negative prompts, leveraging Vision-Language Models to improve the fusion model's understanding and identification of targets in multi-modality images, leading to improved performance in downstream tasks. Furthermore, we employ bi-level asymptotic convergence optimization. This approach simplifies the intricate non-singleton non-convex bi-level problem into a series of convergent and differentiable single optimization problems that can be effectively resolved through gradient *** approach advances the state-of-the-art, delivering superior fusion quality and boosting the performance of related downstream tasks. Project page: https://***/hey-it-s-me/PromptFusion.

Keywords: bi-level optimization; image fusion; infrared and visible image; prompt learning

MSCM-Net: Rail Surface Defect Detection Based on a Multi-Scale Cross-Modal Network

Computers, Materials & Continua, 2025, Vol. 82, No. 3, pp. 4371-4388

Authors: Xin Wen, Xiao Zheng, Yu He. Affiliation: School of Software Engineering, Shenyang University of Technology, Shenyang 110870, China

Detecting surface defects on unused rails is crucial for evaluating rail quality and durability to ensure the safety of rail ***, existing detection methods often struggle with challenges such as complex defect morphology, texture similarity, and fuzzy edges, leading to poor accuracy and missed *** order to resolve these problems, we propose MSCM-Net (Multi-Scale Cross-Modal Network), a multiscale cross-modal framework focused on detecting rail surface ***-Net introduces an attention mechanism to dynamically weight the fusion of RGB and depth maps, effectively capturing and enhancing features at different scales for each *** further enrich feature representation and improve edge detection in blurred areas, we propose a multi-scale void fusion module that integrates multi-scale feature *** improve cross-modal feature fusion, we develop a cross-enhanced fusion module that transfers fused features between layers to incorporate interlayer *** also introduce a multimodal feature integration module, which merges modality-specific features from separate decoders into a shared decoder, enhancing detection by leveraging richer complementary ***, we validate MSCM-Net on the NEU RSDDS-AUG RGB-depth dataset, comparing it against 12 leading methods, and the results show that MSCM-Net achieves superior performance on all metrics.

Keywords: surface defect detection; multiscale framework; cross-modal fusion; edge detection
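
The abstract above mentions an attention mechanism that dynamically weights the fusion of RGB and depth maps. The following is only a minimal sketch of that general idea, assuming each modality receives a single scalar relevance score that is turned into fusion weights with a softmax. The function names `softmax` and `fuse`, the scalar scores, and the toy feature vectors are all hypothetical; the listing does not describe how MSCM-Net actually computes its weights.

```python
import math


def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    m = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(rgb_feat, depth_feat, rgb_score, depth_score):
    """Weighted sum of two equally sized feature vectors.

    The scores are assumed inputs here; in a trained network they
    would be produced by learned layers.
    """
    w_rgb, w_depth = softmax([rgb_score, depth_score])
    return [w_rgb * r + w_depth * d for r, d in zip(rgb_feat, depth_feat)]


# Toy features (made up). Equal scores give equal weights.
fused_equal = fuse([1.0, 2.0], [3.0, 4.0], rgb_score=0.0, depth_score=0.0)
# A larger depth score pulls the result toward the depth features.
fused_depth = fuse([1.0, 2.0], [3.0, 4.0], rgb_score=0.0, depth_score=5.0)
```

With equal scores the fused vector is the plain average of the two inputs; raising one score shifts the result toward that modality.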

Adversarial Patterns: Building Robust Android Malware Classifiers

ACM Computing Surveys, 2025, Vol. 57, No. 8, pp. 1-34

Authors: Bhusal, Dipkamal; Rastogi, Nidhi. Affiliations: Department of Software Engineering, Rochester Institute of Technology, Rochester, NY, United States; Software Engineering, Rochester Institute of Technology, Rochester, NY, United States

Machine learning models are increasingly being adopted across various fields, such as medicine, business, autonomous vehicles, and cybersecurity, to analyze vast amounts of data, detect patterns, and make predictions or recommendations. In the field of cybersecurity, these models have made significant improvements in malware detection. However, despite their ability to understand complex patterns from unstructured data, these models are susceptible to adversarial attacks that perform slight modifications in malware samples, leading to misclassification from malignant to benign. Numerous defense approaches have been proposed to either detect such adversarial attacks or improve model robustness. These approaches have resulted in a multitude of attack and defense techniques and the emergence of a field known as 'adversarial machine learning.' In this survey paper, we provide a comprehensive review of adversarial machine learning in the context of Android malware classifiers. Android is the most widely used operating system globally and is an easy target for malicious agents. The paper first presents an extensive background on Android malware classifiers, followed by an examination of the latest advancements in adversarial attacks and defenses. Finally, the paper provides guidelines for designing robust malware classifiers and outlines research directions for the future. © 2025 Copyright held by the owner/author(s). Publication rights licensed to ACM.

Keywords: Adversarial machine learning

ASLP-DL—A Novel Approach Employing Lightweight Deep Learning Framework for Optimizing Accident Severity Level Prediction

Computers, Materials & Continua, 2024, Vol. 78, No. 2, pp. 2535-2555

Authors: Saba Awan, Zahid Mehmood. Affiliations: Department of Software Engineering, University of Engineering and Technology, Taxila 47050, Pakistan; Department of Computer Engineering, University of Engineering and Technology, Taxila 47050, Pakistan

Highway safety researchers focus on crash injury severity, utilizing deep learning, specifically deep neural networks (DNN), deep convolutional neural networks (D-CNN), and deep recurrent neural networks (D-RNN), as the preferred method for modeling accident *** learning's strength lies in handling intricate relationships within extensive datasets, making it popular for accident severity level (ASL) prediction and *** prior success, there is a need for an efficient system recognizing ASL in diverse road *** address this, we present an innovative Accident Severity Level Prediction Deep Learning (ASLP-DL) framework, incorporating DNN, D-CNN, and D-RNN models fine-tuned through iterative hyperparameter selection with Stochastic Gradient *** framework optimizes hidden layers and integrates data augmentation, Gaussian noise, and dropout regularization for improved *** and factor contribution analyses identify influential *** on three diverse crash record databases (NCDB 2018–2019, UK 2015–2020, and US 2016–2021), the D-RNN model excels with an ACC score of 89.0281%, a ROC area of 0.751, an F-estimate of 0.941, and a Kappa score of 0.0629 over the NCDB *** proposed framework consistently outperforms traditional methods, existing machine learning, and deep learning techniques.

Keywords: Injury severity prediction deep learning feature
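
The abstract above reports a Kappa score alongside accuracy. As a rough illustration, and assuming that "Kappa" refers to Cohen's kappa (the listing does not say so explicitly), the sketch below computes it from true and predicted class labels. The function name `cohen_kappa` and the toy severity labels are hypothetical and are not drawn from the paper's data.

```python
from collections import Counter


def cohen_kappa(y_true, y_pred):
    """Cohen's kappa: agreement between two labelings, corrected for chance."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("label lists must be non-empty and equally long")
    n = len(y_true)
    # Observed agreement: fraction of items with identical labels.
    observed = sum(t == p for t, p in zip(y_true, y_pred)) / n
    # Expected agreement if both labelings were independent.
    true_counts = Counter(y_true)
    pred_counts = Counter(y_pred)
    expected = sum(true_counts[c] * pred_counts[c] for c in true_counts) / (n * n)
    if expected == 1.0:
        return 1.0  # both labelings use one identical class throughout
    return (observed - expected) / (1.0 - expected)


# Toy severity levels (made up): 0 = minor, 1 = serious, 2 = fatal.
y_true = [0, 0, 0, 1, 1, 2]
y_pred = [0, 0, 1, 1, 1, 2]
kappa = cohen_kappa(y_true, y_pred)
```

A kappa near 0 means the predictions agree with the ground truth no better than chance would, even when raw accuracy looks high on imbalanced classes.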

E-PRedictor: an approach for early prediction of pull request acceptance

Science China (Information Sciences), 2025, Vol. 68, No. 5, pp. 380-395

Authors: Kexing CHEN, Lingfeng BAO, Xing HU, Xin XIA, Xiaohu YANG. Affiliations: State Key Laboratory of Blockchain and Data Security, Zhejiang University; Software Engineering Application Technology Lab

A pull request (PR) is an event in Git where a contributor asks project maintainers to review code they want to merge into a project. The PR mechanism greatly improves the efficiency of distributed software development in the open-source community. Nevertheless, the massive number of PRs in an open-source software (OSS) project increases the workload of developers. To reduce the burden on developers, many previous studies have investigated factors that affect the chance of PRs getting accepted and built prediction models based on these factors. However, most prediction models are built on data that only becomes available some time after PRs are submitted (e.g., comments on PRs), which limits their practical usefulness because integrators still need to spend a large amount of effort on inspecting PRs. In this study, we propose an approach named E-PRedictor (earlier PR predictor) to predict whether a PR will be merged when it is created. E-PRedictor combines three dimensions of manual statistical features (i.e., contributor profile, specific pull request, and project profile) and deep semantic features generated by BERT models based on the description and code changes of PRs. To evaluate the performance of E-PRedictor, we collect 475192 PRs from 49 popular open-source projects on GitHub. The experiment results show that our proposed approach can effectively predict whether a PR will be merged or not. E-PRedictor significantly outperforms the baseline models (e.g., Random Forest and VDCNN) built on manual features. In terms of F1@Merge, F1@Reject, and AUC (area under the receiver operating characteristic curve), the performance of E-PRedictor is 90.1%, 60.5%, and 85.4%, respectively.

Keywords: pull request; prediction model; GitHub
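
The abstract above reports separate F1 scores for the merged and rejected classes (F1@Merge and F1@Reject). A minimal sketch of how such per-class scores are usually computed is given here, assuming binary labels where 1 means merged and 0 means rejected. The function name `f1_for_class` and the toy labels are hypothetical and do not come from the paper.

```python
def f1_for_class(y_true, y_pred, positive):
    """F1 score when `positive` is treated as the class of interest."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no correct positive prediction, so F1 is zero
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy labels (made up): 1 = merged, 0 = rejected.
y_true = [1, 1, 1, 1, 0, 0]
y_pred = [1, 1, 1, 0, 0, 1]
f1_merge = f1_for_class(y_true, y_pred, positive=1)
f1_reject = f1_for_class(y_true, y_pred, positive=0)
```

Reporting both values matters when classes are imbalanced: a model can score well on the majority class (merged PRs) while doing noticeably worse on the minority class.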

Two stage-network: Automatic localization of Optic Disc (OD) and classification of glaucoma in fundus images using deep learning techniques

Multimedia Tools and Applications, 2025, Vol. 84, No. 14, pp. 12949-12977

Authors: Sheraz, Huma; Shehryar, Tehmina; Khan, Zuhaib Ahmed. Affiliations: Department of Software Engineering, Mirpur University of Science & Technology, Mirpur 10240, Pakistan; Department of Software Engineering, Capital University of Science & Technology, Islamabad, Pakistan; CareCloud, Islamabad, Pakistan

Glaucoma is an ophthalmic disorder which results in permanent vision loss because high intraocular pressure damages the optic nerve in the eye. This paper proposes a two-stage network for automated glaucoma identification utilizing fundus images. In the first stage, Yolo-v4 is used to locate and extract the optic disc from a retinal fundus image, and ResNet-101 is used in the second stage to determine whether the retrieved disc is glaucomatous or healthy. Unfortunately, none of the publicly accessible retinal fundus image datasets contain the necessary bounding box ground truth for disc localization. In this regard, a semi-automatic ground truth creation strategy has been proposed that gives the essential annotations enabling the training of the Yolo-v4 based model for autonomous disc localization. The proposed method is evaluated on the publicly available ORIGA dataset. The proposed automated OD localization showed good results with 87.4% accuracy, 89.79% precision and 88.7% recall. The proposed glaucoma diagnosis module attained good results with 88.5% accuracy and an AUC of 0.920. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

Keywords: Computer aided diagnosis

Multi-Level Parallel Network for Brain Tumor Segmentation

Computer Modeling in Engineering & Sciences, 2024, Vol. 139, No. 4, pp. 741-757

Authors: Juhong Tie, Hui Peng. Affiliation: School of Software Engineering, Chengdu University of Information Technology, Chengdu 610225, China

Accurate automatic segmentation of gliomas in various sub-regions, including peritumoral edema, necrotic core, and enhancing and non-enhancing tumor core from 3D multimodal MRI images, is challenging because of its highly heterogeneous appearance and *** convolution neural networks (CNNs) have recently improved glioma segmentation ***, extensive down-sampling such as pooling or strided convolution in CNNs significantly decreases the initial image resolution, resulting in the loss of accurate spatial and object parts information, especially information on the small sub-region tumors, affecting segmentation ***, this paper proposes a novel multi-level parallel network comprising three different level parallel subnetworks to fully use low-level, mid-level, and high-level information and improve the performance of brain tumor *** also introduce the Combo loss function to address input class imbalance and false positives and negatives imbalance in deep *** proposed method is trained and validated on the BraTS 2020 training and validation *** the validation dataset, our method achieved a mean Dice score of 0.907, 0.830, and 0.787 for the whole tumor, tumor core, and enhancing tumor core, *** with state-of-the-art methods, the multi-level parallel network has achieved competitive results on the validation dataset.

Keywords: convolution neural network; brain tumor segmentation; parallel network
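
The segmentation results in the abstract above are reported as Dice scores. For readers unfamiliar with the metric, here is a minimal sketch of the standard Dice coefficient, 2|A∩B| / (|A| + |B|), on flattened binary masks. The function name `dice_score`, the handling of empty masks, and the toy masks are assumptions made for illustration; this is not the evaluation code used for BraTS 2020.

```python
def dice_score(pred, target):
    """Dice coefficient for two flat binary masks of 0/1 values."""
    if len(pred) != len(target):
        raise ValueError("masks must have the same length")
    intersection = sum(p * t for p, t in zip(pred, target))
    total = sum(pred) + sum(target)
    # Assumed convention: two empty masks count as a perfect match.
    if total == 0:
        return 1.0
    return 2.0 * intersection / total


# Toy masks (made up): 1 marks a voxel labelled as tumor.
pred = [1, 1, 0, 0, 1, 0]
target = [1, 0, 0, 0, 1, 1]
score = dice_score(pred, target)
```

A score of 1.0 means the predicted and reference regions coincide exactly, while 0.0 means they do not overlap at all.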

GPIO-Based Continuous Sliding Mode Control for Networked Control Systems Under Communication Delays With Experiments on Servo Motors

IEEE/CAA Journal of Automatica Sinica, 2025, Vol. 12, No. 1, pp. 99-113

Authors: Kamal Rsetam, Zhenwei Cao, Zhihong Man, Xian-Ming Zhang. Affiliations: the School of Software and Electrical Engineering, Swinburne University of Technology; the Department of Automated Manufacturing, Al Khwarizmi College of Engineering, University of Baghdad; IEEE

To handle input and output time delays that commonly exist in many networked control systems (NCSs), a new robust continuous sliding mode control (CSMC) scheme is proposed for output tracking in uncertain single-input single-output (SISO) networked control systems. This scheme consists of three consecutive steps. First, although the network-induced delay in those systems can be effectively handled by using Padé approximation (PA), the unmatched disturbance emerges as another difficulty in the control design. Second, to actively estimate this unmatched disturbance, a generalized proportional integral observer (GPIO) technique is utilized based on only one measured state. Third, by constructing a new sliding manifold with the aid of the estimated unmatched disturbance and states, a GPIO-based CSMC is synthesized, which is employed to cope with not only matched and unmatched disturbances, but also network-induced delays. The stability of the entire closed-loop system under the proposed GPIO-based CSMC is detailedly *** promising tracking efficiency and feasibility of the proposed control methodology are verified through simulations and experiments on Quanser's servo module for motion control under various test conditions.

Keywords: continuous sliding mode control (CSMC); generalized proportional integral observer (GPIO); networked control systems (NCSs); Padé approximation (PA); time-delay; unmatched disturbances
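
The abstract above handles network-induced delay with a Padé approximation. As a small illustration, the sketch below compares an exact delay term exp(-τs) with its first-order Padé approximant (1 - τs/2) / (1 + τs/2) at two frequencies. The choice of first order, the delay value, the test frequencies, and the function names are assumptions for illustration only; the listing does not state which order the paper uses.

```python
import cmath


def delay_exact(s, tau):
    """Transfer function of a pure time delay, exp(-tau * s)."""
    return cmath.exp(-tau * s)


def delay_pade1(s, tau):
    """First-order Pade approximant of exp(-tau * s)."""
    return (1 - tau * s / 2) / (1 + tau * s / 2)


tau = 0.05  # assumed delay in seconds (made up)

# Evaluate on the imaginary axis s = j*omega at a low and a high frequency.
s_low = 1j * 2.0    # omega = 2 rad/s
s_high = 1j * 40.0  # omega = 40 rad/s

err_low = abs(delay_exact(s_low, tau) - delay_pade1(s_low, tau))
err_high = abs(delay_exact(s_high, tau) - delay_pade1(s_high, tau))
gain_high = abs(delay_pade1(s_high, tau))
```

The approximant keeps unit gain at every frequency, as a true delay does, so the error is purely in phase; it is small when ω·τ is small and grows as the product approaches and exceeds 1.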

An Intelligent Privacy Protection Scheme for Efficient Edge Computation Offloading in IoV

Chinese Journal of Electronics, 2024, Vol. 33, No. 4, pp. 910-919

Authors: Liang YAO, Xiaolong XU, Wanchun DOU, Muhammad Bilal. Affiliations: School of Software, Nanjing University of Information Science and Technology; State Key Laboratory for Novel Software Technology, Nanjing University; Department of Computer and Electronics Systems Engineering, Hankuk University of Foreign Studies

As a pivotal enabler of intelligent transportation systems (ITS), the Internet of vehicles (IoV) has aroused extensive attention from academia and industry. The exponential growth of computation-intensive, latency-sensitive, and privacy-aware vehicular applications in IoV results in the transformation from cloud computing to edge computing, which enables tasks to be offloaded to edge nodes (ENs) closer to vehicles for efficient execution. In the ITS environment, however, due to dynamic and stochastic computation offloading requests, it is challenging to efficiently orchestrate offloading decisions for application requirements. How to accomplish complex computation offloading of vehicles while ensuring data privacy remains challenging. In this paper, we propose an intelligent computation offloading with privacy protection scheme, named COPP. In particular, an Advanced Encryption Standard-based encryption method is utilized to implement privacy protection. Furthermore, an online offloading scheme is proposed to find optimal offloading policies. Finally, experimental results demonstrate that COPP significantly outperforms benchmark schemes in the performance of both delay and energy consumption.

Keywords: Industries; Privacy; Energy consumption; Transportation; Computational efficiency; Encryption; Protection

Deep learning-based open API recommendation for Mashup development

Science China (Information Sciences), 2023, Vol. 66, No. 7, pp. 94-111

Authors: Ye WANG, Junwu CHEN, Qiao HUANG, Xin XIA, Bo JIANG. Affiliations: School of Computer and Information Engineering, Zhejiang Gongshang University; Software Engineering Application Technology Lab

Mashup developers often need to find open application programming interfaces (APIs) for their composition application development. Although most enterprises and service organizations have encapsulated their businesses or resources online as open APIs, finding the right high-quality open APIs is not an easy task from a library with several open APIs. To solve this problem, this paper proposes a deep learning-based open API recommendation (DLOAR) approach. First, a topic model based on hierarchical density-based spatial clustering of applications with noise (HDBSCAN) is constructed to build topic models for Mashup clusters. Second, developers' requirement keywords are extracted by the TextRank algorithm, and the language model is built. Third, a neural network-based three-level similarity calculation is performed to find the most relevant open APIs. Finally, we complement the relevant information of open APIs in the recommended list to help developers make better choices. We evaluate the DLOAR approach on a real dataset and compare it with commonly used open API recommendation approaches: term frequency-inverse document frequency, latent Dirichlet allocation, Word2Vec, and Sentence-BERT. The results show that the DLOAR approach has better performance than the other approaches in terms of precision, recall, F1-measure, mean average precision, and mean reciprocal rank.

Keywords: Mashup development; open API recommendation; deep learning; neural network; service discovery
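
One of the baselines named in the abstract above is term frequency-inverse document frequency (TF-IDF). The sketch below shows how such a baseline might rank API descriptions against a requirement text using cosine similarity. It illustrates the baseline only, not the DLOAR approach itself; the smoothed IDF formula, the function names, and the three toy API descriptions are assumptions made for this example.

```python
import math
from collections import Counter


def build_idf(docs):
    """Smoothed inverse document frequency for every term in the corpus."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    return {t: math.log((1 + n) / (1 + d)) + 1.0 for t, d in df.items()}


def vectorize(tokens, idf):
    """Sparse TF-IDF vector; terms unknown to the corpus are ignored."""
    tf = Counter(t for t in tokens if t in idf)
    return {t: c * idf[t] for t, c in tf.items()}


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return 0.0 if na == 0 or nb == 0 else dot / (na * nb)


# Toy API descriptions (made up), already tokenized.
apis = {
    "maps": "map geocoding location route".split(),
    "weather": "weather forecast temperature location".split(),
    "payments": "payment card checkout invoice".split(),
}
idf = build_idf(list(apis.values()))
query = vectorize("route map for a location".split(), idf)
ranking = sorted(
    apis,
    key=lambda name: cosine(query, vectorize(apis[name], idf)),
    reverse=True,
)
```

Because TF-IDF matches only on shared surface terms, a requirement phrased with synonyms would score zero against a relevant API, which is the kind of gap that embedding-based approaches such as Word2Vec or Sentence-BERT aim to close.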
