检索结果-内蒙古大学图书馆

Double-branch fusion network with a parallel attention selection mechanism for camouflaged object detection

Science China(information Sciences) 2023年第6期66卷 258-266页

作者： Junjiang XIANG Qing PAN Zhengrong ZHANG Songnian FU Yuwen QIN Advanced Institute of Photonics Technology School of Information EngineeringGuangdong University of Technology Guangdong Provincial Key Laboratory of Photonics Information Technology Guangdong University of Technology Guangxi Key Laboratory of Multimedia Communications and Network Technology School of Computer Electronics and InformationGuangxi University

To meet the challenge of camouflaged object detection （COD）,which has a high degree of intrinsic similarity between the object and background,this paper proposes a double-branch fusion network（DBFN）with a parallel attention selection mechanism （PASM）.In detail,a schismatic receptive field block（SRF）combined with an attention mechanism for low-level information is performed to learn texture features in one branch,and an integration of the SRF,a hybrid attention mechanism （HAM）,and a depth feature polymerization module （DFPM）is employed for high-level information to extract detection features in the other ***,both texture features and detection features are input into the PASM to acquire selective expression ***,the final result is obtained after further selective matrix optimization with atrous spatial pyramid pooling （ASPP）and a residual channel attention block （RCAB）being applied *** results on three public datasets verify that our method outperforms the state-of-the-art methods in terms of four evaluation metrics,i.e.,mean absolute error （MAE）,weighted F βmeasure （Fβω）,structural measure （Sα）,and E-measure （Eφ）

关键词： camouflaged object detection attention mechanism feature extraction feature aggregation texture information fuzzy boundary

来源：评论

学校读者我要写书评

暂无评论

IensNet: A novel and efficient approach for iris spoof detection via ensemble of deep models

引用

Multimedia Tools and Applications 2025年 1-30页

作者： Sharma, Deepika Selwal, Arvind School of Computer Science Engineering and Technology Bennett University Greater Noida201310 India Department of Computer Science and Information Technology Central University of Jammu Samba India

Iris biometrics allow contactless authentication, which makes it widely deployed human recognition mechanisms since the couple of years. Susceptibility of iris identification systems remains a challenging task due to diversity in spoof or presentation attacks (PAs) that fails to assure consistency while adopting them in real life scenarios. Hence, iris PAs are the growing concerns that gained significant attention in recent past decade. To alleviate these attacks or recognize presentation attack instruments (PAIs), iris presentation attacks detection (IPAD) algorithms are designed to distinguish a real and fabricated iris trait. Aiming at the efficient iris spoof detection mechanism, in this research work we expound a novel ensemble learning-enabled model (IensNet) that learns three pre-trained and fined-tuned deep models (i.e. DenseNet161, ResNet and VGGNet) for better accuracy and generalized performance. The novel IensNet approach offers several merits (i.e. consolidated strengths of multiple models, improved generalization ability, etc.) as compared to a simple transfer learning strategy where the knowledge is drawn from single pre-trained model. Finally, our approach learns a novel fully-connected dual layer classifier via outcome of three fine-tuned models to yield a final classification result as bonafide or spoof iris trait. Our approach is evaluated on Notre Dame LivDet iris 2017 and Notre Dame contact lenses 2015 anti-spoofing datasets. The experimental analysis of IensNet offers outstanding performance with a lower ACER of 0.2% and 1.4% for Iris-LivDet-2017 and Notre Dame contact lenses 2015 dataset respectively. Besides, IensNet exhibit promising results in cross-dataset environment with an ACA of 91.46%. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2025.

关键词： Contrastive Learning

来源：评论

学校读者我要写书评

暂无评论

CrowdCL: Unsupervised Crowd Counting Network via Contrastive Learning

引用

IEEE Internet of Things Journal 2025年第12期12卷 21704-21719页

作者： Hu, Yingxiang Liu, Yanbo Cao, Guo Wang, Jin Nanjing University of Science and Technology School of Computer Science and Engineering Nanjing210096 China Nantong University School of Information Science and Technology Nantong226000 China

With the continuous growth of the population, crowd counting plays a crucial role in intelligent monitoring systems for the Internet of Things (IoT) and smart city development. Accurate monitoring of crowd density not only helps maintain public safety but also effectively promotes the development of smart cities. Currently, supervised crowd counting techniques have made significant progress in improving accuracy, but these methods rely on expensive manual annotations and have limited generalization performance. To address these challenges, this article proposes an unsupervised crowd counting network based on contrastive learning, named CrowdCL. CrowdCL primarily leverages image-image contrastive learning and text-image contrastive learning to achieve unsupervised crowd counting. Specifically, in image-image contrastive learning, we strengthen the network’s ability to distinguish crowd features by designing progressive occlusion strategies and patch matching strategies, effectively differentiating crowd information from background information. In text-image contrastive learning, we construct ordered textual prompts to match ordered feature maps and use modality matching loss (Lm) to guide the image encoder. Additionally, to reduce the loss of fine details and alleviate the interference of complex backgrounds, we design a coarse-grained filtering strategy during the testing phase, assigning higher weights to crowd patches with greater potential. Experiments on multiple public datasets show that CrowdCL not only achieves outstanding performance but also outperforms some fully supervised methods in cross-dataset testing. © 2014 IEEE.

关键词： Contrastive Learning

来源：评论

学校读者我要写书评

暂无评论

Clustering and Artificial Intelligence-based Prediction of Ecologically Sustainable Species Introductions

IAENG International Journal of Computer Science

引用

IAENG International Journal of computer Science 2025年第4期52卷 1159-1168页

作者： Liu, Shuqiao Zhang, Zhao Zhou, Hongyan Chen, Xue-Bo School of Electronic and Information Engineering University of Science and Technology Liaoning Anshan114051 China School of Computer Science and Software Engineering University of Science and Technology Liaoning Anshan114051 China

There is a growing interest in sustainable ecosystem development, which includes methods such as scientific modeling, environmental assessment, and development forecasting and planning. However, due to insufficient survey data in many current development areas, development progress is delayed and stagnant. To address this situation, this paper proposes a SWOT-TOPSIS-K-Means (STK) data analysis and evaluation model to analyze ecological factors, which can realize a comprehensive and complete data analysis with fewer samples. Decision tree (DT), random forest (RF), and multilayer perceptron (MLP) neural network models were constructed from the results of this analysis, and statistical tests such as r-squared, mean absolute error, and cross-validation are used to further confirm the performance efficiency of the computational prediction models to provide real-time prediction research solutions. For this purpose, data from research scholars on species introduction in ecosystem development were selected for testing. The results show that the proposed assessment model and modeling results satisfy all accuracy-related acceptance requirements. Among them, MLP is better than DT and RF. In summary, the STK assessment model and the MLP prediction model can provide a basis for the selection and development of ecological factors. © (2025), (International Association of Engineers). All rights reserved.

关键词： Prediction models

来源：评论

学校读者我要写书评

暂无评论

PATTERN-BASED TRILATERATION POSITIONING ALGORITHM WITH LOW COMPUTING COST

UPB Scientific Bulletin, Series C: Electrical Engineering an...

引用

UPB Scientific Bulletin, Series C: Electrical engineering and computer Science 2023年第3期85卷 263-272页

作者： Shi, Yong Chu, Zhaoling Qu, Yan Bian, Guiyang School of Computer Information and Engineering Changzhou Institute of Technology Changzhou21300 China

Trilateral positioning with low computing cost is suitable for the large-scale promotion of location-based service. The paper proposed a novel trilateral positioning method for practical engineering applications to reduce the computational workload. Firstly, proposes six-zone patterns, and defines the vertex coordinates and anchor node deployment identification. On this basis, the positioning process is divided into the zone confirmation stage, which is actually to find three anchor nodes;and divided into locate calculation stage, which is optimized according to the pattern. A simulation experiment shows the effectiveness of the proposed algorithm. When the anchor scale is 2000, the zone finding time is reduced by 50%, the trilateral positioning time is reduced by 6.4%, and the overall process is reduced by 6.8%. When the anchor scale is 5000, the zone finding time is reduced by 68%, the trilateral location time is reduced by 10.9%, and the overall process is reduced by 15.9%. When the anchor scale is 10000, the zone finding time is reduced by 58%, the trilateral positioning time is reduced by 8.9%, and the overall process is reduced by 16.3%. So it can reduce energy consumption and prolong the working time of mobile devices. © 2023, Politechnica University of Bucharest. All rights reserved.

关键词： Location based services

来源：评论

学校读者我要写书评

暂无评论

DAFPN-YOLO: An Improved UAV-Based Object Detection Algorithm Based on YOLOv8s

引用

computers, Materials & Continua 2025年第5期83卷 1929-1949页

作者： Honglin Wang Yaolong Zhang Cheng Zhu School of Artificial Intelligence Nanjing University of Information Science and TechnologyNanjing210044China School of Computer Science Nanjing University of Information Science and TechnologyNanjing210044China Electrical&Computer Engineering University of Illinois at Urbana-ChampaignUrbanaIL 61801USA

UAV-based object detection is rapidly expanding in both civilian and military applications,including security surveillance,disaster assessment,and border ***,challenges such as small objects,occlusions,complex backgrounds,and variable lighting persist due to the unique perspective of UAV *** address these issues,this paper introduces DAFPN-YOLO,an innovative model based on YOLOv8s(You Only Look Once version 8s).Themodel strikes a balance between detection accuracy and speed while reducing parameters,making itwell-suited for multi-object detection tasks from drone perspectives.A key feature of DAFPN-YOLO is the enhanced Drone-AFPN(Adaptive Feature Pyramid Network),which adaptively fuses multi-scale features to optimize feature extraction and enhance spatial and small-object *** leverage Drone-AFPN’smulti-scale capabilities fully,a dedicated 160×160 small-object detection head was added,significantly boosting detection accuracy for small *** the backbone,the C2f_Dual(Cross Stage Partial with Cross-Stage Feature Fusion Dual)module and SPPELAN(Spatial Pyramid Pooling with Enhanced LocalAttentionNetwork)modulewere *** components improve feature extraction and information aggregationwhile reducing parameters and computational complexity,enhancing inference ***,Shape-IoU(Shape Intersection over Union)is used as the loss function for bounding box regression,enabling more precise shape-based object *** results on the VisDrone 2019 dataset demonstrate the effectiveness *** to YOLOv8s,the proposedmodel achieves a 5.4 percentage point increase inmAP@0.5,a 3.8 percentage point improvement in mAP@0.5:0.95,and a 17.2%reduction in parameter *** results highlight DAFPN-YOLO’s advantages in UAV-based object detection,offering valuable insights for applying deep learning to UAV-specific multi-object detection tasks.

关键词： YOLOv8 UAV-based object detection AFPN small-object detection head SPPELAN DualConv loss function

来源：评论

学校读者我要写书评

暂无评论

Bootstrap-Based Layerwise Refining for Causal Structure Learning

IEEE Transactions on Artificial Intelligence

引用

IEEE Transactions on Artificial Intelligence 2024年第6期5卷 2708-2722页

作者： Xiang, Guodu Wang, Hao Yu, Kui Guo, Xianjie Cao, Fuyuan Song, Yukun Hefei University of Technology Key Laboratory of Knowledge Engineering with the Big Data of Ministry of Education Hefei230601 China Hefei University of Technology School of Computer Science and Information Engineering Hefei230601 China Shanxi University School of Computer and Information Technology Taiyuan030006 China

Learning causal structures from observational data is critical for causal discovery and many machine learning tasks. Traditional constraint-based methods first adopt conditional independence (CI) tests to learn a global skeleton layer by layer and then orient the undirected edges to obtain a causal structure. However, the reliability of these statistical tests largely depends on the quality of data samples. In real-life scenarios, the presence of data noise or limited samples often makes many CI tests unreliable at each layer in the skeleton learning phase, leading to an inaccurate skeleton. As the number of layers increases, the inaccurate skeleton will continue to impair the skeleton construction of subsequent layers. Furthermore, an unreliable skeleton hampers the skeleton orientation procedure, resulting in an unsatisfactory causal structure. In this article, we propose a Bootstrap-based layerwise refining (BLR) algorithm for causal structure learning, which includes two new procedures to solve the above problems. First, BLR utilizes a novel layerwise skeleton refining procedure to construct the global skeleton layer by layer based on the bootstrap sampling. Second, BLR employs a collective skeleton orientation procedure that incorporates scoring techniques to collectively orient the global skeleton. The experimental results show that BLR outperforms the state-of-the-art methods on the benchmark Bayesian Network datasets. © 2020 IEEE.

关键词： Refining

来源：评论

学校读者我要写书评

暂无评论

BEV-Locator:an end-to-end visual semantic localization network using multi-view images

引用

Science China(information Sciences) 2025年第2期68卷 134-150页

作者： Zhihuang ZHANG Meng XU Wenqiang ZHOU Tao PENG Liang LI Stefan POSLAD School of Vehicle and Mobility Tsinghua University School of Information Technology & Management University of International Business and Economics Qcraft Inc. School of Electronic Engineering and Computer Science Queen Mary University of London

Accurate localization ability is fundamental in autonomous driving. Traditional visual localization frameworks approach the semantic map-matching problem with geometric models, which rely on complex parameter tuning and thus hinder large-scale deployment. In this paper, we propose BEV-Locator: an end-to-end visual semantic localization neural network using multi-view camera images. Specifically, a visual BEV(bird-eye-view) encoder extracts and flattens the multi-view images into BEV space. While the semantic map features are structurally embedded as map query sequences. Then a cross-model transformer associates the BEV features and semantic map queries. The localization information of ego-car is recursively queried out by cross-attention modules. Finally, the ego pose can be inferred by decoding the transformer outputs. This end-to-end model speaks to its broad applicability across different driving environments, including high-speed scenarios. We evaluate the proposed method in large-scale nuScenes and Qcraft datasets. The experimental results show that the BEV-Locator is capable of estimating the vehicle poses under versatile scenarios, which effectively associates the cross-model information from multi-view images and global semantic maps. The experiments report satisfactory accuracy with mean absolute errors of 0.052 m, 0.135 m and 0.251° in lateral, longitudinal translation and heading angle degree.

关键词： visual localization semantic map bird-eye-view transformer pose estimation

来源：评论

学校读者我要写书评

暂无评论

An efficient multi-objective task scheduling in edge computing using adaptive honey badger optimisation

引用

International Journal of Web engineering and technology 2024年第2期19卷 110-126页

作者： Nagalakshmi, Bantupalli Subramanian, Sumathy School of Computer Science and Engineering Vellore Institute of Technology Tamil Nadu Vellore632014 India School of Computer Science Engineering and Information Systems Vellore Institute of Technology Tamil Nadu Vellore632014 India

Task scheduling, which is important in cloud computing, is one of the most challenging issues in this area. Hence, an efficient and reliable task scheduling approach is needed to produce more efficient resource employment. So, a multi-objective-based task scheduling for edge computing is suggested in this study. This paper develops the adaptive honey badger optimisation algorithm (AHBA) to accomplish this goal. The lack of population, the original honey badger algorithm (HBO) has the issue of becoming trapped in local optima. To maintain population variety and improve convergence towards the ideal solution, HBO is combined with the opposition-based learning technique (OBL). Based on makespan, cost, energy consumption, and resource usage, the multi-objective function is created. According to simulation results, the proposed approach has a lot of potential in this field. Java and cloud Simulator are used to implement the suggested model. Copyright © 2024 Inderscience Enterprises Ltd.

关键词： Energy utilization

来源：评论

学校读者我要写书评

暂无评论

Self-Supervised ECG Anomaly Detection Based on Time-Frequency Specific Waveform Mask Feature Fusion

引用

IEEE Access 2025年 13卷 97585-97596页

作者： Tian, Chongrui Zhang, Fengbin Harbin University of Science and Technology School of Computer Science and Technology Harbin150080 China East University of Heilongjiang School of Information Engineering Harbin150086 China

The imbalance of ECG signal data and the complexity of labeling pose significant challenges for deep learning-based anomaly detection. Traditional contrastive learning approaches for ECG anomaly detection often rely on reconstruction or generation;however, normal signals that resemble abnormal ECG samples may be incorrectly clustered, leading to suboptimal performance. To address this issue, we propose an anomaly detection framework TFMAD that integrates ECG signal mask reconstruction with time-frequency contrastive learning, leveraging the correlation between time- and frequency-domain features for anomaly detection. Specifically, the proposed method incorporates an auto-encoder module, a time-frequency mask module, and a contrastive learning module to extract masked time-frequency domain features of ECG signals. The model then reconstructs the signal using time-frequency feature fusion and employs contrastive learning to structure the feature space, ensuring abnormal distributions are effectively learned. We evaluated this method on six datasets, and the results demonstrate that TFMAD outperforms nine state-of-the-art methods. © 2013 IEEE.

关键词： Deep learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：