检索结果-内蒙古大学图书馆

Enhancing Security in Distributed Drone-Based Litchi Fruit Recognition and Localization Systems

computers, Materials & Continua 2025年第2期82卷 1985-1999页

作者： Liang Mao Yue Li Linlin Wang Jie Li Jiajun Tan Yang Meng Cheng Xiong Guangdong-Hong Kong-Macao Greater Bay Area Artificial Intelligence Application Technology Research Institute Shenzhen Polytechnic UniversityShenzhen518055China School of Computer Science and Software Engineering University of Science and Technology LiaoningAnshan114051China

This paper introduces an advanced and efficient method for distributed drone-based fruit recognition and localization, tailored to satisfy the precision and security requirements of autonomous agricultural operations. Our method incorporates depth information to ensure precise localization and utilizes a streamlined detection network centered on the RepVGG module. This module replaces the traditional C2f module, enhancing detection performance while maintaining speed. To bolster the detection of small, distant fruits in complex settings, we integrate Selective Kernel Attention (SKAttention) and a specialized small-target detection layer. This adaptation allows the system to manage difficult conditions, such as variable lighting and obstructive foliage. To reinforce security, the tasks of recognition and localization are distributed among multiple drones, enhancing resilience against tampering and data manipulation. This distribution also optimizes resource allocation through collaborative processing. The model remains lightweight and is optimized for rapid and accurate detection, which is essential for real-time applications. Our proposed system, validated with a D435 depth camera, achieves a mean Average Precision (mAP) of 0.943 and a frame rate of 169 FPS, which represents a significant improvement over the baseline by 0.039 percentage points and 25 FPS, respectively. Additionally, the average localization error is reduced to 0.82 cm, highlighting the model’s high precision. These enhancements render our system highly effective for secure, autonomous fruit-picking operations, effectively addressing significant performance and cybersecurity challenges in agriculture. This approach establishes a foundation for reliable, efficient, and secure distributed fruit-picking applications, facilitating the advancement of autonomous systems in contemporary agricultural practices.

关键词： Objective detection deep learning machine learning

来源：评论

学校读者我要写书评

暂无评论

EventLens: Enhancing Visual Commonsense Reasoning by Leveraging Event-Aware Pretraining and Cross-modal Linking

EventLens: Enhancing Visual Commonsense Reasoning by Leverag...

引用

2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025

作者： Ma, Mingjie Yu, Zhihuan Ma, Yichao Li, Guohui Yang, Zhong School of Computer Science and Technology Huazhong University of Science and Technology Wuhan China School of Software Engineering Huazhong University of Science and Technology Wuhan China

ISBN: (纸本)9798350368741

Visual Commonsense Reasoning (VCR) is a cognitive task, challenging models to answer visual questions, and to explain the rationale behind their answers. While Large Language Models (LLMs) offer potential for this task, VCR's complex scenes require specialized approaches to activate their commonsense reasoning abilities, as existing Multimodal LLMs struggle with VCR's visual events and unique reference tags. To address these challenges, we propose EventLens, which enhances VCR through Event-Aware Pretraining and Cross-modal Linking in Supervised Fine-tuning. First, we introduce a new pretraining stage that emulates human cognitive processes to improve LLM comprehension of complex scenarios. Second, during supervised fine-tuning, we leverage reference tags to explicitly bridge RoI features with text, maintaining semantic integrity across modalities. Additionally, instruct prompts and task-specific adapters help integrate LLMs' knowledge with new commonsense reasoning. Experimental results demonstrate competitive performance with state-of-the-art methods and ablation studies verify the effectiveness of proposed EventLens. © 2025 IEEE.

关键词： Multimedia Understanding Pretrained Language Models Visual Commonsense Reasoning

来源：评论

学校读者我要写书评

暂无评论

Dual-encoder model for typhoon path prediction with multiscale spatiotemporal data fusion

引用

Earth science Informatics 2025年第2期18卷 1-17页

作者： Ren, Shuxia Zhong, Ruikun Guo, Zewei Zhang, Zining School of Software Engineering Tiangong University Tianjin 300387 China School of Computer Science and Technology Tiangong University Tianjin 300387 China

Extreme weather caused by typhoons poses a severe threat to human life safety and socio-economic development, making accurate prediction of typhoon paths crucial. However, existing prediction models struggle to effectively handle and integrate heterogeneous data, overlooking deep correlations between the data, which in turn affects the accuracy of the models. This paper proposes a Dual-Encoder Spatiotemporal Fusion Model for Typhoon Path Prediction (DESF-Typhoon), designed to effectively integrate data from different time scales and extract the underlying relationships between the data, thereby improving prediction accuracy. Specifically, we designed a dual-encoder module that effectively captures complex nonlinear data structures, accounting for the intricate influences of factors such as geography and environment on path, while integrating spatial features of typhoon at different time scales. Subsequently, we introduced a feature interaction module to explore the relationships between different features, which can adaptively learn the interaction weights between them, resulting in a richer feature representation. We evaluated the model using the dataset from the China Meteorological Administration (CMA) and compared its performance with traditional prediction methods and deep learning-based approaches. The results demonstrate that the model significantly improves accuracy and robustness, making innovative contributions in data fusion and spatiotemporal feature modeling. © The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2025.

关键词： Dual encoder Feature interaction Spatiotemporal data Tropical cyclone prediction

来源：评论

学校读者我要写书评

暂无评论

ViF-SD2E: a robust weakly-supervised framework for neural decoding

引用

Neural Computing and Applications 2025年第9期37卷 6645-6661页

作者： Feng, Jingyi Luo, Yong Song, Shuang Hu, Han National Engineering Research Center for Multimedia Software School of Computer Science Wuhan University Wuhan430072 China School of Information and Electronics Beijing Institute of Technology Beijing100081 China

Neural decoding plays a vital role in the interaction between the brain and the outside world. Our task in this paper is to decode the movement track of a finger directly based on the neural data. Existing neural decoding solutions primarily perform some preprocessing operations on neural data before feeding them into existing models (such as LSTM) for decoding. However, these solutions either are prone to overfitting or cannot well exploit the spatial and temporal information. In our previous observations, there is a symmetrical phenomenon between the unsupervised decoded trajectory and the ground truth trajectory within the activity space. This precisely motivates us to propose (or derive) a robust weakly-supervised framework (or model structure), called ViF-SD2E, for neural decoding. In particular, it consists of a space-division (SD) module and an exploration–exploitation (2E) strategy, to effectively exploit both the spatial information of the outside world and the temporal information of neural activity, where the SD2E output is analogized with the weak 0/1 vision feedback (ViF) label for training. Extensive experiments demonstrate the effectiveness of our method, which can sometimes be comparable to supervised counterparts. Therefore, we redirect our attention to the information (hidden in data) ViF-SD2E conveys to us. In other words, we believe that the advantage of ViF-SD2E lies in the fact that its processing steps are objectively determined by the inherent attributes (i.e., symmetry) of the neural data, or rather, the model structure is fixed. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2025.

关键词： Space division multiple access

来源：评论

学校读者我要写书评

暂无评论

Layout Decomposition via Boolean Satisfiability

引用

IEEE Transactions on computer-Aided Design of Integrated Circuits and Systems 2025年第3期44卷 1112-1125页

作者： Liu, Hongduo Liao, Peiyu Zou, Mengchuan Pang, Bowen Li, Xijun Yuan, Mingxuan Ho, Tsung-Yi Yu, Bei The Chinese University of Hong Kong Department of Computer Science and Engineering Hong Kong Hong Kong Huawei Noah's Ark Lab Hong Kong Hong Kong Institute of Software Chinese Academy of Sciences State Key Laboratory of Computer Science Beijing100190 China

Multiple patterning lithography (MPL) has been introduced in the integrated circuits manufacturing industry to enhance feature density as the technology node advances. A crucial step of MPL is assigning layout features to different masks, namely layout decomposition. Exact algorithms like integer linear programming (ILP) can solve layout decomposition to optimality but lack scalability for dense patterns. Relaxation algorithms (e.g., linear programming and semi-definite programming) and heuristics (e.g., exact cover) are capable of handling large cases at the cost of inferior solution quality. These methods rely on different mathematical solvers and expert-designed heuristics to offer a balance between solution quality and computational efficiency. In this article, we propose a unified layout decomposition framework comprising three algorithms: 1) satisfiability (SAT)-exact;2) SAT-bilevel;and 3) SAT-fast, all leveraging the capabilities of Boolean SAT solvers. The SAT-exact ensures optimality, but with faster convergence than ILP, SAT-bilevel addresses the decomposition as a bilevel optimization problem for rapid near-optimal solutions, and SAT-fast handles very large layouts in an incremental manner. Experimental results demonstrate our framework's superiority over existing state-of-the-art methods in terms of solution quality and runtime. © 2024 IEEE.

关键词： Integer linear programming

来源：评论

学校读者我要写书评

暂无评论

AI-Enhanced Secure Data Aggregation for Smart Grids with Privacy Preservation

引用

computers, Materials & Continua 2025年第1期82卷 799-816页

作者： Congcong Wang Chen Wang Wenying Zheng Wei Gu School of Software Nanjing University of Information Science and TechnologyNanjing210044China School of Information Science and Engineering Zhejiang Sci-Tech UniversityHangzhou310018China State Key Laboratory of Public Big Data Guizhou UniversityGuiyang550025China School of Computer Science and Technology(School of Artificial Intelligence) Zhejiang Sci-Tech UniversityHangzhou310018China School of Computer Science(School of Cyber Science and Engineering) Nanjing University of Information Science and TechnologyNanjing210044China

As smart grid technology rapidly advances,the vast amount of user data collected by smart meter presents significant challenges in data security and privacy *** research emphasizes data security and user privacy concerns within smart ***,existing methods struggle with efficiency and security when processing large-scale *** efficient data processing with stringent privacy protection during data aggregation in smart grids remains an urgent *** paper proposes an AI-based multi-type data aggregation method designed to enhance aggregation efficiency and security by standardizing and normalizing various data *** approach optimizes data preprocessing,integrates Long Short-Term Memory(LSTM)networks for handling time-series data,and employs homomorphic encryption to safeguard user *** also explores the application of Boneh Lynn Shacham(BLS)signatures for user *** proposed scheme’s efficiency,security,and privacy protection capabilities are validated through rigorous security proofs and experimental analysis.

关键词： Smart grid data security privacy protection artificial intelligence data aggregation

来源：评论

学校读者我要写书评

暂无评论

MapsTSF: efficient traffic prediction via hybrid Mamba 2-transformer spatiotemporal modeling and cross adaptive periodic sparse forecasting

引用

Journal of Supercomputing 2025年第7期81卷 1-31页

作者： Wang, Bing Cai, Chaoqi Zhang, Xingpeng Zhao, Chunlan Zhang, Chi Zhang, Youming School of Computer Science and Software Engineering Southwest Petroleum University Sichuan Chengdu610500 China School of Science Southwest Petroleum University Sichuan Chengdu610500 China

Traffic flow prediction is a critical yet challenging task due to the complex spatiotemporal dependencies in road networks. We propose MapsTSF, a novel framework that enhances traffic forecasting by integrating three innovative components. Spatial path embedding uses multiple traversal algorithms to capture dynamic spatial flow patterns across road networks. The hybrid Mamba2-transformer model combines Mamba2’s selective state-space modeling with transformer’s global attention to effectively capture intricate spatiotemporal dependencies. Additionally, the cross adaptive periodic sparse forecasting mechanism decouples periodic and trend features, reducing computational overhead while preserving high accuracy. Experiments on real-world datasets demonstrate that MapsTSF outperforms baseline methods by an average of 18.80% in MAE, 15.69% in RMSE, and 19.80% in MAPE. With its scalable and efficient design, MapsTSF is well-suited for real-time traffic management in smart cities and navigation applications, providing precise and reliable forecasts for large-scale, dynamic road networks. © The Author(s), under exclusive licence to Springer science+Business Media, LLC, part of Springer Nature 2025.

关键词： Street traffic control

来源：评论

学校读者我要写书评

暂无评论

Controllable chaos in the coupled waveguide-optomechanical system with phase shifter

引用

science China(Physics,Mechanics & Astronomy) 2025年第4期68卷 26-34页

作者： Deng-Wei Zhang Pei-Qi Qin Li-Li Zheng Zhi-Ming Zhan Xin-You Lü Department of Mathematics and Physics Luoyang Institute of Science and TechnologyLuoyang 471023China School of Computer and Information Engineering School of SoftwareLuoyang Institute of Science and TechnologyLuoyang 471023China School of Artificial Intelligence Jianghan UniversityWuhan 430056China School of Physics Huazhong University of Science and TechnologyWuhan 430074China

We theoretically investigate chaotic dynamics in an optomechanical system composed of a whispering-gallery-mode(WGM)microresonator and a *** find that tuning the optical phase using a phase shifter and modifying the coupling strength via a unidirectional waveguide(IWG)can induce chaotic *** underlying reason for this phenomenon is that adjusting the phase and coupling strength via the phase shifter and IWG bring the system close to an exceptional point(EP),where field localization dynamically enhances the optomechanical nonlinearity,leading to the generation of chaotic *** addition,due to the sensitivity of chaos to phase in the vicinity of the EP,we propose a theoretical scheme to measure the optical phase perturbations using *** work may offer an alternative approach to chaos generation with current experimental technology and provide theoretical guidance for optical signal processing and chaotic secure communication.

关键词： cavity optomechanics chaos exceptional point

来源：评论

学校读者我要写书评

暂无评论

DIEONet:Domain-Invariant Information Extraction and Optimization Network for Visual Place Recognition

引用

computers, Materials & Continua 2025年第3期82卷 5019-5033页

作者： Shaoqi Hou Zebang Qin Chenyu Wu Guangqiang Yin Xinzhong Wang Zhiguo Wang School of Computer Science and Technology Xinjiang UniversityUrumqi830046China School of Information and Software Engineering University of Electronic Science and Technology ofChinaChengdu611731China Institute of Public Security Kash Institute of Electronics and Information IndustryKashi844000China

Visual Place Recognition(VPR)technology aims to use visual information to judge the location of agents,which plays an irreplaceable role in tasks such as loop closure detection and *** is well known that previous VPR algorithms emphasize the extraction and integration of general image features,while ignoring the mining of salient features that play a key role in the discrimination of VPR *** this end,this paper proposes a Domain-invariant Information Extraction and Optimization Network(DIEONet)for *** core of the algorithm is a newly designed Domain-invariant Information Mining Module(DIMM)and a Multi-sample Joint Triplet Loss(MJT Loss).Specifically,DIMM incorporates the interdependence between different spatial regions of the feature map in the cascaded convolutional unit group,which enhances the model’s attention to the domain-invariant static object *** Loss introduces the“joint processing of multiple samples”mechanism into the original triplet loss,and adds a new distance constraint term for“positive and negative”samples,so that the model can avoid falling into local optimum during *** demonstrate the effectiveness of our algorithm by conducting extensive experiments on several authoritative *** particular,the proposed method achieves the best performance on the TokyoTM dataset with a Recall@1 metric of 92.89%.

关键词： Visual place recognition domain-invariant information mining module multi-sample joint triplet loss

来源：评论

学校读者我要写书评

暂无评论

Hyperspectral Image Classification Method Based on Global Space-Spectral Attention Mechanism

引用

Journal of Shanghai Jiaotong University (science) 2025年 1-11页

作者： Qin, Rui Wu, Benze Liu, Xinfu Wu, Yirui College of Computer Science and Software Engineering Hohai University Nanjing211100 China Nangjing Research Institute of Electronic Engineering Nanjing210007 China College of Artificial Intelligence and Automation Hohai University Jiangsu Changzhou213200 China

In hyperspectral remote sensing imagery, pixel interactions within defined spatial extents result in the mixing of adjacent pixels. Additionally, the high similarity of adjacent spectra leads to information redundancy, which hinders the extraction of global spatial and spectral correlations. In order to solve the problems of mixed adjacent pixels and redundant adjacent spectra, this work offers a hyperspectral image classification approach that uses a global space-spectral attention mechanism. First, the proposed method’s global spatial attention module uses multi-scale dilated convolution to produce a bigger receptive field to be capable of capturing global spatial correlation and obtain unmixed pixel information. Then, the global spectral attention module designs a spectral domain partition algorithm, using the combination of regional density as well as information entropy as the threshold to divide spectrum into dispersed subsets and eliminate redundant information. The global context information for entire spectral band is fully exploited, and correlation of the global spectral information is extracted. Finally, the two modules combine to provide a global correlation of space and spectrum. Experiments demonstrate that the suggested method obtains overall accuracies of 97.28%, 94.73%, and 95.76% on the three WHU-Hi hyperspectral datasets, surpassing comparison methods. © Shanghai Jiao Tong University 2024.

关键词： Pixels

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：