Search Results - Inner Mongolia University Library

31st International Conference on Computational Linguistics, COLING 2025

Authors: Liu, Xinjing; Li, Ruifan; Ye, Shuqin; Zhang, Guangwei; Wang, Xiaojie. Affiliations: School of Artificial Intelligence, Beijing University of Posts and Telecommunications, China; School of Computer Science, Beijing University of Posts and Telecommunications, China; Engineering Research Center of Information Networks, Ministry of Education, China; Key Laboratory of Interactive Technology and Experience System, Ministry of Culture and Tourism, China

ISBN: (print) 9798891761964

Multimodal Aspect-Based Sentiment Analysis (MABSA) aims to extract aspect terms from text-image pairs and identify their sentiments. Previous methods are based on the premise that the image contains the objects referred to by the aspects within the text. However, this condition cannot always be met, resulting in suboptimal performance. In this paper, we propose the COnditional Relation based Sentiment Analysis framework (CORSA). Specifically, we design a conditional relation detector (CRD) to mitigate the impact of images that do not meet this condition. Moreover, we design a visual object localizer (VOL) to locate the exact condition-related visual regions associated with the aspects. With CRD and VOL, our CORSA framework takes a multi-task form. In addition, to effectively learn CORSA, we conduct two types of annotations. One is the conditional relation, annotated using a pretrained referring expression comprehension model; the other is the bounding boxes of visual objects, annotated by a pretrained object detection model. Experiments on our built C-MABSA dataset show that CORSA consistently outperforms existing methods. The code and data are available at https://***/Liuxj-Anya/CORSA. © 2025 Association for Computational Linguistics.

Keywords: Computational linguistics

Event-based Video Person Re-identification via Cross-Modality and Temporal Collaboration

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Renkai Li; Xin Yuan; Wei Liu; Xin Xu. Affiliations: School of Computer Science and Technology, Wuhan University of Science and Technology; Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System

ISBN: (digital) 9798350368741

ISBN: (print) 9798350368758

Video-based person re-identification (ReID) has become increasingly important due to its applications in video surveillance. By employing events in video-based person ReID, more motion information can be provided between continuous frames to improve recognition accuracy. Previous approaches have introduced event data into the video person ReID task as an aid, but they still cannot avoid the privacy leakage problem caused by RGB images. In order to avoid privacy attacks and to take advantage of the benefits of event data, we consider using only event data. To make full use of the information in the event stream, we propose a Cross-Modality and Temporal Collaboration (CMTC) network for event-based video person ReID. First, we design an event transform network to obtain corresponding auxiliary information from the raw event input. Additionally, we propose a differential modality collaboration module to balance the roles of events and auxiliaries to achieve complementary effects. Furthermore, we introduce a temporal collaboration module to exploit motion information and appearance cues. Experimental results demonstrate that our method outperforms others in the task of event-based video person ReID.

Keywords: Data privacy; Correlation; Event detection; Collaboration; Transforms; Streaming media; Signal processing; Video surveillance; Speech processing; Identification of persons

Modeling Method for DFIG-Based Wind Farm in High-Efficiency Real-Time Electromagnetic Transient (EMT) Simulations

IEEE Transactions on Power Electronics, 2025

Authors: Liu, Yifan; Xu, Jianzhong; Zhu, Yiyang; Tian, Zhaoxuan; Zhao, Chengyong; Li, Gen. Affiliations: State Key Laboratory of Alternate Electrical Power System with Renewable Energy Sources, China; Energy Technology and Computer Science Section, Department of Engineering Technology and Didactics, Ballerup 2750, Denmark

With the increasing integration of renewable energy into power systems, electromagnetic transient (EMT) simulation has become indispensable for accurate system analysis. However, the complexity of wind turbine (WT) modeling, characterized by a large number of electrical nodes, poses significant challenges and necessitates substantial real-time simulation hardware. Existing methods for reducing circuit complexity improve simulation efficiency, but each has inherent limitations. Aggregation methods sacrifice considerable internal station information, while existing decoupling techniques are constrained by specific requirements. This paper proposes a real-time simulation model for a Doubly Fed Induction Generator (DFIG)-based wind farm (WF) using latency decoupling and a multi-level nested fast and simultaneous solution (M-NFSS) approach, effectively reducing node count while preserving internal details. A WF test model is implemented in RTDS for validation. Results demonstrate that the proposed model reproduces impedance characteristics and time-domain waveforms with high accuracy while using only 33.3% of the hardware resources required by the traditional detailed model. © 1986-2012 IEEE.

Keywords: Electromagnetic transients

Flight delay causes determination based on adversarial defense of TSVM

2024 4th International Conference on Intelligent Traffic Systems and Smart City, ITSSC 2024

Authors: Liu, Li; Chen, Haiyan; Zhou, Wei; Zhou, Xinding. Affiliations: College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing, China; State Key Laboratory of Air Traffic Management System, Nanjing, China; Operation Supervisory Center of CAAC, Beijing, China; College of Civil Aviation, Nanjing University of Aeronautics and Astronautics, Nanjing, China

ISBN: (print) 9781510686304

With the rapid development of the aviation industry, flight delays have become a global issue, resulting in significant economic losses and passenger dissatisfaction. Accurately determining the causes of flight delays is crucial for airlines, air traffic management, and passengers. This study proposes an Adversarial Defense of the Transductive Support Vector Machine (ad-TSVM) algorithm to address the problem of flight delay cause determination. The ad-TSVM algorithm generates adversarial samples through adversarial training techniques and incorporates these samples into the Transductive Support Vector Machine (TSVM) to improve the model's classification accuracy and robustness. Compared to traditional methods, ad-TSVM can more precisely attribute the responsibility for flight delays to various causes, such as airlines, air traffic control, weather, and air traffic flow, while resisting adversarial attacks. Experimental results demonstrate that the ad-TSVM algorithm significantly outperforms traditional TSVM and other standard machine-learning algorithms in classification accuracy on multiple real-world datasets. Additionally, the algorithm performs well in resisting adversarial attacks, enhancing the model's robustness and reliability. This study addresses flight delay cause determination challenges, offering theoretical support for further research and practical applications in related fields. © 2025 SPIE

Keywords: Air traffic control

Optimization Scheduling of Multi-Energy Microgrid Based on the QUBO Quantum Computing Model

19th Annual Conference of China Electrotechnical Society, ACCES 2024

Authors: Wang, Baonan; Wang, Hui; Zhang, Dan. Affiliations: College of Computer Science and Technology, Shanghai University of Electric Power, Shanghai, China; State Key Laboratory of Power System Operation and Control, Dept. of Electrical Engineering, Tsinghua University, Beijing, China; Institute of Logistic Science and Engineering, Shanghai Maritime University, Shanghai, China

ISBN: (print) 9789819608966

Current optimization methods for microgrid scheduling face issues such as insufficient precision in energy distribution, high operational costs, and inefficiency. In response to these challenges, an optimization scheduling method for multi-energy microgrids based on the QUBO (Quadratic Unconstrained Binary Optimization) quantum computing model is proposed in this paper, alongside a discretization approach based on a grid partitioning strategy. To validate the feasibility of addressing microgrid optimization scheduling problems in a quantum computing environment, a QUBO model is constructed and simulation studies are conducted in the qbsolv environment provided by D-Wave. The results indicate that operational costs are significantly reduced with an increase in the number of grids, particularly a reduction of 4.52% when the number of grids increases from five to ten. This method not only effectively enhances the precision of electricity supply and energy utilization efficiency but also reduces unnecessary electrical waste and economic costs. © Beijing Paike Culture Commu. Co., Ltd. 2025.

Keywords: Microgrids
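
For readers unfamiliar with the QUBO form named in the abstract above, the following is a minimal sketch of what such a model looks like. It is not the paper's formulation: the one-hot encoding of discrete power levels, the cost values, the penalty weight, and the helper names (`build_one_hot_qubo`, `solve_qubo_brute_force`) are all assumptions made for illustration, and a brute-force search stands in for a quantum or qbsolv-style solver.

```python
from itertools import product


def build_one_hot_qubo(costs, penalty):
    """Build a QUBO that picks exactly one discrete level at minimum cost.

    Hypothetical toy model: minimize sum(c_i * x_i) + penalty * (sum(x_i) - 1)^2.
    Expanding the penalty with x_i^2 == x_i gives diagonal terms (c_i - penalty)
    and off-diagonal terms 2 * penalty; the constant +penalty is dropped.
    """
    n = len(costs)
    Q = {}
    for i in range(n):
        Q[(i, i)] = costs[i] - penalty
        for j in range(i + 1, n):
            Q[(i, j)] = 2.0 * penalty
    return Q


def qubo_energy(Q, x):
    """Evaluate x^T Q x for a binary tuple x and an upper-triangular dict Q."""
    return sum(coeff * x[i] * x[j] for (i, j), coeff in Q.items())


def solve_qubo_brute_force(Q, n):
    """Enumerate all 2^n assignments; only practical for very small n."""
    best_x, best_e = None, float("inf")
    for x in product((0, 1), repeat=n):
        e = qubo_energy(Q, x)
        if e < best_e:
            best_x, best_e = x, e
    return best_x, best_e


# Illustrative costs for three discretized output levels (made-up numbers).
costs = [3.0, 2.0, 4.0]
penalty = 10.0
Q = build_one_hot_qubo(costs, penalty)
best_x, best_e = solve_qubo_brute_force(Q, len(costs))
# Adding back the dropped constant recovers the cost of the chosen level.
chosen_cost = best_e + penalty
```

In this toy setting, a finer discretization simply means more binary variables per quantity, which is why the search space grows quickly and a dedicated QUBO solver becomes attractive.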

Prior Knowledge-Driven Hybrid Prompter Learning for RGB-Event Tracking

IEEE Transactions on Circuits and Systems for Video Technology, 2025

Authors: Wang, Mianzhao; Shi, Fan; Cheng, Xu; Chen, Shengyong. Affiliations: Tianjin University of Technology, Engineering Research Center of Learning-Based Intelligent System, Ministry of Education, Key Laboratory of Computer Vision and System, School of Computer Science and Engineering, Tianjin 300384, China; Technical University of Denmark, Department of Technology Management and Economics, Kongens Lyngby, Denmark

Event data can asynchronously capture variations in light intensity, thereby implicitly providing valuable complementary cues for RGB-Event tracking. Existing methods typically employ a direct interaction mechanism to fuse RGB and event data. However, due to differences in imaging mechanisms, the representational disparity between these two data types is not fixed, which can lead to tracking failures in certain challenging scenarios. To address this issue, we propose a novel prior knowledge-driven hybrid prompter learning framework for RGB-Event tracking. Specifically, we develop a frame-event hybrid prompter that leverages prior tracking knowledge from the foundation model as intermediate modal support to mitigate the heterogeneity between RGB and event data. By leveraging its rich prior tracking knowledge, the intermediate modal reduces the gap between the dense RGB and sparse event data interactions, effectively guiding complementary learning between modalities. Meanwhile, to mitigate the internal learning disparities between the lightweight hybrid prompter and the deep transformer model, we introduce a pseudo-prompt learning strategy that lies between full fine-tuning and partial fine-tuning. This strategy adopts a divide-and-conquer approach to assign different learning rates to modules with distinct functions, effectively reducing the dominant influence of RGB information in complex scenarios. Extensive experiments conducted on two public RGB-Event tracking datasets show that the proposed HPL outperforms state-of-the-art tracking methods, achieving exceptional performance. © 1991-2012 IEEE.

Keywords: Object detection

Harnessing Light Field Angular Cues and Spatial Geometries for Semantic Segmentation

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Chen Jia; Fan Shi; Xu Cheng. Affiliations: School of Computer Science and Engineering, The Engineering Research Center of Learning-Based Intelligent System (Ministry of Education), The Key Laboratory of Computer Vision and System (Ministry of Education), Tianjin University of Technology, Tianjin, China

ISBN: (digital) 9798350368741

ISBN: (print) 9798350368758

4D light field imaging captures rich spatial-angular information, providing essential geometric cues for semantic segmentation tasks. In this paper, we introduce a novel backbone network called the Light Field Extraction Interaction Network (LFEI-Net). LFEI-Net excels in extracting global structures and multi-scale spatial-angular features, capturing feature dependencies through channel modeling and diverse feature interactions. Unlike traditional methods that depend on pyramid and dilated feature extraction, LFEI-Net pioneers an efficient method by integrating large-scale horizontal depth-wise convolution (HDWC) and vertical depth-wise convolution (VDWC) with interactive operations for comprehensive spatial multi-scale feature extraction. Furthermore, we present the Multi-Angular Modeling (MAM) module, which effectively captures scene angle variations from multiple perspectives and precisely delineates object boundaries, thereby improving model adaptability. Our experimental evaluations on two datasets demonstrate that LFEI-Net significantly outperforms state-of-the-art (SOTA) 2D and 4D light field semantic segmentation methods, achieving mean Intersection over Union (mIoU) of 83.72% and 86.88%, respectively.

Keywords: Geometry; Adaptation models; Convolution; Semantic segmentation; Imaging; Feature extraction; Light fields; Acoustics; Speech processing
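
The abstract above reports results as mean Intersection over Union (mIoU). As a reminder of how that metric is commonly computed, here is a minimal sketch over flattened label lists. The function name `mean_iou`, the toy labels, and the convention of skipping classes that appear in neither prediction nor ground truth are assumptions for illustration; the paper's own evaluation protocol may differ.

```python
def mean_iou(pred, target, num_classes):
    """Average per-class IoU over flattened per-pixel class labels.

    Classes absent from both pred and target are skipped (one common
    convention, assumed here).
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    ious = []
    for c in range(num_classes):
        # Pixels where both agree on class c.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels where either one says class c.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example with three classes and six pixels (made-up labels).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
miou = mean_iou(pred, target, num_classes=3)
# Per-class IoU here is 1/3, 2/3 and 1/2, so the mean is 0.5.
```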

VAGeo: View-specific Attention for Cross-View Object Geo-Localization

International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Zhongyang Li; Xin Yuan; Wei Liu; Xin Xu. Affiliations: School of Computer Science and Technology, Wuhan University of Science and Technology; Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System

ISBN: (digital) 9798350368741

ISBN: (print) 9798350368758

Cross-view object geo-localization (CVOGL) aims to locate an object of interest in a captured ground- or drone-view image within the satellite image. However, existing works treat ground-view and drone-view query images equivalently, overlooking their inherent viewpoint discrepancies and the spatial correlation between the query image and the satellite-view reference image. To this end, this paper proposes a novel View-specific Attention Geo-localization method (VAGeo) for accurate CVOGL. Specifically, VAGeo contains two key modules: a view-specific positional encoding (VSPE) module and a channel-spatial hybrid attention (CSHA) module. At the object level, according to the characteristics of the different viewpoints of ground and drone query images, viewpoint-specific positional encodings are designed in the VSPE module to more accurately identify the click-point object of the query image. At the feature level, a hybrid attention is introduced in the CSHA module by combining channel attention and spatial attention mechanisms for learning discriminative features. Extensive experimental results demonstrate that the proposed VAGeo gains a significant performance improvement, i.e., improving acc@0.25/acc@0.5 on the CVOGL dataset from 45.43%/42.24% to 48.21%/45.22% for ground-view, and from 61.97%/57.66% to 66.19%/61.87% for drone-view.

Keywords: Location awareness; Accuracy; Attention mechanisms; Image coding; Signal processing; Encoding; Satellite images; Object recognition; Speech processing; Drones
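
The acc@0.25 and acc@0.5 figures in the abstract above are threshold-based localization accuracies. A minimal sketch of such a metric follows, assuming it means the fraction of predicted boxes whose IoU with the ground-truth box reaches the threshold. The `(x1, y1, x2, y2)` box format, the `>=` comparison, the function names, and the sample boxes are all illustrative assumptions rather than the dataset's official evaluation code.

```python
def box_iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    # Clamp at zero so disjoint boxes yield no intersection.
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def accuracy_at_iou(preds, gts, threshold):
    """Fraction of predictions whose IoU with ground truth is >= threshold."""
    if len(preds) != len(gts) or not gts:
        raise ValueError("preds and gts must be non-empty and equal in length")
    hits = sum(1 for p, g in zip(preds, gts) if box_iou(p, g) >= threshold)
    return hits / len(gts)


# Three made-up predictions against ground truth shifted by 0, 5 and 8 units.
preds = [(0, 0, 10, 10), (0, 0, 10, 10), (0, 0, 10, 10)]
gts = [(0, 0, 10, 10), (5, 0, 15, 10), (8, 0, 18, 10)]
acc_025 = accuracy_at_iou(preds, gts, 0.25)  # IoUs are 1.0, 1/3, 1/9
acc_050 = accuracy_at_iou(preds, gts, 0.5)
```

A stricter threshold can only lower the score, which matches the pattern in the reported numbers, where each acc@0.5 value is below its acc@0.25 counterpart.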

A third-party platoon coordination service: Pricing under government subsidies

Authors: Bai, Ting; Johansson, Alexander; Li, Shaoyuan; Johansson, Karl Henrik; Mårtensson, Jonas. Affiliations: School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm, Sweden; KTH Royal Institute of Technology, Stockholm, Sweden; Department of Automation, Shanghai Jiao Tong University, Shanghai, China; Key Laboratory of System Control and Information Processing, Shanghai Jiao Tong University, Shanghai, China; Digital Futures, Stockholm, Sweden

This paper models a platooning system consisting of trucks and a third-party service provider (TPSP), which performs platoon coordination, distributes the platooning profit within platoons, and charges trucks in exchange for the services. Government subsidies used to incentivize platooning are also considered. We propose a pricing rule for the TPSP, which keeps part of the platooning profit, including the subsidy, each time a platoon is formed. In addition, a platoon coordination solution based on distributed model predictive control (MPC) is proposed, in which the pricing rule under government subsidies is integrated. We perform a realistic simulation over the Swedish road network to evaluate the impact of the pricing rule and subsidies on the achieved profits and fuel savings. Our results show that subsidies are an effective means to boost fuel savings from platooning. Moreover, the simulation study indicates that high pricing corresponds to a low platooning rate of the system, as trucks' incentives for platooning decrease. © 2023 The Authors. Asian Journal of Control published by John Wiley & Sons Australia, Ltd on behalf of Chinese Automatic Control Society.

Keywords: Trucks
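
To make the pricing idea in the abstract above concrete, here is a small arithmetic sketch of a provider keeping a share of the platooning profit plus subsidy. The abstract does not state how the share is set or how the remainder is divided, so the fixed fraction `tpsp_share`, the even split among trucks, the function name, and the numbers are all hypothetical.

```python
def split_platoon_profit(fuel_saving_profit, subsidy, tpsp_share, num_trucks):
    """Return (provider_fee, profit_per_truck) for one formed platoon.

    Assumptions (not from the paper): the provider keeps a fixed fraction of
    the total, and the remainder is divided evenly among the trucks.
    """
    if not 0.0 <= tpsp_share <= 1.0:
        raise ValueError("tpsp_share must be between 0 and 1")
    if num_trucks < 1:
        raise ValueError("a platoon needs at least one truck")
    total = fuel_saving_profit + subsidy
    provider_fee = tpsp_share * total
    per_truck = (total - provider_fee) / num_trucks
    return provider_fee, per_truck


# Made-up monetary units: compare a low and a high provider share.
fee_low, truck_low = split_platoon_profit(90.0, 10.0, tpsp_share=0.2, num_trucks=4)
fee_high, truck_high = split_platoon_profit(90.0, 10.0, tpsp_share=0.6, num_trucks=4)
```

Under these assumptions a higher provider share leaves each truck with less, which is consistent with the abstract's observation that high pricing weakens trucks' incentives to platoon.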

Gesture Recognition of sEMG Based on Res-LSTM

17th International Conference on Intelligent Robotics and Applications, ICIRA 2024

Authors: Zhao, Yujia; Zou, Chunlong; Yun, Juntong; Jiang, Du; Huang, Li; Liu, Ying; Jiang, Guozhang; Xie, Yuanmin. Affiliations: Key Laboratory of Metallurgical Equipment and Control Technology of Ministry of Education, Wuhan University of Science and Technology, Wuhan 430081, China; College of Mechanical Engineering, Hubei University of Automotive Technology, Shiyan 442000, China; Research Center for Biomimetic Robot and Intelligent Measurement and Control, Wuhan University of Science and Technology, Wuhan 430081, China; Hubei Key Laboratory of Mechanical Transmission and Manufacturing Engineering, Wuhan University of Science and Technology, Wuhan 430081, China; Hubei Province Key Laboratory of Intelligent Information Processing and Real-Time Industrial System, Wuhan University of Science and Technology, Wuhan 430081, China; College of Computer Science and Technology, Wuhan University of Science and Technology, Wuhan 430081, China; School of Mechanical Engineering, Hubei Engineering University, Xiaogan 432000, China

ISBN: (print) 9789819607761

Control of bionic prostheses with surface electromyography (sEMG) signals has been widely studied over the past few years. In particular, sparse sEMG signals are rapidly gaining ground in gesture recognition because they are convenient, noninvasive, and easy to acquire. However, compared with high-density EMG signals, sparse EMG signals lack rich feature information, which in turn affects gesture recognition accuracy. In order to reduce the loss of feature information of sparse EMG signals in the spatio-temporal dimension, this paper proposes a hybrid neural network, Res-LSTM, combining a residual network and a long short-term memory network. The ordinary convolutional blocks in the CNN are replaced with residual blocks, the final fully connected layer is removed, and a constant mapping layer is added to adequately extract the spatial feature information of the data. The Res layer's output is used as the input to the long short-term memory (LSTM) network, which further extracts the features of the data in the temporal dimension, and the classification output is finally produced by a fully connected layer. The network achieved an average gesture classification accuracy of 91.11% on a homebrew dataset and 91.03% on the public NinaPro DB1 dataset. The experimental findings indicate that the proposed Res-LSTM network framework helps address the lack of feature information in sparse sEMG signals and improves pattern recognition accuracy. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords: Gesture recognition
