Search Results - Inner Mongolia University Library

arXiv, 2023

Authors: Luo, Pengfei; Xu, Tong; Wu, Shiwei; Zhu, Chen; Xu, Linli; Chen, Enhong. Affiliations: School of Computer Science and Technology, University of Science and Technology of China, State Key Laboratory of Cognitive Intelligence, Anhui, Hefei, China; School of Data Science, University of Science and Technology of China, State Key Laboratory of Cognitive Intelligence, Anhui, Hefei, China; Career Science Lab, BOSS Zhipin, School of Management, University of Science and Technology of China, Beijing, China

Multimodal entity linking (MEL), which aims to resolve ambiguous mentions to a multimodal knowledge graph, has attracted wide attention in recent years. Although considerable effort has been made to explore the complementary effect among multiple modalities, existing methods may fail to fully absorb the comprehensive expression of abbreviated textual context and implicit visual indication. Even worse, the inevitable noisy data may cause inconsistency among different modalities during the learning process, which severely degrades performance. To address these issues, we propose a novel Multi-GraIned Multimodal Interaction Network (MIMIC) framework for solving the MEL task. Specifically, the unified inputs of mentions and entities are first encoded by textual and visual encoders separately, to extract global descriptive features and local detailed features. Then, to derive the similarity matching score for each mention-entity pair, we devise three interaction units to comprehensively explore the intra-modal interaction and inter-modal fusion among features of entities and mentions. In particular, three modules, namely the Text-based Global-Local Interaction Unit (TGLU), the Vision-based DuaL Interaction Unit (VDLU) and the Cross-Modal Fusion-based Interaction Unit (CMFU), are designed to capture and integrate the fine-grained representation lying in abbreviated text and implicit visual cues. Afterwards, we introduce a unit-consistency objective function via contrastive learning to avoid inconsistency and model degradation. Experimental results on three public benchmark datasets demonstrate that our solution outperforms various state-of-the-art baselines, and ablation studies verify the effectiveness of the designed modules. © 2023, CC BY-NC-ND.

Keywords: Learning systems

SceneSketcher: Fine-Grained Image Retrieval with Scene Sketches

16th European Conference on Computer Vision, ECCV 2020

Authors: Liu, Fang; Zou, Changqing; Deng, Xiaoming; Zuo, Ran; Lai, Yu-Kun; Ma, Cuixia; Liu, Yong-Jin; Wang, Hongan. Affiliations: State Key Laboratory of Computer Science and Beijing Key Lab of Human-Computer Interaction, Institute of Software, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China; HMI Laboratory, Huawei Technologies, Shenzhen, China; Cardiff University, Cardiff, United Kingdom; Tsinghua University, Beijing, China

ISBN: (print) 9783030585280

Sketch-based image retrieval (SBIR) has been a popular research topic in recent years. Existing works concentrate on mapping the visual information of sketches and images to a semantic space at the object level. In this paper, for the first time, we study the fine-grained scene-level SBIR problem which aims at retrieving scene images satisfying the user’s specific requirements via a freehand scene sketch. We propose a graph embedding based method to learn the similarity measurement between images and scene sketches, which models the multi-modal information, including the size and appearance of objects as well as their layout information, in an effective manner. To evaluate our approach, we collect a dataset based on SketchyCOCO and extend the dataset using Coco-stuff. Comprehensive experiments demonstrate the significant potential of the proposed approach on the application of fine-grained scene-level image retrieval. © 2020, Springer Nature Switzerland AG.

Keywords: Semantics

Constrained maximum weighted bipartite matching: a novel approach to radio broadcast scheduling

Science China (Information Sciences), 2019, Vol. 62, No. 7, pp. 156-169

Authors: Shaojiang WANG; Tianyong WU; Yuan YAO; Dongbo BU; Shaowei CAI. Affiliations: State Key Laboratory of Computer Science, Institute of Software, Chinese Academy of Sciences; School of Computer Science and Technology, University of Chinese Academy of Sciences; Beijing Key Lab of Human-Computer Interaction, Institute of Software, Chinese Academy of Sciences; Institute of Computing Technology, Chinese Academy of Sciences

Given a set of radio broadcast programs, the radio broadcast scheduling problem is to allocate a set of devices to transmit the programs to achieve the optimal sound quality. In this article, we propose a complete algorithm to solve the problem, which is based on a branch-and-bound (BnB) algorithm. We formulate the problem with a new model, called constrained maximum weighted bipartite matching (CMBM), i.e., the maximum matching problem on a weighted bipartite graph with constraints. For the reduced matching problem, we propose a novel BnB algorithm by introducing three new strategies, including the highest quality first, the least conflict first and the more edge first. We also establish an upper bound estimating function for pruning the search space of the algorithm. The experimental results show that our new algorithm can quickly find the optimal solution for the radio broadcast scheduling problem at small scales, and has higher scalability for the problems at large scales than the existing complete algorithm.

Keywords: radio broadcast scheduling; branch-and-bound algorithm; constrained maximum weighted bipartite matching; Kuhn-Munkres algorithm; strategy combinations
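
The branch-and-bound idea in this abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the constraint model (pairs of edges that may not be chosen together), the function name `solve_cmbm`, and the simple upper bound (each remaining program takes its best edge) are all assumptions made for illustration. Only the "highest quality first" ordering is named in the abstract.

```python
from typing import Dict, FrozenSet, List, Tuple

Edge = Tuple[int, int]  # (program index, device index)


def solve_cmbm(
    weights: List[List[float]],
    conflicts: FrozenSet[FrozenSet[Edge]] = frozenset(),
) -> Tuple[float, Dict[int, int]]:
    """Exhaustive branch-and-bound over program -> device assignments.

    weights[p][d] is the (hypothetical) quality of sending program p on
    device d; a value <= 0 means the edge is absent. `conflicts` holds
    unordered pairs of edges that may not be chosen together.
    """
    n_programs = len(weights)
    # Optimistic gain still available from programs p, p+1, ...: each one
    # takes its best edge, ignoring device clashes and conflicts.
    best_edge = [max([w for w in row if w > 0], default=0.0) for row in weights]
    suffix_bound = [0.0] * (n_programs + 1)
    for p in range(n_programs - 1, -1, -1):
        suffix_bound[p] = suffix_bound[p + 1] + best_edge[p]

    best_value = 0.0
    best_match: Dict[int, int] = {}

    def compatible(edge: Edge, chosen: Dict[int, int]) -> bool:
        return all(
            frozenset((edge, other)) not in conflicts for other in chosen.items()
        )

    def branch(p: int, value: float, chosen: Dict[int, int], used: set) -> None:
        nonlocal best_value, best_match
        if value > best_value:
            best_value, best_match = value, dict(chosen)
        if p == n_programs or value + suffix_bound[p] <= best_value:
            return  # leaf reached, or the bound cannot beat the incumbent
        # "Highest quality first": try heavier edges before lighter ones.
        order = sorted(range(len(weights[p])), key=lambda d: -weights[p][d])
        for d in order:
            w = weights[p][d]
            if w <= 0 or d in used or not compatible((p, d), chosen):
                continue
            chosen[p] = d
            used.add(d)
            branch(p + 1, value + w, chosen, used)
            del chosen[p]
            used.remove(d)
        branch(p + 1, value, chosen, used)  # leave program p unmatched

    branch(0, 0.0, {}, set())
    return best_value, best_match
```

Calling `solve_cmbm([[5, 4], [6, 1]])` explores the greedy assignment first and then improves on it; passing a conflict pair such as `frozenset({frozenset({(0, 1), (1, 0)})})` forbids that combination and forces a different matching.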

AdvSwap: Covert Adversarial Perturbation with High Frequency Info-Swapping for Autonomous Driving Perception

International Conference on Intelligent Transportation

Authors: Yuanhao Huang; Qinfan Zhang; Jiandong Xing; Mengyue Cheng; Haiyang Yu; Yilong Ren; Xiao Xiong. Affiliations: School of Transportation Science and Engineering, Beihang University, Beijing, P.R. China; State Key Lab of Intelligent Transportation System, Beijing, P.R. China; Zhongguancun Laboratory, Beijing, P.R. China; Department of Electrical and Computer Engineering, University of Alberta, Edmonton, Canada

ISBN: (digital) 9798331505929

ISBN: (print) 9798331505936

The perception modules of autonomous vehicles (AVs) are increasingly susceptible to attacks that exploit vulnerabilities in neural networks through adversarial inputs, thereby compromising AI safety. Some research focuses on creating covert adversarial samples, but existing global noise techniques are detectable and struggle to deceive the human visual system. This paper introduces a novel adversarial attack method, AdvSwap, which uses wavelet-based high-frequency information swapping to generate covert adversarial samples and fool the camera. AdvSwap employs an invertible neural network for selective high-frequency information swapping, preserving both forward propagation and data integrity. The scheme effectively removes the original label data and incorporates the guidance image data, producing concealed and robust adversarial samples. Experimental evaluations and comparisons on the GTSRB and nuScenes datasets demonstrate that AdvSwap can make concealed attacks on common traffic targets. The generated adversarial samples are also difficult for humans and algorithms to perceive. Meanwhile, the method has strong attack robustness and transferability.

Keywords: Wavelet transforms; Training; Neural networks; Noise; Visual systems; Robustness; Classification algorithms; Data mining; Information exchange; Autonomous vehicles

HOIAnimator: Generating Text-Prompt Human-Object Animations Using Novel Perceptive Diffusion Models

Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Wenfeng Song; Xinyu Zhang; Shuai Li; Yang Gao; Aimin Hao; Xia Hau; Chenglizhao Chen; Ning Li; Hong Qin. Affiliations: Beijing Information Science and Technology University; Zhongguancun Laboratory, China; State Key Laboratory of Virtual Reality Technology and Systems, Beihang University; Research Unit of Virtual Human and Virtual Surgery (2019RU004), Chinese Academy of Medical Sciences; College of Computer Science and Technology, China University of Petroleum (East China); Department of Computer Science, Stony Brook University (SUNY at Stony Brook), New York, USA

ISBN: (digital) 9798350353006

ISBN: (print) 9798350353013

To date, the quest to rapidly and effectively produce human-object interaction (HOI) animations directly from textual descriptions stands at the forefront of computer vision research. The underlying challenge demands both a discriminating interpretation of language and a comprehensive physics-centric model supporting real-world dynamics. To this end, this paper advocates HOIAnimator, a novel and interactive diffusion model with perception ability, crafted to animate complex interactions from linguistic narratives. The effectiveness of our model is anchored in two innovations: (1) Our Perceptive Diffusion Models (PDM) bring together two types of models: one focused on human movements and the other on objects. This combination allows for animations where humans and objects move in concert with each other, making the overall motion more realistic. Additionally, we propose a Perceptive Message Passing (PMP) mechanism to enhance the communication bridging the two models, ensuring that the animations are smooth and unified; (2) We devise an Interaction Contact Field (ICF), a sophisticated model that implicitly captures the essence of HOIs. Beyond merely predicting contact points, the ICF assesses the proximity of human and object to their respective environment, informed by a probabilistic distribution of interactions learned throughout the denoising phase. Our comprehensive evaluation showcases HOIAnimator's superior ability to produce dynamic, context-aware animations that surpass existing benchmarks in text-driven animation synthesis.

Keywords: Computer vision; Technological innovation; Message passing; Computational modeling; Noise reduction; Linguistics; Animation

Gesture interaction in virtual reality

Virtual Reality & Intelligent Hardware, 2019, Vol. 1, No. 1, pp. 84-112

Authors: Yang LI; Jin HUANG; Feng TIAN; Hong-An WANG; Guo-Zhong DAI. Affiliations: Beijing Key Laboratory of Human-Computer Interaction, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; State Key Laboratory of Computer Science, Chinese Academy of Sciences, Beijing 100190, China

With the development of virtual reality (VR) and human-computer interaction technology, how to use natural and efficient interaction methods in the virtual environment has become a hot topic of *** is one of the most important communication methods of human beings, which can effectively express users' *** the past few decades, gesture-based interaction has made significant *** article focuses on the gesture interaction technology and discusses the definition and classification of gestures, input devices for gesture interaction, and gesture interaction recognition *** application of gesture interaction technology in virtual reality is studied, the existing problems in the current gesture interaction are summarized, and future development prospects are outlined.

Keywords: Virtual reality; Gesture interaction; Gesture recognition

Energy Efficiency Optimization for Full-Duplex D2D Communications Underlaying Distributed Antenna Systems

IEEE Transactions on Green Communications and Networking, 2024

Authors: Liu, Zhan; Liao, Zhiyuan; Li, Chunquan; Zhang, Zhijun; Yu, Junzhi; Liu, P.X. Affiliations: Hunan University of Humanities Science and Technology, School of Information, Loudi 417000, China; Nanchang University, School of Information Engineering, Nanchang 330031, China; Jiangxi Provincial Key Laboratory of Intelligent Systems and Human-Machine Interaction, Nanchang 330031, China; South China University of Technology, School of Automation Science and Engineering, Guangzhou 510640, China; Peking University, State Key Laboratory for Turbulence and Complex Systems, Department of Advanced Manufacturing and Robotics, College of Engineering, Beijing 100871, China; Carleton University, Department of Systems and Computer Engineering, Ottawa, ON K1S 5B6, Canada

In this paper, we investigate the total system energy efficiency (EE) of full-duplex (FD) device-to-device (D2D) communications underlaying distributed antenna systems (DAS), where remote access units (RAUs), D2D users (DUs), and cellular users (CUs) are all capable of FD operation. Specifically, we jointly optimize subcarrier assignment and power allocation under the quality of service (QoS) requirements of CUs and DUs and the maximum power constraints of RAUs, CUs, and DUs. In addition, we propose a novel spectrum sharing strategy that allows each subcarrier to be assigned to multiple CUs and/or multiple D2D pairs (DPs) for flexibility. To solve the formulated non-convex optimization problem, we first employ fractional programming to transform the objective function from the fractional form into the equivalent subtractive form. Then, an efficient iterative resource allocation algorithm is proposed, which needs to solve an inner problem in each iteration. After relaxing the variables and introducing penalty factors, the non-convex inner problem is transformed into a convex problem through the successive convex approximation (SCA) method and solved by an iterative algorithm. Simulation results demonstrate that the proposed algorithm can considerably improve the system EE compared to other benchmark schemes. Furthermore, the proposed spectrum sharing strategy is superior to the existing sharing strategies. © 2017 IEEE.

Keywords: Convex optimization
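
The step from a fractional objective to a subtractive one, solved by an outer iteration, can be sketched with a Dinkelbach-style loop, which is one standard way to do this and is assumed here rather than taken from the paper. The inner problem is solved by brute force over a grid instead of the SCA procedure the abstract describes, and the single-link rate and power models, the function name `dinkelbach`, and all numbers are illustrative assumptions.

```python
import math
from typing import Callable, Sequence, Tuple


def dinkelbach(
    numerator: Callable[[float], float],
    denominator: Callable[[float], float],
    candidates: Sequence[float],
    tol: float = 1e-9,
    max_iter: int = 100,
) -> Tuple[float, float]:
    """Maximize numerator(x) / denominator(x) over a finite candidate set.

    Assumes denominator(x) > 0 for every candidate. Returns (x, ratio).
    """
    x = candidates[0]
    ratio = numerator(x) / denominator(x)
    for _ in range(max_iter):
        # Inner problem in subtractive form: max_x N(x) - ratio * D(x).
        x = max(candidates, key=lambda c: numerator(c) - ratio * denominator(c))
        gap = numerator(x) - ratio * denominator(x)
        ratio = numerator(x) / denominator(x)
        if gap < tol:  # the subtractive objective has reached zero
            break
    return x, ratio


# Toy single-link example (all numbers are illustrative assumptions).
def rate(p: float) -> float:
    return math.log2(1.0 + 10.0 * p)  # spectral efficiency at power p


def consumed(p: float) -> float:
    return p + 0.5  # transmit power plus a fixed circuit power


powers = [i / 100 for i in range(1, 501)]  # candidate transmit powers
p_star, ee_star = dinkelbach(rate, consumed, powers)
```

Each outer iteration raises the current ratio until the best value of the subtractive objective drops to zero, at which point the ratio equals the maximum energy efficiency over the candidate set.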

Trajectory prediction model for crossing-based target selection

Virtual Reality & Intelligent Hardware, 2019, Vol. 1, No. 3, pp. 330-340

Authors: Hao ZHANG; Jin HUANG; Feng TIAN; Guozhong DAI; Hongan WANG. Affiliations: Beijing Key Lab of Human-Computer Interaction, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China; State Key Lab of Computer Science, Institute of Software, Chinese Academy of Sciences, Beijing 100190, China

Background: Crossing-based target selection motion may attain lower error rates and higher interactive speed in some *** of the research in target selection fields are focused on the analysis of the interaction ***, as trajectories play a much more important role in crossing-based target selection compared to the other interactive techniques, an ideal model for trajectories can help computer designers make predictions about interaction results during the process of target selection rather than at the end of the whole *** In this paper, a trajectory prediction model for crossing-based target selection tasks is proposed by taking the reference of a dynamic model *** Simulation results demonstrate that our model performed well with regard to the prediction of trajectories, endpoints and hitting time for target-selection motion, and the average errors of the trajectories, endpoints and hitting time values were found to be 17.28%, 2.73 mm and 11.50%, respectively.

Keywords: Target selection; Crossing-based selection; Trajectory prediction

Potential Indicator for Continuous Emotion Arousal by Dynamic Neural Synchrony

arXiv, 2025

Authors: Pan, Guandong; Wu, Zhaobang; Yang, Yaqian; Wang, Xin; Liu, Longzhao; Zheng, Zhiming; Tang, Shaoting. Affiliations: School of Computer Science and Engineering, Beihang University, Beijing 100191, China; Institute of Artificial Intelligence, Beihang University, Beijing 100191, China; Key Laboratory of Mathematics Informatics and Behavioral Semantics, Beihang University, Beijing 100191, China; Institute of Medical Artificial Intelligence, Binzhou Medical University, Yantai 264003, China; Zhongguancun Laboratory, Beijing 100094, China; Beijing Advanced Innovation Center for Future Blockchain and Privacy Computing, Beihang University, Beijing 100191, China; PengCheng Laboratory, Shenzhen 518055, China; State Key Lab of Software Development Environment, Beihang University, Beijing 100191, China

The need for automatic and high-quality emotion annotation is paramount in applications such as continuous emotion recognition and video highlight detection, yet achieving this through manual human annotations is challenging. Inspired by inter-subject correlation (ISC) utilized in neuroscience, this study introduces a novel electroencephalography (EEG) based ISC methodology that leverages a single-electrode and feature-based dynamic approach. Our contributions are threefold. First, we re-identify two potent emotion features suitable for classifying emotions: first-order difference (FD) and differential entropy (DE). Second, through the use of overall correlation analysis, we demonstrate the heterogeneous synchronized performance of electrodes. This performance aligns with neural emotion patterns established in prior studies, thus validating the effectiveness of our approach. Third, by employing a sliding window correlation technique, we showcase the significant consistency of dynamic ISCs across various features or key electrodes in each analyzed film clip. Our findings indicate the method’s reliability in capturing consistent, dynamic shared neural synchrony among individuals, triggered by evocative film stimuli. This underscores the potential of our approach to serve as an indicator of continuous human emotion arousal. The implications of this research are significant for advancements in affective computing and the broader neuroscience field, suggesting a streamlined and effective tool for emotion analysis in real-world applications. © 2025, CC BY-NC-SA.

Keywords: Neurons
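
The sliding window correlation mentioned in this abstract can be sketched as follows. This is a minimal illustration under assumptions, not the authors' pipeline: the helper names `pearson` and `dynamic_isc`, the use of the mean pairwise Pearson correlation as the ISC value, and the input layout (one feature series per subject, at least two subjects) are all hypothetical.

```python
from itertools import combinations
from math import sqrt
from typing import List, Sequence


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation; returns 0.0 when either input is constant."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def dynamic_isc(
    subjects: Sequence[Sequence[float]], window: int, step: int = 1
) -> List[float]:
    """Mean pairwise correlation across subjects in each sliding window.

    subjects[s][t] is one feature value (for example from a single
    electrode) for subject s at time index t. All series must share the
    same length, and at least two subjects are required.
    """
    length = len(subjects[0])
    pairs = list(combinations(range(len(subjects)), 2))
    curve = []
    for start in range(0, length - window + 1, step):
        stop = start + window
        scores = [
            pearson(subjects[i][start:stop], subjects[j][start:stop])
            for i, j in pairs
        ]
        curve.append(sum(scores) / len(scores))
    return curve
```

The returned list has one value per window position, so it can be read as a time course of how strongly the subjects' features move together during a stimulus.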

Potential Indicator for Continuous Emotion Arousal by Dynamic Neural Synchrony

4th International Workshop on Human Brain and Artificial Intelligence, HBAI 2024

Authors: Pan, Guandong; Wu, Zhaobang; Yang, Yaqian; Wang, Xin; Liu, Longzhao; Zheng, Zhiming; Tang, Shaoting. Affiliations: School of Computer Science and Engineering, Beihang University, Beijing 100191, China; Institute of Artificial Intelligence, Beihang University, Beijing 100191, China; Key Laboratory of Mathematics Informatics and Behavioral Semantics, Beihang University, Beijing 100191, China; Institute of Medical Artificial Intelligence, Binzhou Medical University, Yantai 264003, China; Zhongguancun Laboratory, Beijing 100094, China; Beijing Advanced Innovation Center for Future Blockchain and Privacy Computing, Beihang University, Beijing 100191, China; PengCheng Laboratory, Shenzhen 518055, China; State Key Lab of Software Development Environment, Beihang University, Beijing 100191, China

ISBN: (print) 9789819640003

The need for automatic and high-quality emotion annotation is paramount in applications such as continuous emotion recognition and video highlight detection, yet achieving this through manual human annotations is challenging. Inspired by inter-subject correlation (ISC) utilized in neuroscience, this study introduces a novel electroencephalography (EEG) based ISC methodology that leverages a single-electrode and feature-based dynamic approach. Our contributions are threefold. First, we re-identify two potent emotion features suitable for classifying emotions: first-order difference (FD) and differential entropy (DE). Second, through the use of overall correlation analysis, we demonstrate the heterogeneous synchronized performance of electrodes. This performance aligns with neural emotion patterns established in prior studies, thus validating the effectiveness of our approach. Third, by employing a sliding window correlation technique, we showcase the significant consistency of dynamic ISCs across various features or key electrodes in each analyzed film clip. Our findings indicate the method’s reliability in capturing consistent, dynamic shared neural synchrony among individuals, triggered by evocative film stimuli. This underscores the potential of our approach to serve as an indicator of continuous human emotion arousal. The implications of this research are significant for advancements in affective computing and the broader neuroscience field, suggesting a streamlined and effective tool for emotion analysis in real-world applications. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

Keywords: Electroencephalography (EEG); Emotion Annotation; Inter-subject Correlation
