检索结果-内蒙古大学图书馆

IEEE International Conference on Image Processing

作者： Suyuan Huang Haoxin Zhang Yanyu Xu Yan Gao Yao Hu Zengchang Qin Intelligent Computing and Machine Learning Lab School of ASEE Beihang University Xiaohongshu Inc. Institute of High Performance Computing A*Star Guangzhou Zhongsuan Cloud Technology Co.. Ltd.

ISBN: (数字)9798350349399

ISBN: (纸本)9798350349405

Video action segmentation aims to identify and localize actions. Existing models have achieved impressive performance with pre-extracted frame-level features, but this may limit zero-shot learning and cross-dataset inference, especially for new actions or scenes. To overcome this problem, we propose a novel end-to-end network designed for robust performance across both familiar and novel action segmentation scenarios. Our approach combines a plug-and-play visual prompt module enhancing CLIP features’ temporal understanding, and a learnable text prompt that enriches label semantics and refines the model’s focus, significantly boosting performance. Our results demonstrate that CLIP features can assist in action segmentation tasks, and prompts can improve task effectiveness. Furthermore, our findings show that CLIP features contain information that i3d features do not. We evaluate the proposed method on several video datasets, including Georgia Tech Egocentric Activities (GTEA), 50Salads, and Breakfast, and the results show that the proposed model outperforms existing SOTA models.

关键词： Image segmentation Visualization Zero-shot learning Surveillance Semantics Refining Human-robot interaction

来源：评论

学校读者我要写书评

暂无评论

Learning to Localize Cross-Anatomy Landmarks in X-Ray Images with a Universal Model

引用

Biomedical Engineering Frontiers 2022年第1期3卷 298-308页

作者： Heqin Zhu Qingsong Yao Li Xiao S.Kevin Zhou Key Lab of Intelligent Information Processing of Chinese Academy of Sciences(CAS) Institute of Computing TechnologyCASBeijing 100190China Center for Medical Imaging RoboticsAnalytic Computing&Learning(MIRACLE)School of Biomedical Engineering&Suzhou Institute for Advanced ResearchUniversity of Science and Technology of ChinaSuzhou 215123China

Objective and Impact *** this work,we develop a universal anatomical landmark detection model which learns once from multiple datasets corresponding to different anatomical *** with the conventional model trained on a single dataset,this universal model not only is more light weighted and easier to train but also improves the accuracy of the anatomical landmark *** accurate and automatic localization of anatomical landmarks plays an essential role in medical image ***,recent deep learning-based methods only utilize limited data from a single *** is promising and desirable to build a model learned from different regions which harnesses the power of big *** model consists of a local network and a global network,which capture local features and global features,*** local network is a fully convolutional network built up with depth-wise separable convolutions,and the global network uses dilated convolution to enlarge the receptive field to model global *** evaluate our model on four 2D X-ray image datasets totaling 1710 images and 72 landmarks in four anatomical *** experimental results show that our model improves the detection accuracy compared to the state-of-the-art *** model makes the first attempt to train a single network on multiple datasets for landmark *** results qualitatively and quantitatively show that our proposed model performs better than other models trained on multiple datasets and even better than models trained on a single dataset separately.

关键词： convolution utilize separable

来源：评论

学校读者我要写书评

暂无评论

Nested relation extraction with iterative neural network

引用

Frontiers of Computer Science 2021年第3期15卷 109-122页

作者： Yixuan CAO Dian CHEN Zhengqi XU Hongwei LI Ping LUO Key Lab of Intelligent Information Processing of Chinese Academy of Sciences(CAS) Institute of Computing TechnologyCASBeijing 100190China University of Chinese Academy of Sciences Beijing 100049China

Most existing researches on relation extraction focus on binary flat relations like Bomln relation between a Person and a *** a large portion of objective facts de-scribed in natural language are complex,especially in professional documents in fields such as finance and biomedicine that require precise *** example,“the GDP of the United States in 2018 grew 2.9%compared with 2017”describes a growth rate relation between two other relations about the economic index,which is beyond the expressive power of binary flat ***,we propose the nested relation extraction problem and formulate it as a directed acyclic graph(DAG)structure extraction ***,we propose a solution using the Iterative Neural Network which extracts relations layer by *** proposed solution achieves 78.98 and 97.89 FI scores on two nested relation extraction tasks,namely semantic cause-and-efFect relation extraction and formula ***,we observe that nested relations are usually expressed in long sentences where entities are mentioned repetitively,which makes the annotation difficult and ***,we extend our model to incorporate a mention-insensitive mode that only requires annotations of relations on entity concepts(instead of exact mentions)while preserving most of its *** mention-insensitive model performs better than the mention sensitive model when the random level in mention selection is higher than 0.3.

关键词： nested relation extraction mention insensitive relation iterative neural network

来源：评论

学校读者我要写书评

暂无评论

A Case Study of Dependency Network for Building Packages: The Fedora Linux Distribution 35

A Case Study of Dependency Network for Building Packages: Th...

引用

35th International Conference on Software Engineering and Knowledge Engineering, SEKE 2023

作者： Du, Jiman Zhu, Jiaxin Li, Hui Chen, Wei Xu, Lijie Liu, Jie Chen, Zhifeng School of Computer Electronics and Information Guangxi University China State Key Lab of Computer Science ISCAS University of CAS China University of Chinese Academy of Sciences Nanjing China Nanjing Institute of Software Technology China MIIT Key Lab of Cloud Computing Standards and Applications China Electronic Standardization Institute China

To port the Linux distributions to a new Instruction Set Architecture (ISA), developers have to rebuild the software packages of the distributions. The complex dependencies of the software packages bring a great challenge. It is important to understand and properly handle the dependencies. We selected Fedora, a typical Linux distribution, and studied the dependencies within the software repositories of aarch64 and x86_64 architecture. We proposed a package dependency network framework to study the roles played by different packages. We obtained three network dependency patterns and proposed the corresponding division strategies which help developer build the source packages in parallel. Our study reveals that the key packages located at the root of multiple dependency chains significantly impact the division of the network, and their builds should be prioritized. Meanwhile, some packages with external dependencies can be temporarily masked to make a sub-network independent. Furthermore, the network dependency patterns are also observed in Fedora 33 riscv64 and OpenEuler riscv64. Our findings can help researchers have a better knowledge of Linux distribution dependency network and help practitioners conduct efficient package builds. © 2023 Knowledge Systems Institute Graduate School. All rights reserved.

关键词： Software packages

来源：评论

学校读者我要写书评

暂无评论

Generalized-Extended-State-Observer and Equivalent-Input-Disturbance Methods for Active Disturbance Rejection: Deep Observation and Comparison

引用

IEEE/CAA Journal of Automatica Sinica 2023年第4期10卷 957-968页

作者： Jinhua She Kou Miyamoto Qing-Long Han Min Wu Hiroshi Hashimoto Qing-Guo Wang School of Engineering Tokyo University of TechnologyHachiojiTokyo 192-0982Japan K.Miyamoto is with the Institute of Technology Shimizu CorporationKotoTokyo 135-0044Japan School of Science Computing and Engineering TechnologiesSwinburne University of TechnologyMelbourneVIC 3122Australia School of Automation China University of GeosciencesWuhan 430074 Hubei Key Laboratory of Advanced Control and Intelligent Automation for Complex Systems Engineering Research Center of Intelligent Technology for Geo-Exploration Ministry of EducationWuhan 430074China School of Industrial Technology Advanced Institute of Industrial TechnologyTokyo 140-0011Japan Institute of Artificial Intelligence and Future Networks Beijing Normal UniversityZhuhai 519087 Guangdong Key Lab of AI and Multi-Modal Data Processing Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science BNUHKBU United International College Zhuhai 519087China

Active disturbance-rejection methods are effective in estimating and rejecting disturbances in both transient and steady-state *** paper presents a deep observation on and a comparison between two of those methods:the generalized extended-state observer(GESO)and the equivalent input disturbance(EID)from assumptions,system configurations,stability conditions,system design,disturbance-rejection performance,and extensibility.A time-domain index is introduced to assess the disturbance-rejection performance.A detailed observation of disturbance-suppression mechanisms reveals the superiority of the EID approach over the GESO method.A comparison between these two methods shows that assumptions on disturbances are more practical and the adjustment of disturbance-rejection performance is easier for the EID approach than for the GESO method.

关键词： Active disturbance-rejection control(ADRC) disturbance observer(DOB) equivalent input disturbance(EID) extendedstate observer(ESO) generalized extended-state observer(GESO)

来源：评论

学校读者我要写书评

暂无评论

Adaptively feature matching via joint transformational-spatial clustering

Adaptively feature matching via joint transformational-spati...

引用

作者： Wang, Linbo Tan, Li Fang, Xianyong Guo, Yanwen Wan, Shaohua MOE Key Laboratory of Intelligent Computing and Signal Processing School of Computer Science and Technology Anhui University Hefei China National Key Lab for Novel Software Technology Nanjing University Nanjing China School of Information and Safety Engineering Zhongnan University of Economics and Law Wuhan China

The transformational and spatial proximities are important cues for identifying inliers from an appearance based match set because correct matches generally stay close in input images and share similar local transformations. However, most existing approaches only check one type of them or both types consecutively with manually set thresholds, and thus their matching accuracy and flexibility in handling large-scale images are limited. In this paper, we present an efficient clustering based approach to identify match inliers with both proximities simultaneously. It first projects the putative matches into a joint transformational-spatial space, where mismatches tend to scatter all around while correct matches gather together. A mode-seeking process based on joint kernel density estimation is then proposed to obtain significant clusters in the joint space, where each cluster contains matches mapping the same object across images with high accuracy. Moreover, kernel bandwidths for measuring match proximities are adaptively set during density estimation, which enhances its applicability for matching different images. Experiments on three standard datasets show that the proposed approach delivers superior performance on a variety of feature matching tasks, including multi-object matching, duplicate object matching and object retrieval. © 2021, The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature.

关键词： Clustering Density estimation Feature matching Mode-seeking

来源：评论

学校读者我要写书评

暂无评论

DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Object Detection 38

DA-Ada: Learning Domain-Aware Adapter for Domain Adaptive Ob...

引用

38th Conference on Neural Information Processing Systems, NeurIPS 2024

作者： Li, Haochen Zhang, Rui Yao, Hantao Zhang, Xin Hao, Yifan Song, Xinkai Li, Xiaqing Zhao, Yongwei Li, Ling Chen, Yunji Intelligent Software Research Center Institute of Software CAS Beijing China State Key Lab of Processors Institute of Computing Technology CAS Beijing China State Key Laboratory of Multimodal Artificial Intelligence Systems Institute of Automation CAS Beijing China University of Chinese Academy of Sciences Beijing China

Domain adaptive object detection (DAOD) aims to generalize detectors trained on an annotated source domain to an unlabelled target domain. As the visual-language models (VLMs) can provide essential general knowledge on unseen images, freezing the visual encoder and inserting a domain-agnostic adapter can learn domain-invariant knowledge for DAOD. However, the domain-agnostic adapter is inevitably biased to the source domain. It discards some beneficial knowledge discriminative on the unlabelled domain, ***-specific knowledge of the target domain. To solve the issue, we propose a novel Domain-Aware Adapter (DA-Ada) tailored for the DAOD task. The key point is exploiting domain-specific knowledge between the essential general knowledge and domain-invariant knowledge. DA-Ada consists of the Domain-Invariant Adapter (DIA) for learning domain-invariant knowledge and the Domain-Specific Adapter (DSA) for injecting the domain-specific knowledge from the information discarded by the visual encoder. Comprehensive experiments over multiple DAOD tasks show that DA-Ada can efficiently infer a domain-aware visual encoder for boosting domain adaptive object detection. Our code is available at https://***/Therock90421/DA-Ada. © 2024 Neural information processing systems foundation. All rights reserved.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Text Semantic Analysis and intelligent Interaction in Parallel Dispatching Systems for Technical Station 4

Text Semantic Analysis and Intelligent Interaction in Parall...

引用

4th IEEE International Conference on Digital Twins and Parallel Intelligence, DTPI 2024

作者： Jiao, Yuantao Wang, Jian Li, Runmei Xiong, Gang Chen, Shichao Beijing Jiaotong University School of Automation and Intelligence Beijing China Institute of Automation Chinese Academy of Sciences Beijing Engineering Research Center of Intelligent Systems and Technology Beijing China Cloud Computing Center Chinese Academy of Sciences Guangdong Engineering Research Center of 3D Printing and Intelligent Manufacturing Dongguan China Chinese Academy of Sciences State Key Laboratory for Multimodal Artificial Intelligence Systems Beijing China

ISBN: (纸本)9798350349252

Technical station dispatching system plays an important role in cargo operation, but due to the large number of dispatching systems and complex operation, dispatchers rely on manual experience to complete the task, and many scenarios cannot be applied in the real system. Therefore, this paper utilizes the data-driven form to establish a parallel dispatching system for technical stations, semantically analyzes text data rich in dispatcher experience, completes the Computational Experiments on the textual semantic data in the artificial dispatching system, and synchronizes the Parallel Execution of data interaction with the real dispatching system. © 2024 IEEE.

关键词： parallel intelligence railway dispatching system technical station

来源：评论

学校读者我要写书评

暂无评论

An Actor-Critic Framework Deep Reinforcement Learning Approach to High-speed Train Timetable Rescheduling Problem 4

An Actor-Critic Framework Deep Reinforcement Learning Approa...

引用

4th IEEE International Conference on Digital Twins and Parallel Intelligence, DTPI 2024

作者： Tian, Zhilong Li, Runmei Xiong, Gang Zhu, Fenghua Beijing Jiaotong University School of Automation and Intelligence Beijing China Chinese Academy of Sciences Beijing Engineering Research Center of Intelligent Systems and Technology Institute of Automation Beijing China Chinese Academy of Sciences Guangdong Engineering Research Center of 3D Printing and Intelligent Manufacturing Cloud Computing Center Dongguan China Chinese Academy of Sciences State Key Laboratory for Multimodal Artificial Intelligence Systems Beijing China

ISBN: (纸本)9798350349252

High-speed trains are inevitably affected by emergencies in their daily operation, which may cause the trains to fail to run according to the original planned timetable. Therefore, how to adjust the timetable of subsequent trains in emergencies is crucial for railway operations. This paper introduces a deep reinforcement learning algorithm based on an actor-critic network, which minimizes the total delay time by defining the departure order as an action. Numerical experiments are performed on the timetable of Beijing-Shanghai high-speed railway line. © 2024 IEEE.

关键词： Deep reinforcement learning

来源：评论

学校读者我要写书评

暂无评论

Contrastive Learning for Robust Android Malware Familial Classification

引用

IEEE Transactions on Dependable and Secure computing 2022年 1-14页

作者： Wu, Yueming Dou, Shihan Zou, Deqing Yang, Wei Qiang, Weizhong Jin, Hai National Engineering Research Center for Big Data Technology and System Services Computing Technology and System Lab Hubei Engineering Research Center on Big Data Security School of Cyber Science and Engineering Huazhong University of Science and Technology Wuhan China Shanghai Key Laboratory of Intelligent Information Processing School of Computer Science Fudan University Shanghai China University of Texas at Dallas Dallas USA National Engineering Research Center for Big Data Technology and System Services Computing Technology and System Lab Cluster and Grid Computing Lab School of Computer Science and Technology Huazhong University of Science and Technology Wuhan China

Due to its open-source nature, Android operating system has been the main target of attackers to exploit. Malware creators always perform different code obfuscations on their apps to hide malicious activities. Features extracted from these obfuscated samples through program analysis contain many useless and disguised features, which leads to many false negatives. To address the issue, in this paper, we demonstrate that obfuscation-resilient malware family analysis can be achieved through contrastive learning. The key insight behind our analysis is that contrastive learning can be used to reduce the difference introduced by obfuscation while amplifying the difference between malware and other types of malware. Based on the proposed analysis, we design a system that can achieve robust and interpretable classification of Android malware. To achieve robust classification, we perform contrastive learning on malware samples to learn an encoder that can automatically extract robust features from malware samples. To achieve interpretable classification, we transform the function call graph of a sample into an image by centrality analysis. Then the corresponding heatmaps can be obtained by visualization techniques. These heatmaps can help users understand why the malware is classified as this family. We implement IFDroid and perform extensive evaluations on two datasets. Experimental results show that IFDroid is superior to state-of-the-art Android malware familial classification systems. Moreover, IFDroid is capable of maintaining a 98.4% F1 on classifying 69,421 obfuscated malware samples. IEEE

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：