检索结果-内蒙古大学图书馆

41st International Conference on machine Learning, ICML 2024

作者： Lian, Shijie Zhang, Ziyi Li, Hua Li, Wenjie Yang, Laurence Tianruo Kwong, Sam Cong, Runmin Hainan University China China Ministry of Education China Huazhong University of Science and Technology China St. Francis Xavier University Canada LIngnan University School of Data Science Hong Kong Shandong University School of Control Science and Engineering China Key Laboratory of Machine Intelligence and System Control Ministry of Education Jinan China

With the breakthrough of large models, Segment Anything Model (SAM) and its extensions have been attempted to apply in diverse tasks of computer vision. Underwater salient instance segmentation is a foundational and vital step for various underwater vision tasks, which often suffer from low segmentation accuracy due to the complex underwater circumstances and the adaptive ability of models. Moreover, the lack of large-scale datasets with pixel-level salient instance annotations has impeded the development of machine learning techniques in this field. To address these issues, we construct the first large-scale underwater salient instance segmentation dataset (USIS10K), which contains 10,632 underwater images with pixel-level annotations in 7 categories from various underwater scenes. Then, we propose an Underwater Salient Instance Segmentation architecture based on Segment Anything Model (USIS-SAM) specifically for the underwater domain. We devise an Underwater Adaptive Visual Transformer (UA-ViT) encoder to incorporate underwater domain visual prompts into the segmentation network. We further design an out-of-the-box underwater Salient Feature Prompter Generator (SFPG) to automatically generate salient prompters instead of explicitly providing foreground points or boxes as prompts in SAM. Comprehensive experimental results show that our USIS-SAM method can achieve superior performance on USIS10K datasets compared to the state-of-the-art methods. Datasets and codes are released on Github. Copyright 2024 by the author(s)

关键词： Pixels

来源：评论

学校读者我要写书评

暂无评论

A Pilot Study on Coupling Behaviors of Limb Movements and Brain Activation Based on EEG and fNIRS Synchronization Information

A Pilot Study on Coupling Behaviors of Limb Movements and Br...

引用

2023 IEEE International Conference on Real-Time Computing and Robotics, RCAR 2023

作者： Tang, Xi Li, Qingge Tang, Yao Tian, Lan Zheng, Yue Li, Xiangxin Jiang, Naifu Shang, Peng Li, Guanglin Peng, Liang Fang, Peng Shenzhen Institute of Advanced Technology Chinese Academy of Sciences Cas Key Laboratory of Human-Machine Intelligence-Synergy Systems The Shenzhen Engineering Laboratory of Neural Rehabilitation Technology Shenzhen518055 China University of Chinese Academy of Sciences Shenzhen College of Advanced Technology Shenzhen518055 China Chinese Academy of Sciences State Key Laboratory of Management and Control for Complex Systems Institute of Automation Beijing100190 China

ISBN: (纸本)9798350327182

Motion is one of the basic physiological functions of human beings. However, many brain diseases such as stroke may cause different degrees of motor dysfunctions for patients. As a commonly used rehabilitation method, active and passive exercise training may enhance patients' neuromuscular functions and recover their motor abilities. It is known that limb movements are strongly coupled with brain activation but there is currently insufficient exploration on the coupling behaviors from the perspective of informatics. In this study, the coupling relationship between limb movements and brain activation was preliminarily studied based on three healthy subjects. Electroencephalogram (EEG) and functional near-infrared spectroscopy (fNIRS) signals were synchronously collected during lower limb movements, and time-frequency analysis (TFA) and transfer entropy (TE) analysis were performed to quantitatively study the brain activation behaviors. In the experiments, a desynchronization phenomenon of μ rhythm in EEG was observed during exercise states, and the experimental results demonstrate the activation rule of motor and prefrontal cortexes upon limb movements. Calculations show that there exists a bidirectional flow of information between EEG and cerebral oxygen metabolism signals, but with a difference between different directions. This work may support the rehabilitation for patients with motor dysfunctions with a guidance of quantitative indicators and also benefit the exploration on neuroscience. © 2023 IEEE.

关键词： Electroencephalography

来源：评论

学校读者我要写书评

暂无评论

Specificity-preserving RGB-D saliency detection

引用

Computational Visual Media 2023年第2期9卷 297-317页

作者： Tao Zhou Deng-Ping Fan Geng Chen Yi Zhou Huazhu Fu School of Computer Science and Engineering Nanjing University of Science and TechnologyNanjing 210094China Key Laboratory of System Control and Information Processing Ministry of EducationShanghaiChina Computer Vision Lab ETH Z¨urichZ¨urichSwitzerland School of Computer Science and Engineering Northwestern Polytechnical UniversityXi’anChina School of Computer Science and Engineering Southeast UniversityNanjingChina Inception Institute of Artificial Intelligence Abu DhabiUnited Arab Emirates

Salient object detection(SOD)in RGB and depth images has attracted increasing research *** RGB-D SOD models usually adopt fusion strategies to learn a shared representation from RGB and depth modalities,while few methods explicitly consider how to preserve modality-specific *** this study,we propose a novel framework,the specificity-preserving network(SPNet),which improves SOD performance by exploring both the shared information and modality-specific ***,we use two modality-specific networks and a shared learning network to generate individual and shared saliency prediction *** effectively fuse cross-modal features in the shared learning network,we propose a cross-enhanced integration module(CIM)and propagate the fused feature to the next layer to integrate cross-level ***,to capture rich complementary multi-modal information to boost SOD performance,we use a multi-modal feature aggregation(MFA)module to integrate the modalityspecific features from each individual decoder into the shared *** using skip connections between encoder and decoder layers,hierarchical features can be fully *** experiments demonstrate that our SPNet outperforms cutting-edge approaches on six popular RGB-D SOD and three camouflaged object detection *** project is publicly available at https://***/taozh2017/SPNet.

关键词： salient object detection(SOD) RGB-D cross-enhanced integration module(CIM) multi-modal feature aggregation(MFA)

来源：评论

学校读者我要写书评

暂无评论

Chinese Text Classification Using BERT and Flat-Lattice Transformer 11th

Chinese Text Classification Using BERT and Flat-Lattice Tran...

引用

11th International Conference on Artificial intelligence and Mobile Services, AIMS 2022 held as Part of the Services Conference Federation, SCF 2022

作者： Lv, Haifeng Ning, Yishuang Ning, Ke Ji, Xiaoyu He, Sheng Guangxi Key Laboratory of Machine Vision and Intelligent Control WuZhou University Wuzhou China Kingdee Research Kingdee International Software Group Company Limited Shenzhen China Guangxi Colleges and Universities Key Laboratory of Industry Software Technology Wuzhou University Wuzhou543002 China

ISBN: (纸本)9783031235030

Recently, large scale pre-trained language models such as BERT and models with lattice structure that consisting of character-level and word-level information have achieved state-of-the-art performance in most downstream natural language processing (NLP) tasks, including named entity recognition (NER), English text classification and sentiment analysis. For Chinese text classification, the existing methods have also tried such kinds of models. However, they cannot obtain the desired results since these pre-trained models are based on characters, which cannot be applied for Chinese language that is based on words. To address this problem, in this paper, we propose BFLAT which a simple but efficient model for Chinese text classification. Specifically, BFLAT utilizes BERT and word2vec to learn character-level and word-level vector representations, and then adopts the flat-lattice transformer to integrate both of the two-level vector representations. Experimental results on two datasets demonstrate that our proposed method outperforms the baseline methods over 1.38–21.82% and 3.42–20.7% in terms of relative F1-measure on two Chinese text classification benchmarks, respectively. © 2022, The Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： Sentiment analysis

来源：评论

学校读者我要写书评

暂无评论

Diffusion Posterior Proximal Sampling for Image Restoration 24

Diffusion Posterior Proximal Sampling for Image Restoration

引用

32nd ACM International Conference on Multimedia, MM 2024

作者： Wu, Hongjie He, Linchao Zhang, Mingqin Chen, Dongdong Luo, Kunming Luo, Mengting Zhou, Ji-Zhe Chen, Hu Lv, Jiancheng College of Computer Science Sichuan University Chengdu China National Key Laboratory of Fundamental Science on Synthetic Vision Sichuan University Chengdu China Heriot-Watt University Edinburgh United Kingdom Hong Kong University of Science and Technology Hong Kong Engineering Research Center of Machine Learning and Industry Intelligence Ministry of Education China College of Computer Science Sichuan University China

ISBN: (纸本)9798400706868

Diffusion models have demonstrated remarkable efficacy in generating high-quality samples. Existing diffusion-based image restoration algorithms exploit pre-trained diffusion models to leverage data priors, yet they still preserve elements inherited from the unconditional generation paradigm. These strategies initiate the denoising process with pure white noise and incorporate random noise at each generative step, leading to over-smoothed results. In this paper, we present a refined paradigm for diffusion-based image restoration. Specifically, we opt for a sample consistent with the measurement identity at each generative step, exploiting the sampling selection as an avenue for output stability and enhancement. The number of candidate samples used for selection is adaptively determined based on the signal-to-noise ratio of the timestep. Additionally, we start the restoration process with an initialization combined with the measurement signal, providing supplementary information to better align the generative process. Extensive experimental results and analyses validate that our proposed method significantly enhances image restoration performance while consuming negligible additional computational resources. © 2024 ACM.

关键词： Image reconstruction

来源：评论

学校读者我要写书评

暂无评论

Laser tracking leader-follower automatic cooperative navigation system for UAVs

引用

International Journal of Agricultural and Biological Engineering 2022年第2期15卷 165-176页

作者： Rui Ming Zhiyan Zhou Zichen Lyu Xiwen Luo Le Zi Cancan Song Yu Zang Wei Liu Rui Jiang College of Engineering South China Agricultural University/Guangdong Laboratory for Lingnan Modern AgricultureGuangzhou 510642China College of Computer and Control Engineering Minjiang University/Fujian Provincial Key Laboratory of Information Processing and Intelligent ControlFuzhou 350108China Guangdong Provincial Key Laboratory for Agricultural Artificial Intelligence(GDKL-AAI) Guangzhou 510642China Guangdong Engineering Research Center for Agricultural Aviation Application(ERCAAA) Guangzhou 510642China Key Laboratory of Key Technology on Agricultural Machine and Equipment South China Agricultural UniversityMinistry of Education of P.R.ChinaGuangzhou 510642China

Currently,small payload and short endurance are the main problems of a single UAV in agricultural applications,especially in large-scale *** is one of the important methods to solve the above problems of UAVs by improving operation efficiency through multi-UAV cooperative *** study proposed a laser tracking leader-follower automatic cooperative navigation system for *** leader in the cluster fires a laser beam to irradiate the follower,and the follower performs a visual tracking flight according to the light spot at the relative position of the laser tracking *** on the existing kernel correlation filter(KCF)tracking algorithm,an improved KCF real-time spot tracking method was *** with the traditional KCF tracking algorithm,the recognition and tracking rate of the optimized algorithm was increased from 70%to 95%in indoor environment,and was increased from 20%to 90%in outdoor *** navigation control method was studied from two aspects:the distance coordinate transformation model based on micro-gyroscope and navigation control *** error of spot position was reduced from the maximum(3.12,−3.66)cm to(0.14,0.12)cm by correcting the deviation distance of the spot at different angles through a coordinate correction *** image coordinate conversion model was established for a complementary metal-oxide-semiconductor(CMOS)camera and laser receiving device at different mounting *** laser receiving device was divided into four regions,S0-S3,and the speed of the four regions is calculated using an uncontrollable discrete Kalman *** outdoor flight experiments of two UAVs were carried out outdoors using this *** experiment results show that the average flight error of the two UAVs on the X-axis is 5.2 cm,and the coefficient of variation is *** average flight error on the Z-axis is 7.3 cm,and the coefficient of variation is *** study demonstrated the possibility

关键词： two-UAVs cooperative visual navigation laser tracking

来源：评论

学校读者我要写书评

暂无评论

A Preliminary Study on the Functional Coupling between Nerve and Blood Microcirculation for Applications in Rehabilitation Robots

A Preliminary Study on the Functional Coupling between Nerve...

引用

2023 IEEE International Conference on Robotics and Biomimetics, ROBIO 2023

作者： Li, Qingge Dong, Yuanzhe Zhang, Yuxiang Wang, Xin Jiang, Naifu Huang, Jianping Cui, Han Tian, Lan Zheng, Yue Li, Xiangxin Wang, Lin Li, Guanglin Liang, Wenyuan Peng, Liang Fang, Peng Shenzhen Institute of Advanced Technology Cas Key Laboratory of Human-Machine Intelligence-Synergy Systems Shenzhen518055 China University of Chinese Academy of Sciences Shenzhen College of Advanced Technology Shenzhen518055 China National Research Center for Rehabilitation Technical Aids Beijing100176 China Chinese Academy of Sciences State Key Laboratory of Management and Control for Complex Systems Institute of Automation Beijing100190 China

ISBN: (纸本)9798350325706

Rehabilitation robots play an important role in the motor function rehabilitation for stroke survivors with hemiplegia. However, the rehabilitation effect of current robots is still limited partly because a single training of motor function can be strongly affected by the decreased blood supply function of the bedridden patients. This work proposed an approach to study the coupling relationship between the motor and blood supply functions by combining the synchronously recorded EEG and cerebral blood oxygen information, where the cerebrations in different movement paradigms were analyzed from an aspect of "functional coupling". The results show that the information of oxyhemoglobin concentration change (ΔHbO) can effectively indicate the cortex activation, and a stronger blood supply is needed in cortexes to perform body movements. The correlations within motor cortexes are significantly stronger than the ones between motor and prefrontal cortexes, and a higher resistance level of extremity training will cause stronger correlations. Calculation of transfer entropy (TE) shows that there exists a bidirectional information transmission between the electrophysiological and blood supply signals, and more information is transmitted always in the direction from ΔHbO to EEG than in the opposite direction. The information transmission or the coupling relationship can be significantly enhanced by extremity movements, and large TE values are achieved in the Theta, Beta and Gamma frequency bands of EEG that correspond to motor functions. This work has demonstrated the functional coupling between nerve and blood microcirculation, which would provide a technical guidance to improve the rehabilitation effect for current robot systems and have great application potentials. © 2023 IEEE.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Trajectory Tracking Multi-mode Predictive control Based on Softswitching for Unmanned Surface Vehicle

Trajectory Tracking Multi-mode Predictive Control Based on S...

引用

第43届中国控制会议

作者： Kunpeng Duan Shanling Dong Meiqin Liu Senlin Zhang College of Electrical Engineering Zhejiang University National Key Laboratory of Industrial Control Technology Zhejiang University National Key Laboratory of Human-Machine Hybrid Augmented Intelligence Xi'an Jiaotong University Jinhua Institute of Zhejiang University

ISBN: (数字)9789887581581

ISBN: (纸本)9798350366907

The unmanned surface vehicle(USV) plays a vital role in ocean exploration and utilization. Its primary tasks include navigating designated routes and safely avoiding obstacles in complex environments, ensuring efficient and secure arrival at destinations. This paper proposes a soft-switching-based multi-mode predictive control method. Specifically, A two-stage control model is defined to categorize the control modes, and a nonlinear model predictive controller(NMPC) embedding relevant obstacle avoidance constraints is developed. Then combined with NMPC framework, a sigmoid function is introduced to handle the multi-mode control problem. In addition, we apply the proposed algorithm successfully to the trajectory tracking control of USV. Simulation results show the strength and reliability of the proposed algorithm, which reduces the errors and improves the control accuracy effectively.

关键词： Unmanned surface vehicle model predictive control multi-mode control trajectory tracking

来源：评论

学校读者我要写书评

暂无评论

ALFLAT: Chinese NER Using ALBERT, Flat-Lattice Transformer, Word Segmentation and Entity Dictionary 2nd

ALFLAT: Chinese NER Using ALBERT, Flat-Lattice Transformer, ...

引用

2nd EAI International Conference on Applied Cryptography in Computer and Communications, AC3 2022

作者： Lv, Haifeng Ding, Yong School of Data Science and Software Engineering Wuzhou University Wuzhou China Guangxi Key Laboratory of Cryptography and Information Security School of Computer Science and Information Security Guilin University of Electronic Technology Guilin China Guangxi Key Laboratory of Machine Vision and Intelligent Control Wuzhou University Wuzhou China

ISBN: (纸本)9783031170805

Recently, the character-word lattice structure has been proved to be effective for Chinese named entity recognition (NER) by incorporating the word information. However, one hand, since the lattice structure is dynamic and complex, although some existing lattice-based models are effectively utilize the parallel computation of GPUs, they do not fully utilize word segmentation boundary tags that as features are helpful for NER task. On the other hand, the character-word vector needs to be trained, and the user-defined entity dictionary cannot be effectively used. In this paper, we propose ALFLAT: based on a flat-lattice Transformer to incorporate ALBERT pre-trained model, word segmentation information and user-defined entity dictionary for Chinese NER. ALFLAT converts the lattice structure into a flat structure consisting of spans, integrate word segmentation embedding with the output of flat-lattice Transformer model, then modifies the emission scores according to the user-defined entity dictionary, finally utilize Viterbi decoding of the CRF layer to obtain the correct entity results. Each span corresponds to a character or latent word and its position in the original lattice. With the power of ALBERT pre-trained model, Transformer and position encoding, ALFLAT can fully leverage the lattice, word segmentation and user-defined entity dictionary information. Experiments on MSRA dataset show ALFLAT outperforms other lexicon-based models in performance and efficiency. © 2022, ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering.

关键词： Computational linguistics

来源：评论

学校读者我要写书评

暂无评论

Weld defect detection of power battery pack based on image segmentation

引用

International Journal of Wireless and Mobile Computing 2022年第2期23卷 139-145页

作者： Tao, Bo He, Fuqiang Tang, Quan Guo, Zhinan Long, Hansen Li, Shidong Cao, Yongcheng Ruan, Guijian Key Laboratory of Metallurgical Equipment and Control Technology Ministry of Education Wuhan University of Science and Technology Wuhan430081 China Hubei Key Laboratory of Mechanical Transmission and Manufacturing Engineering Wuhan University of Science and Technology Wuhan430081 China Precision Manufacturing Institute Wuhan University of Science and Technology Wuhan430081 China OPT Machine Vision Tech Co. Ltd Dongguan523852 China Jingmen Wusan Mechanism Equipment Manufacture Co. Ltd Jingmen431821 China Jiangsu Ruihong Photoelectric Technology Co. Ltd. Suqian321300 China

The safety and production efficiency are an important part of the power batteries production process and need to be considered seriously. Aiming at the welding quality of a power battery, a three-dimensional detection method based on the line laser sensor was proposed. Firstly, the depth data of the weld surface of the battery top cover is obtained by using a line laser sensor, and the defect area is segmented by using a multi thresholds segmentation method based on contour lines. Through the connected domain algorithm, the centres of defective areas are located. And the defect type is determined according to distance between the centres of the defect areas. Experimental results show that the detection rate reaches 97%, which indicates that the scheme has high detection accuracy and strong stability, and verifies the effectiveness of the method. Copyright © 2022 Inderscience Enterprises Ltd.

关键词： Welds

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：