检索结果-内蒙古大学图书馆

Multi-Modal Domain Adaptation Variational Autoencoder for EEG-Based Emotion recognition

IEEE/CAA Journal of Automatica Sinica 2022年第9期9卷 1612-1626页

作者： Yixin Wang Shuang Qiu Dan Li Changde Du Bao-Liang Lu Huiguang He the Research Center for Brain-inspired Intelligence National Laboratory of Pattern RecognitionInstitute of AutomationChinese Academy of ScienceBeijing 100190 the University of Chinese Academy of Sciences Beijing 100049 the Beijing Institute of Control and Electronic Technology Beijing 100038China the School of Mathematics and Information Sciences Yantai UniversityYantai 264003China the Center for Excellence in Brain Science and Intelligence Technology Chinese Academy of ScienceBeijingChina the Department of Computer Science and Engineering Shanghai Jiao Tong UniversityShanghai 200240China

Traditional electroencephalograph(EEG)-based emotion recognition requires a large number of calibration samples to build a model for a specific subject,which restricts the application of the affective brain computer interface(BCI)in *** attempt to use the multi-modal data from the past session to realize emotion recognition in the case of a small amount of calibration *** solve this problem,we propose a multimodal domain adaptive variational autoencoder(MMDA-VAE)method,which learns shared cross-domain latent representations of the multi-modal *** method builds a multi-modal variational autoencoder(MVAE)to project the data of multiple modalities into a common *** adversarial learning and cycle-consistency regularization,our method can reduce the distribution difference of each domain on the shared latent representation layer and realize the transfer of *** experiments are conducted on two public datasets,SEED and SEED-IV,and the results show the superiority of our proposed *** work can effectively improve the performance of emotion recognition with a small amount of labelled multi-modal data.

关键词： Cycle-consistency domain adaptation electroencephalograph(EEG) multi modality variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

SliceProp: A Slice-Wise Bidirectional Propagation Model for Interactive 3D Medical Image Segmentation 1

SliceProp: A Slice-Wise Bidirectional Propagation Model for ...

引用

1st IEEE International Conference on Medical Artificial Intelligence, MedAI 2023

作者： Xu, Xin Lu, Wenjing Lei, Jiahao Qiu, Peng Shen, Hong-Bin Yang, Yang Shanghai Jiao Tong University Key Lab. of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering Department of Computer Science and Engineering Shanghai200240 China Shanghai Ninth People's Hospital Shanghai Jiao Tong University School of Medicine Department of Vascular Surgery China Shanghai Jiao Tong University Institute of Image Processing and Pattern Recognition Key Laboratory of System Control and Information Processing Ministry of Education of China Shanghai200240 China

ISBN: (纸本)9798350358780

Interactive medical image segmentation methods have become increasingly popular in recent years. These methods combine manual labeling and automatic segmentation, reducing the workload of annotation while maintaining high accuracy. However, most current interactive segmentation frameworks are limited to 2D image data, and are not suitable for 3D image data due to the large size and high complexity of 3D data, as well as the challenges posed by information asymmetry and sparse annotation. In this paper, we propose SliceProp, an interactive segmentation framework that implements slice-wise Label Bidirectional Propagation (LBP) for 3D medical image segmentation. SliceProp extends the interactive 2D image segmentation algorithm to 3D image segmentation, and can handle 3D data with large size and high complexity. Moreover, equipped with a Backtracking Feedback Check (BFC) module, SliceProp effectively addresses the issues of information asymmetry and spatial sparse annotation in 3D medical image segmentation. Additionally, we adopt an uncertainty-based criterion to pri-oritize the slices to be refined interactively, which enhances the efficiency of the interaction process by enabling the model to focus on the regions with the most unreliable predictions. SliceProp is evaluated on two datasets and achieves promising results compared to state-of-the-art methods. © 2023 IEEE.

关键词： Medical imaging

来源：评论

学校读者我要写书评

暂无评论

Deep Learning in Palmprint recognition-A Comprehensive Survey

arXiv

引用

arXiv 2025年

作者： Gao, Chengrui Yang, Ziyuan Jia, Wei Leng, Lu Zhang, Bob Teoh, Andrew Beng Jin College of Computer Science Sichuan University Chengdu610065 China Singapore School of Computer and Information Hefei University of Technology Hefei China Jiangxi Provincial Key Laboratory of Image Processing and Pattern Recognition Nanchang Hangkong University Nanchang China Pattern Analysis and Machine Intelligence Group Department of Computer and Information Science University of Macau Taipa China School of Electrical and Electronic Engineering College of Engineering Yonsei University Seoul Korea Republic of

Palmprint recognition has emerged as a prominent biometric technology, widely applied in diverse scenarios. Traditional handcrafted methods for palmprint recognition often fall short in representation capability, as they heavily depend on researchers' prior knowledge. Deep learning (DL) has been introduced to address this limitation, leveraging its remarkable successes across various domains. While existing surveys focus narrowly on specific tasks within palmprint recognition-often grounded in traditional methodologies-there remains a significant gap in comprehensive research exploring DL-based approaches across all facets of palmprint recognition. This paper bridges that gap by thoroughly reviewing recent advancements in DL-powered palmprint recognition. The paper systematically examines progress across key tasks, including region-of-interest segmentation, feature extraction, and security/privacy-oriented challenges. Beyond highlighting these advancements, the paper identifies current challenges and uncovers promising opportunities for future research. By consolidating state-of-the-art progress, this review serves as a valuable resource for researchers, enabling them to stay abreast of cutting-edge technologies and drive innovation in palmprint recognition. Copyright © 2025, The Authors. All rights reserved.

关键词： Biometrics

来源：评论

学校读者我要写书评

暂无评论

Split-net: Dual transformer encoder with splitting scene text image for script identification

引用

pattern recognition Letters 2025年 196卷 100-108页

作者： Ayush Roy Shivakumara Palaiahnakote Umapada Pal Cheng-Lin Liu Department of Computer Science and Engineering State University of New York Buffalo United States School of Science Engineering and Environment University of Salford Manchester United Kingdom Computer Vision and Pattern Recognition Indian Statistical Institute Kolkata India State Key Laboratory of Multimodal Artificial Intelligence Systems Institute of Automation of the Chinese Academy of Sciences Beijing China School of Artificial Intelligence University of Chinese Academy of Sciences Beijing China

Script identification is vital for understanding scenes and video images. It is challenging due to high variations in physical appearance, typeface design, complex background, distortion, and significant overlap in the characteristics of different scripts. Unlike existing models, which aim to tackle the script images utilizing the scene text image as a whole, we propose to split the image into upper and lower halves to capture the intricate differences in stroke and style of various scripts. Motivated by the accomplishments of the transformer, a modified script-style-aware Mobile-Vision Transformer (M-ViT) is explored for encoding visual features of the images. To enrich the features of the transformer blocks, a novel Edge Enhanced Style Aware Channel Attention Module (EESA-CAM) has been integrated with M-ViT. Furthermore, the model fuses the features of the dual encoders (extracting features from the upper and the lower half of the images) by a dynamic weighted average procedure utilizing the gradient information of the encoders as the weights. In experiments on three standard datasets, MLe2e, CVSI2015, and SIW-13, the proposed model yielded superior performance compared to state-of-the-art models.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Self-attention transfer networks for speech emotion recognition

引用

Virtual Reality & Intelligent Hardware 2021年第1期3卷 43-54页

作者： Ziping ZHAO Keru Wang Zhongtian BAO Zixing ZHANG Nicholas CUMMINS Shihuang SUN Haishuai WANG Jianhua TAO Björn WSCHULLER College of Computer and Information Engineering Tianjin Normal UniversityTianjin 300387China GLAM-Group on Language Audio&MusicImperial College LondonSW72AZUK Chair of Embedded Intelligence for Health Care and Wellbeing University of Augsburg86159Germany Department of Biostatistics and Health Informatics IoPPNKing's College LondonLondonSE58AFUK Department of Computer Science and Engineering Fairfield University 06824USA National Laboratory of Pattern Recognition CASIABeijing 100190China

Background A crucial element of human-machine interaction,the automatic detection of emotional states from human speech has long been regarded as a challenging task for machine learning *** vital challenge in speech emotion recognition(SER)is learning robust and discriminative representations from *** machine learning methods have been widely applied in SER research,the inadequate amount of available annotated data has become a bottleneck impeding the extended application of such techniques(e.g.,deep neural networks).To address this issue,we present a deep learning method that combines knowledge transfer and self-attention for SER ***,we apply the log-Mel spectrogram with deltas and delta-deltas as ***,given that emotions are time dependent,we apply temporal convolutional neural networks to model the variations in *** further introduce an attention transfer mechanism,which is based on a self-attention algorithm to learn long-term *** self-attention transfer network(SATN)in our proposed approach takes advantage of attention transfer to learn attention from speech recognition,followed by transferring this knowledge into *** evaluation built on Interactive Emotional Dyadic Motion Capture(IEMOCAP)dataset demonstrates the effectiveness of the proposed model.

关键词： Speech emotion recognition Attention transfer Self-attention Temporal convolutional neural networks(TCNs)

来源：评论

学校读者我要写书评

暂无评论

RBP-Former: Joint Prediction of RNA-protein Binding Sites on Full-length RNA Transcripts for Multiple RBPs

RBP-Former: Joint Prediction of RNA-protein Binding Sites on...

引用

IEEE International Conference on Bioinformatics and Biomedicine (BIBM)

作者： Yichong Li Xiaojian Liu Fan Cheng Xiaoyong Pan Yang Yang Department of Computer Science and Engineering Shanghai Jiao Tong University Shanghai China Institute of Image Processing and Pattern Recognition Shanghai Jiao Tong University Shanghai China Key Laboratory of System Control and Information Processing Ministry of Education of China Shanghai China Key Laboratory of Shanghai Education Commission for Intelligent Interaction and Cognitive Engineering Shanghai China

ISBN: (数字)9798350386226

ISBN: (纸本)9798350386233

RNA-binding proteins (RBPs) are essential for gene expression, and the complex RNA-protein interaction mechanisms require analysis of global RNA information. Therefore, accurate prediction of RBP binding sites on full-length RNA transcripts is crucial for understanding these mechanisms and their roles in diseases. While machine learning methods can predict RBP binding to RNA fragments, extending this to full-length transcripts presents challenges due to sequence length and data imbalance. In this paper, we introduce RBP-Former, a binding site joint prediction model designed specifically for full-length RNA transcripts that can be used for multiple RBPs. This model processes information at both coarse and fine-grained levels to fully exploit sequence data and its interactions with multiple RBPs. We develop multi-level imbalance learning strategies, achieving favorable results on imbalanced data. Our method outperforms existing methods in predicting binding sites on full-length RNA transcripts for multiple RBPs, demonstrating its effectiveness in handling imbalanced label and sample distributions.

关键词： Proteins Protein engineering Accuracy RNA Machine learning Predictive models Data models Gene expression Bioinformatics Diseases

来源：评论

学校读者我要写书评

暂无评论

Stain-Adaptive Self-Supervised Learning for Histopathology Image Analysis

arXiv

引用

arXiv 2022年

作者： Ye, Hai-Li Wang, Da-Han Department of Computer and Information Engineering Xiamen University of Technology Xiamen361000 China Fujian Provincial Key Laboratory of Pattern Recognition and Image Understanding Xiamen361000 China

It is commonly recognized that color variations caused by differences in stains is a critical issue for histopathology image analysis. Existing methods adopt color matching, stain separation, stain transfer or the combination of them to alleviate the stain variation problem. In this paper, we propose a novel Stain-Adaptive Self-Supervised Learning(SASSL) method for histopathology image analysis. Our SASSL integrates a domain-adversarial training module into the SSL framework to learn distinctive features that are robust to both various transformations and stain variations. The proposed SASSL is regarded as a general method for domain-invariant feature extraction which can be flexibly combined with arbitrary downstream histopathology image analysis modules (e.g. nuclei/tissue segmentation) by fine-tuning the features for specific downstream tasks. We conducted experiments on publicly available pathological image analysis datasets including the PANDA, BreastPathQ, and CAMELYON16 datasets, achieving the state-of-the-art performance. Experimental results demonstrate that the proposed method can robustly improve the feature extraction ability of the model, and achieve stable performance improvement in downstream tasks. Copyright © 2022, The Authors. All rights reserved.

关键词： Image analysis

来源：评论

学校读者我要写书评

暂无评论

Reprogramming pretrained target-specific diffusion models for dual-target drug design 24

Reprogramming pretrained target-specific diffusion models fo...

引用

Proceedings of the 38th International Conference on Neural Information Processing Systems

作者： Xiangxin Zhou Jiaqi Guan Yijia Zhang Xingang Peng Liang Wang Jianzhu Ma School of Artificial Intelligence University of Chinese Academy of Sciences and New Laboratory of Pattern Recognition (NLPR) State Key Laboratory of Multimodal Artificial Intelligence Systems (MAIS) Institute of Automation Chinese Academy of Sciences (CASIA) Department of Computer Science University of Illinois Urbana-Champaign Department of Electronic Engineering Tsinghua University Institute for Artificial Intelligence Peking University Department of Electronic Engineering Tsinghua University and Institute for AI Industry Research Tsinghua University

ISBN: (纸本)9798331314385

Dual-target therapeutic strategies have become a compelling approach and attracted significant attention due to various benefits, such as their potential in overcoming drug resistance in cancer therapy. Considering the tremendous success that deep generative models have achieved in structure-based drug design in recent years, we formulate dual-target drug design as a generative task and curate a novel dataset of potential target pairs based on synergistic drug combinations. We propose to design dual-target drugs with diffusion models that are trained on single-target protein-ligand complex pairs. Specifically, we align two pockets in 3D space with protein-ligand binding priors and build two complex graphs with shared ligand nodes for SE(3)-equivariant composed message passing, based on which we derive a composed drift in both 3D and categorical probability space in the generative process. Our algorithm can well transfer the knowledge gained in single-target pretraining to dual-target scenarios in a zero-shot manner. We also repurpose linker design methods as strong baselines for this task. Extensive experiments demonstrate the effectiveness of our method compared with various baselines.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Contrastive learning based method for X-ray and CT registration under surgical equipment occlusion

引用

computers in Biology and Medicine 2024年 180卷 108946页

作者： Wang, Xiyuan Zhang, Zhancheng Xu, Shaokang Luo, Xiaoqing Zhang, Baocheng Wu, Xiao-Jun School of Electronics and Information Engineering at University of Science and Technology Suzhou SuZhou215009 China Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence School of Artificial Intelligence and Computer Science at Jiangnan University WuXi214122 China Department of Orthopaedics General Hospital of Central Theater Command of PLA WuHan430012 China Shanghai Jirui Maestro Surgical Technology Co ShangHai200000 China

Deep learning-based 3D/2D surgical navigation registration techniques achieved excellent results. However, these methods are limited by the occlusion of surgical equipment resulting in poor accuracy. We designed a contrastive learning method that treats occluded and unoccluded X-rays as positive samples, maximizing the similarity between the positive samples and reducing interference from occlusion. The designed registration model has Transformer's residual connection (ResTrans), which enhances the long-sequence mapping capability, combined with the contrast learning strategy, ResTrans can adaptively retrieve the valid features in the global range to ensure the performance in the case of occlusion. Further, a learning-based region of interest (RoI) fine-tuning method is designed to refine the misalignment. We conducted experiments on occluded X-rays that contained different surgical devices. The experiment results show that the mean target registration error (mTRE) of ResTrans is 3.25 mm and the running time is 1.59 s. Compared with the state-of-the-art (SOTA) 3D/2D registration methods, our method offers better performance on occluded 3D/2D registration tasks. © 2024 Elsevier Ltd

关键词： Surgical equipment

来源：评论

学校读者我要写书评

暂无评论

Multi-Unit Floor Plan recognition and Reconstruction Using Improved Semantic Segmentation of Raster-Wise Floor Plans

arXiv

引用

arXiv 2024年

作者： Kratochvila, Lukas de Jong, Gijs Arkesteijn, Monique Bilík, Šimon Zemčík, Tomáš Horak, Karel Rellermeyer, Jan S. Department of Control and Instrumentation Faculty of Electrical Engineering and Communication Brno University of Technology Brno Czech Republic Department of Software Technology Faculty of Electrical Engineering Mathematics and Computer Science TU Delft Delft Netherlands Department of Management in the Built Environment Faculty of Architecture and the Built Environment TU Delft Delft Netherlands Computer Vision and Pattern Recognition Laboratory Department of Computational Engineering Lappeenranta-Lahti University of Technology LUT Lappeenranta Finland Dependable and Scalable Software Systems Institute of Systems Engineering Faculty of Electrical Engineering and Computer Science Leibniz University Hannover Hannover Germany

Digital twins have a major potential to form a significant part of urban management in emergency planning, as they allow more efficient designing of the escape routes, better orientation in exceptional situations, and faster rescue intervention. Nevertheless, creating the twins still remains a largely manual effort, due to a lack of 3D-representations, which are available only in limited amounts for some new buildings. Thus, in this paper we aim to synthesize 3D information from commonly available 2D architectural floor plans. We propose two novel pixel-wise segmentation methods based on the MDA-Unet and MACU-Net architectures with improved skip connections, an attention mechanism, and a training objective together with a reconstruction part of the pipeline, which vectorizes the segmented plans to create a 3D model. The proposed methods are compared with two other state-of-the-art techniques and several benchmark datasets. On the commonly used CubiCasa benchmark dataset, our methods have achieved the mean F1 score of 0.86 over five examined classes, outperforming the other pixel-wise approaches tested. We have also made our code publicly available to support research in the field. © 2024, CC BY.

关键词： Semantic Segmentation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：