检索结果-内蒙古大学图书馆

HIFUNet: Multi-Class Segmentation of Uterine Regions From MR Images Using Global Convolutional Networks for HIFU Surgery Planning

引用

IEEE TRANSACTIONS ON MEDICAL IMAGING 2020年第11期39卷 3309-3320页

作者： Zhang, Chen Shu, Huazhong Yang, Guanyu Li, Faqi Wen, Yingang Zhang, Qin Dillenseger, Jean-Louis Coatrieux, Jean-Louis Southeast Univ Lab Image Sci & Technol Nanjing 210096 Peoples R China Ctr Rech Informat Biomed Sino Francais F-35000 Rennes France Southeast Univ Minist Educ Key Lab Comp Network & Informat Integrat Nanjing 210096 Peoples R China Chongqing Med Univ Coll Biomed Engn State Key Lab Ultrasound Engn Med Chongqing 400016 Peoples R China Natl Engn Res Ctr Ultrasound Med Chongqing 401121 Peoples R China Chongqing Haifu Med Technol Co Ltd Chongqing 401121 Peoples R China Natl Inst Hlth & Med Res F-35000 Rennes France Univ Rennes 1 Lab Traitement Signal & Image F-35000 Rennes France

Accurate segmentation of uterus, uterine fibroids, and spine from MR images is crucial for high intensity focused ultrasound (HIFU) therapy but remains still difficult to achieve because of 1) the large shape and size variations among individuals, 2) the low contrast between adjacent organs and tissues, and 3) the unknown number of uterine fibroids. To tackle this problem, in this paper, we propose a large kernel encoder-decoder Network based on a 2D segmentation model. The use of this large kernel can capturemulti-scale contexts by enlarging the valid receptive field. In addition, a deep multiple atrous convolution block is also employed to enlarge the receptive field and extract denser feature maps. Our approach is compared to both conventional and other deep learning methods and the experimental results conducted on a large dataset show its effectiveness.

关键词： encoder-decoder global convolutional networks HIFU MR images segmentation uterine fibroids

来源：评论

学校读者我要写书评

暂无评论

PGCNet: patch graph convolutional network for point cloud segmentation of indoor scenes

引用

VISUAL COMPUTER 2020年第10-12期36卷 2407-2418页

作者： Sun, Yuliang Miao, Yongwei Chen, Jiazhou Pajarola, Renato Zhejiang Univ Technol Coll Comp Sci & Technol Hangzhou Peoples R China Zhejiang Sci Tech Univ Coll Informat Sci & Technol Hangzhou Peoples R China Univ Zurich Dept Informat CH-8050 Zurich Switzerland

Semantic segmentation of 3D point clouds is a crucial task in scene understanding and is also fundamental to indoor scene applications such as indoor navigation, mobile robotics, augmented reality. Recently, deep learning frameworks have been successfully adopted to point clouds but are limited by the size of data. While most existing works focus on individual sampling points, we use surface patches as a more efficient representation and propose a novel indoor scene segmentation framework called patch graph convolution network (PGCNet). This framework treats patches as input graph nodes and subsequently aggregates neighboring node features by dynamic graph U-Net (DGU) module, which consists of dynamic edge convolution operation inside U-shaped encoder-decoder architecture. The DGU module dynamically update graph structures at each level to encode hierarchical edge features. Incorporating PGCNet, we can segment the input scene into two types, i.e., room layout and indoor objects, which is afterward utilized to carry out final rich semantic labeling of various indoor scenes. With considerable speedup training, the proposed framework achieves effective performance equivalent to state-of-the-art for segmenting standard indoor scene dataset.

关键词： Point cloud Scene segmentation Surface patch Graph convolutional network Edge convolution encoder-decoder

来源：评论

学校读者我要写书评

暂无评论

Pixel Voting decoder: A novel decoder that regresses pixel relationships for segmentation

引用

EXPERT SYSTEMS WITH APPLICATIONS 2022年第0期193卷 116438-116438页

作者： Xian, Pengfei Po, Lai-Man Xiong, Jingjing Zhou, Chang Zhao, Yuzhi Yu, Wing-Yin Ou, Weifeng Zhang, Yujia Zhang, Xiaori City Univ Hong Kong Hong Kong Peoples R China Fudan Univ Shanghai Peoples R China

With the rapid development of the convolutional neural network, both instance segmentation and semantic segmentation have achieved remarkable performances. Recently, many efforts have been made to use a unified encoder-decoder architecture to solve these two segmentation tasks simultaneously. The encoder extracts high-level features from the input images for both tasks. However, existing decoders cannot meet the performance requirements of these two tasks: the semantic segmentation decoder is not flexible enough for instance segmentation, and the instance segmentation decoder lacks the precision of semantic segmentation. Therefore, we introduce a novel Pixel Voting decoder to satisfy both precision and flexibility. The proposed decoder regresses the interlayer pixel relationships between the input and output feature maps across the convolutional layers. Then, the pixel relationships are regarded as the pixel votes for dynamically decoding the higher level information from the encoder. Finally, we propose the dynamic deconvolution to make full use of the votes for each pixel during the decoding process. Meanwhile, the matrix computation for the dynamic deconvolution is designed to boost the calculation. Experiments show that the proposed method can achieve better performance than the well-known methods on both instance segmentation on the COCO dataset and semantic segmentation on the Cityscapes dataset. The matrix implementation of the dynamic deconvolution also shows its high efficiency and feasibility.

关键词： Convolutional neural network Dynamic deconvolution encoder-decoder Image segmentation Pixel voting Residual block

来源：评论

学校读者我要写书评

暂无评论

基于深度学习的多模态融合情感分类模型

基于深度学习的多模态融合情感分类模型

引用

作者：周洁大连理工大学

学位级别：硕士

多模态情感分析是指对包含多种模态的信息载体中的情感进行分析的过程。随着互联网的发展,信息越来越趋向于多模态化,相比较于单一模态只能描述事物的某一个方面,这种包含多种模态的数据呈现的内容更丰富,更多元化。本文通过对视频数据... 详细信息

多模态情感分析是指对包含多种模态的信息载体中的情感进行分析的过程。随着互联网的发展,信息越来越趋向于多模态化,相比较于单一模态只能描述事物的某一个方面,这种包含多种模态的数据呈现的内容更丰富,更多元化。本文通过对视频数据进行情感分析,考虑文本、音频和视觉三种模态信息,能更加准确地理解人类的情感表达,减少由单一模态信息造成的歧义或者误解,具有深远的应用价值。本文针对多模态融合方法进行了研究调查,并结合了最新的自然语言处理相关技术,提出了一种基于encoder-decoder的多模态融合模型,该模型通过不同的编码函数分别捕捉模态内交互特征和模态间交互特征,最后再通过Transformer解码层输出模态融合特征进行预测。根据在三个公开数据集上的对比实验分析,本文提出的多模态融合模型能够有效地改进多模态情感分析任务的性能,并通过消融实验以及模态分布可视化,直观地验证了该模型方法的有效性和可行性。在基于文本情感信息为主导的多模态情感分析方法的研究上,本文提出了基于BERT的跨模态融合情感分析模型。该模型主要考虑以文本模态为主的跨模态融合特征,引入跨模态注意力机制,编码文本和音频、文本和视觉的跨模态动态交互信息,对预先训练的BERT模型进行微调,以此提高情感预测的性能。在多模态情感分析数据集上进行对比实验,实验结果表明,本文提出的模型具有较好的性能表现,并对各个模态组件进行消融实验,再将注意力权重进行可视化,从不同维度上展示模型结构的作用。

关键词：多模态情感分析深度学习 BERT 跨模态融合 encoder-decoder

来源：评论

学校读者我要写书评

暂无评论

D-SELD: Dataset-Scalable Exemplar LCA-decoder

引用

NEUROMORPHIC COMPUTING AND ENGINEERING 2024年第4期4卷 044009-044009页

作者： Takaghaj, Sanaz Mahmoodi Sampson, Jack Penn State Univ Dept Comp Sci & Engn University Pk PA 16802 USA Penn State Univ Sch Engn Design & Innovat University Pk PA 16802 USA

Neuromorphic computing has recently gained significant attention as a promising approach for developing energy-efficient, massively parallel computing systems inspired by the spiking behavior of the human brain and natively mapping spiking neural networks (SNNs). Effective training algorithms for SNNs are imperative for increased adoption of neuromorphic platforms;however, SNN training continues to lag behind advances in other classes of ANN. In this paper, we reduce this gap by proposing an innovative encoder-decoder technique that leverages sparse coding and the locally competitive algorithm (LCA) to provide an algorithm specifically designed for neuromorphic platforms. Using our proposed Dataset-Scalable Exemplar LCA-decoder we reduce the computational demands and memory requirements associated with training SNNs using error backpropagation methods on increasingly larger training sets. We offer a solution that can be scalably applied to datasets of any size. Our results show the highest reported top-1 test accuracy using SNNs on the ImageNet and CIFAR100 datasets, surpassing previous benchmarks. Specifically, we achieved a record top-1 accuracy of 80.75% on ImageNet (ILSVRC2012 validation set) and 79.32% on CIFAR100 using SNNs.

关键词： encoder-decoder LCA exemplar learning spiking neural networks sparse coding neuromorphic computing

来源：评论

学校读者我要写书评

暂无评论

Reconstructing QRS Complex From PPG by Transformed Attentional Neural Networks

引用

IEEE SENSORS JOURNAL 2020年第20期20卷 12374-12383页

作者： Chiu, Hong-Yu Shuai, Hong-Han Chao, Paul C. -P. Chiao Tung Univ NCTU BASIC Lab Dept Elect & Comp Engn Hsinchu 30010 Taiwan Natl Chiao Tung Univ NCTU Dept Elect & Comp Engn Sensors IC Lab Hsinchu 30010 Taiwan

Technology that translates photoplethysmogram (PPG) into the QRS complex of electrocardiogram (ECG) would be transformative for people who require continuously monitoring. However, directly decoding the QRS complex of ECG from PPG is challenging because PPG signals usually have different offsets due to 1) different devices, and 2) personal differences, which makes the alignment difficult. In this paper, we make the first attempt to reconstruct the QRS complex of ECG only from the recording of PPG by an end-to-end deep learning-based approach. Specifically, we propose a novel encoder-decoder architecture containing three components: 1) a sequence transformer network which automatically calibrates the offset, 2) an attention network, which dynamically identifies regions of interest, and 3) a new QRS complex-enhanced loss for better reconstruction. The experiment results on a real dataset demonstrate the effectiveness of the proposed method: 3.67% R peak failure rate of the reconstructed ECG and high correlation of pulse transit time between the reconstructed QRS complex and the groundtruth QRS complex (rho = 0.844), which creates a new opportunity for low-cost clinical studies via the waveform-level reconstruction of the QRS complex of ECG from PPG.

关键词： Electrocardiography Monitoring Electrodes Biomedical monitoring Skin Standards Sensors Convolutional neural network electrocardiography encoder-decoder photoplethysmography transform network

来源：评论

学校读者我要写书评

暂无评论

Tracking Performance Limitations of MIMO Networked Control Systems With Multiple Communication Constraints

引用

IEEE TRANSACTIONS ON CYBERNETICS 2020年第7期50卷 2982-2995页

作者： Chen, Chao-Yang Gui, Weihua Wu, Lianghong Liu, Zhaohua Yan, Huaicheng Hunan Univ Sci & Technol Sch Informat & Elect Engn Xiangtan 411201 Peoples R China Boston Univ Ctr Polymer Studies Boston MA 02215 USA Boston Univ Dept Phys Boston MA 02215 USA Cent South Univ Sch Informat Sci & Engn Changsha 410012 Peoples R China East China Univ Sci & Technol Key Lab Adv Control & Optimizat Chem Proc Minist Educ Shanghai 200237 Peoples R China Hubei Normal Univ Coll Mechatron & Control Engn Huangshi 435002 Hubei Peoples R China

In this paper, the tracking performance limitation of networked control systems (NCSs) is studied. The NCSs are considered as continuous-time linear multi-input multioutput (MIMO) systems with random reference noises. The controlled plants include unstable poles and nonminimum phase (NMP) zeros. The output feedback path is affected by multiple communication constraints. We focus on some basic communication constraints, including additive white noise (AWN), quantization noise, bandwidth, as well as encoder-decoder. The system performance is evaluated with the tracking error energy, and used a two-degree-of-freedom (2DOF) controller. The explicit representation of the tracking performance is given in this paper. The results indicate the tracking performance limitations rely to internal characteristics of the plant (unstable poles and NMP zeros), reference noises [the reference noise power distribution (RNPD) and its directions], and the characteristics of communication constraints. The characteristics of communication constraints include communication noise power distribution (CNPD);quantization noise power distribution (QNPD), and their distribution directions;transform bandwidth allocation (TBA);transform encoder-decoder allocation (TEA), and their allocation directions;and NMP zeros and MP part of bandwidth. Moreover, the tracking performance limitations are also affected by the angles between the each transform NMP zero direction and RNPD direction, and these angles between each transform unstable poles direction and the direction of communication constraint distribution/allocation. In addition, for MIMO NCSs, bandwidth (there are not identical two channels) can always affect the direction of unstable poles, and the channel allocation of bandwidth and encode-decode may be used for a feasible method for the performance allocation of each channel. Finally, an instance is given for verifying the effectiveness of the theoretical outcomes.

关键词： MIMO communication Control systems Bandwidth Poles and zeros Power distribution Resource management Linear systems Bandwidth communication noise encoder-decoder performance limitation power distribution quantization noise reference noise

来源：评论

学校读者我要写书评

暂无评论

Multi-energy load forecasting for regional integrated energy systems considering temporal dynamic and coupling characteristics

引用

ENERGY 2020年第0期195卷 116964-000页

作者： Wang, Shaomin Wang, Shouxiang Chen, Haiwen Gu, Qiang Tianjin Univ Key Lab Smart Grid Minist Educ Tianjin 300072 Peoples R China Tianjin Xianghe Elect Co Ltd Tianjin 300072 Peoples R China State Grid Tianjin Elect Power Res Inst Tianjin 300022 Peoples R China

Accurate multi-energy load forecasting (MELF) is the key to realize the balance between supply and demand in regional integrated energy systems (RIES). To this end, a hybrid MELF method for RIES considering temporal dynamic and coupling characteristics (MELF_TDCC) is proposed. The novelty of MELF_TDCC lies in the following three aspects: 1) considering the high-dimensional temporal dynamic characteristic, an encoder-decoder model based on long-short term memory network (LSTMED) is proposed, which can extract the high dimensional potential feature, and reflect the temporal dynamic characteristics of historical load sequence effectively;2) considering the cross-coupling characteristic, a coupling feature matrix of multi-energy load is constructed, which reflects the cross-influence of electricity, cooling and heating loads;3) with the feature fusion layer of the hybrid model being built by gradient boosting decision tree (GBDT), the extended feature matrix for each class of load is constructed considering the intra-class inherent characteristics and inter-class coupling characteristic of loads, and the GBDT model is trained on the extended feature matrix, which provides multi-dimensional perspective for researching load essential characteristics. MELF_TDCC is verified on the ultra-short-term and short-term MELF scenarios based on an actual dataset. The simulation result shows that the proposed MELF_TDCC outperforms the current advanced methods. (C) 2020 Elsevier Ltd. All rights reserved.

关键词： encoder-decoder Long short-term memory Multi-energy load forecasting Regional integrated energy systems Rolling forecasting

来源：评论

学校读者我要写书评

暂无评论

Spatial interpolation using conditional generative adversarial neural networks

引用

INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE 2020年第4期34卷 735-758页

作者： Zhu, Di Cheng, Ximeng Zhang, Fan Yao, Xin Gao, Yong Liu, Yu Peking Univ Sch Earth & Space Sci Inst Remote Sensing & Geog Informat Syst Beijing Peoples R China Peking Univ Beijing Key Lab Spatial Informat Integrat & Its A Beijing Peoples R China UCL SpaceTimeLab Dept Civil Environm & Geomat Engn London England MIT Senseable City Lab 77 Massachusetts Ave Cambridge MA 02139 USA

Spatial interpolation is a traditional geostatistical operation that aims at predicting the attribute values of unobserved locations given a sample of data defined on point supports. However, the continuity and heterogeneity underlying spatial data are too complex to be approximated by classic statistical models. Deep learning models, especially the idea of conditional generative adversarial networks (CGANs), provide us with a perspective for formalizing spatial interpolation as a conditional generative task. In this article, we design a novel deep learning architecture named conditional encoder-decoder generative adversarial neural networks (CEDGANs) for spatial interpolation, therein combining the encoder-decoder structure with adversarial learning to capture deep representations of sampled spatial data and their interactions with local structural patterns. A case study on elevations in China demonstrates the ability of our model to achieve outstanding interpolation results compared to benchmark methods. Further experiments uncover the learned spatial knowledge in the model's hidden layers and test the potential to generalize our adversarial interpolation idea across domains. This work is an endeavor to investigate deep spatial knowledge using artificial intelligence. The proposed model can benefit practical scenarios and enlighten future research in various geographical applications related to spatial prediction.

关键词： Spatial interpolation generative adversarial networks deep learning encoder-decoder spatial prediction

来源：评论

学校读者我要写书评

暂无评论

Very-Short-Term Probabilistic Forecasting for a Risk-Aware Participation in the Single Price Imbalance Settlement

引用

IEEE TRANSACTIONS ON POWER SYSTEMS 2020年第2期35卷 1218-1230页

作者： Bottieau, Jeremie Hubert, Louis De Greve, Zacharie Vallee, Francois Toubeau, Jean-Francois Univ Mons Elect Power Engn Unit B-7000 Mons Belgium

The single imbalance pricing is an emerging mechanism in European electricity markets where all positive and negative imbalances are settled at a unique price. This real-time scheme thereby stimulates market participants to deviate from their schedule to restore the power system balance. However, exploiting this market opportunity is very risky due to the extreme volatility of the real-time power system conditions. In order to address this issue, we implement a new tailored deep-learning model, named encoder-decoder, to generate improved probabilistic forecasts of the imbalance signal, by efficiently capturing its complex spatio-temporal dynamics. The predicted distributions are then used to quantify and optimize the risk associated with the real-time participation of market players, acting as price-makers, in the imbalance settlement. This leads to an integrated forecast-driven strategy, modeled as a robust bi-level optimization. Results show that our probabilistic forecaster achieves better performance than other state of the art tools, and that the subsequent risk-aware robust dispatch tool allows finding a tradeoff between conservative and risk-seeking policies, leading to improved economic benefits. Moreover, we show that the model is computationally efficient and can thus be incorporated in the very-short-term dispatch of market players with flexible resources.

关键词： Deep learning electricity markets encoder-decoder robust optimization single imbalance pricing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：