Search Results - Inner Mongolia University Library

IEEE International Geoscience and Remote Sensing Symposium (IGARSS)

Authors: Zhu, Dehui; Du, Bo; Zhang, Liangpei (Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430079, Peoples R China; Wuhan Univ, Sch Comp, Wuhan 430072, Peoples R China)

ISBN: (print) 9781665403696

In this paper, an encoder-decoder long short-term memory network-based anomaly detector (denoted as EDLAD) is proposed for hyperspectral images. The proposed EDLAD aims to simultaneously alleviate anomaly contamination and build a stable background component for anomaly detection. To reduce anomaly contamination, the EDLAD first uses an encoder-decoder LSTM to reconstruct the hyperspectral image. Because anomaly pixels occupy an extremely small fraction of the image and the whole image is used to train the network, the encoder-decoder LSTM tends to preserve the background and suppress anomalies during reconstruction. Dimension reduction is then used to further alleviate anomaly contamination and build a stable background component. Finally, the EDLAD applies Mahalanobis distance differences to detect probable anomalies. Experiments on two benchmark hyperspectral images demonstrate the superiority of the EDLAD in anomaly detection.

Keywords: Hyperspectral imagery (HSI); anomaly detection; deep learning; encoder-decoder; long short-term memory (LSTM)
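As a rough illustration of the final detection step mentioned in the abstract above, the sketch below scores pixels by their Mahalanobis distance to background statistics. It is not the EDLAD implementation: the two-band toy data, the function names, and the use of plain global statistics are assumptions made for illustration, and the encoder-decoder LSTM reconstruction and dimension reduction are omitted.

```python
import math

def mean_and_cov(pixels):
    """Sample mean and 2x2 covariance of two-band pixel vectors."""
    n = len(pixels)
    m0 = sum(p[0] for p in pixels) / n
    m1 = sum(p[1] for p in pixels) / n
    c00 = sum((p[0] - m0) ** 2 for p in pixels) / (n - 1)
    c11 = sum((p[1] - m1) ** 2 for p in pixels) / (n - 1)
    c01 = sum((p[0] - m0) * (p[1] - m1) for p in pixels) / (n - 1)
    return (m0, m1), ((c00, c01), (c01, c11))

def mahalanobis(pixel, mean, cov):
    """Mahalanobis distance via the closed-form inverse of a 2x2 covariance."""
    (a, b), (_, d) = cov
    det = a * d - b * b
    x0, x1 = pixel[0] - mean[0], pixel[1] - mean[1]
    # x^T C^{-1} x, with C^{-1} = (1/det) * [[d, -b], [-b, a]]
    q = (d * x0 * x0 - 2 * b * x0 * x1 + a * x1 * x1) / det
    return math.sqrt(q)

# Hypothetical toy data: six background pixels plus one outlier pixel.
background = [(1.0, 2.0), (1.2, 2.1), (0.9, 1.8), (1.1, 2.2), (1.0, 1.9), (0.8, 2.0)]
pixels = background + [(3.0, 0.5)]

mean, cov = mean_and_cov(background)
scores = [mahalanobis(p, mean, cov) for p in pixels]
most_anomalous = max(range(len(pixels)), key=lambda i: scores[i])
```

In this toy setting the last pixel receives by far the largest score, which is the behavior a distance-based detector relies on.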

ATTENTION-BASED ENCODER-DECODER NETWORK FOR SINGLE IMAGE DEHAZING

IEEE International Conference on Multimedia and Expo (ICME)

Authors: Gao, Shunan; Zhu, Jinghua; Xi, Heran (Heilongjiang Univ, Sch Comp Sci & Technol, Harbin, Peoples R China)

ISBN: (print) 9781665449892

Single image dehazing is a challenging problem in computer vision because it is highly ill-posed. Although recent research has made great progress, the dehazed images produced by existing models still contain residual haze and lose too much detail. To solve these problems, we propose an end-to-end Attention-based Encoder-Decoder Network (AEDNet) that is capable of effectively removing haze while preserving image details well. AEDNet employs a novel channel shuffle attention mechanism to adaptively adjust the weight of each channel-wise feature. This attention mechanism is integrated into the residual block, which is the core feature extraction module of the encoder-decoder. Extensive experiments on synthetic datasets and real hazy images demonstrate that AEDNet achieves better performance than previous state-of-the-art methods.

Keywords: Single image dehazing; encoder-decoder; channel shuffle attention; image detail restoration

Adding visual attention into encoder-decoder model for multi-modal machine translation

JOURNAL OF ENGINEERING RESEARCH, 2023, Vol. 11, No. 2

Authors: Xu, Chun; Yu, Zhengqing; Shi, Xiayang; Chen, Fang (Xinjiang Univ Finance & Econ, Coll Informat Management, Urumqi, Peoples R China; Zhengzhou Univ Light Ind, Zhengzhou, Peoples R China; Henan Univ Tradit Chinese Med, Med Dept, Zhengzhou, Peoples R China)

Multi-modal neural machine translation (MNMT) aims to integrate visual and textual information to translate source sentences into the target language, and it has attracted a lot of attention. Existing methods contribute much to capturing interactions between visual and textual features to improve the performance of neural machine translation (NMT). However, most of them do not consider multi-modal consistency for MNMT. In fact, the image provides global semantic consistency between different languages. We believe that adding bilingual-visual agreement to the encoder and decoder simultaneously can yield bilingual representations and is useful for NMT. In this paper, we propose to integrate visual information in the encoder and decoder simultaneously to learn the interactions between visual and textual features; we call the model VMNMT. As the visual information provides global context, the encoder and decoder can learn the bilingual representations. In addition, we introduce a new bilingual-visual agreement decoder to learn better representations of corresponding image-sentence pairs. In the experiments, the improvement was 2.02 BLEU on the English-German16 dataset and 1.9 BLEU on the English-German17 dataset. The results show that our method can outperform baselines on several widely used datasets in terms of various metrics.

Keywords: Natural language processing; Multi-modal machine translation; Bilingual-visual encoder-decoder; Bilingual representations

Effluent Water Quality Prediction for Urban Wastewater Treatment Plants Based on the Encoder-Decoder Framework

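China Rural Water and Hydropower (中国农村水利水电), 2023, No. 11, pp. 93-99

Authors: Shi Hongwei (史红伟); Chen Qi (陈祺); Wang Yunlong (王云龙); Li Pengcheng (李鹏程) (School of Electronic and Information Engineering, Changchun University of Science and Technology, Changchun 130022, Jilin, China; Changchun Water Investment and Development Group Co., Ltd., Changchun 130022, Jilin, China)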

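Because wastewater treatment plants have numerous effluent quality indicators, the reactions in the treatment process are complex, and the time series are highly nonlinear, prediction methods based on mechanistic models cannot achieve satisfactory results. To address this problem, a deep-learning-based method for predicting effluent quality is proposed. Using monitored water quality data from a wastewater treatment plant in Jilin Province as the source data, several neural networks combined with an encoder-decoder structure are used to predict water quality. The results show that the proposed structure improves the predictive ability of both LSTM and GRU networks to some extent, with a more pronounced improvement for long-term prediction. The ED-GRU model performs best. In short-term prediction, the root mean square errors (RMSE) of the four effluent quality indicators are 0.7551, 0.2197, 0.0734, and 0.3146, and the goodness-of-fit (R²) values are 0.9013, 0.9332, 0.9167, and 0.9532, so local variations in water quality can be predicted. In long-term prediction, the RMSE values of the four indicators are 1.7204, 1.7689, 0.4478, and 0.8316, and the R² values are 0.4849, 0.5507, 0.4502, and 0.7595, so the trend of water quality changes can be predicted. Compared with the sequential structure, short-term RMSE decreases by more than 10% and R² increases by more than 2%, while long-term RMSE decreases by more than 25% and R² increases by more than 15%. The results indicate that neural networks based on the encoder-decoder structure can accurately predict the effluent quality of wastewater treatment plants and provide technical support for improving wastewater treatment processes.

Keywords: wastewater treatment plant effluent; encoder-decoder; multi-indicator water quality prediction; GRU model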
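For readers unfamiliar with the two metrics quoted in the abstract above, the following minimal sketch computes RMSE and R² using their standard textbook definitions. It assumes R² denotes the usual coefficient of determination; the sample values are made up and are not data from the paper.

```python
import math

def rmse(y_true, y_pred):
    """Root mean square error between observations and predictions."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(y_true))

def r2(y_true, y_pred):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))
    ss_tot = sum((t - mean) ** 2 for t in y_true)
    return 1.0 - ss_res / ss_tot

# Hypothetical observed and predicted values for one indicator.
observed = [10.0, 12.0, 11.0, 13.0]
predicted = [10.5, 11.5, 11.0, 12.0]

error = rmse(observed, predicted)
fit = r2(observed, predicted)
```

A lower RMSE and an R² closer to 1 both indicate a better fit, which is how the short-term and long-term figures above should be read.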

A novel Encoder-Decoder structure for Time Series analysis based on Bayesian Uncertainty reduction

IEEE Latin American Conference on Computational Intelligence (LA-CCI)

Authors: Llugsi, Ricardo; El Yacoubi, Samira; Fontaine, Allyx; Lupera, Pablo (Natl Polytech Sch, Dept Elect Telecommun & Informat Networks DETRI, Quito, Ecuador; Univ Perpignan Via Domitia, UMR Espace Dev, Perpignan, France; Univ Guyane, UMR Espace Dev, Cayenne, French Guiana)

ISBN: (print) 9781728188645

In the present work, a novel Convolutional LSTM encoder-decoder structure for weather forecasting in the Andean city of Quito is presented. The encoder-decoder structure uses walk-forward validation, an adjustment of the Bayesian posterior predictive distribution, and the AdamW optimizer to carry out the forecast. These stages are combined to obtain 4 error metrics per hour. The prediction is based on data acquired from a network of Automatic Weather Stations. The results show that the Convolutional encoder-decoder structure with a dropout probability of 0.05 and a model precision equal to 0.1 performs better than an LSTM model, a stacked LSTM model, or ARIMA models, reaching a maximum error of 1.03 degrees C. Finally, the methodology could be applied as an effective option to implement the post-processing stage for the physical model of a Weather Forecast System.

Keywords: LSTM; Time Series; Convolutional encoder-decoder; Neural Network; Walk-Forward validation; Dropout; Bayesian uncertainty
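Walk-forward validation, mentioned in the abstract above, can be sketched as an expanding-window split in which every test block comes strictly after the data used for training. This is a generic illustration with hypothetical parameter names (`initial_train`, `horizon`), not the authors' procedure.

```python
def walk_forward_splits(n_samples, initial_train, horizon):
    """Yield (train_indices, test_indices) with an expanding training window."""
    start = initial_train
    while start + horizon <= n_samples:
        train = list(range(0, start))
        test = list(range(start, start + horizon))
        yield train, test
        start += horizon  # the tested block joins the next training window

# Hypothetical series of 10 time steps, tested 2 steps at a time.
splits = list(walk_forward_splits(n_samples=10, initial_train=4, horizon=2))
for train, test in splits:
    print("train up to", train[-1], "-> test", test)
```

Unlike shuffled cross-validation, no split lets the model see observations from the future of its test block.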

An Encoder-Decoder Approach to Handwritten Mathematical Expression Recognition with Multi-head Attention and Stacked Decoder

16th IAPR International Conference on Document Analysis and Recognition (ICDAR)

Authors: Ding, Haisong; Chen, Kai; Huo, Qiang (Microsoft Res Asia, Beijing, Peoples R China)

ISBN: (print) 9783030863319

The encoder-decoder framework with an attention mechanism has become a mainstream solution to handwritten mathematical expression recognition (HMER) since the "watch, attend and parse (WAP)" approach was proposed in 2017, in which a convolutional neural network is used as the encoder and a gated recurrent unit with attention is used in the decoder. Inspired by the recent success of the Transformer in many applications, in this paper we adopt the Transformer's design of multi-head attention and stacked decoder to improve the decoder part of the WAP framework for HMER. Experimental results on CROHME tasks show that multi-head attention can boost the expression recognition rate (ExpRate) of WAP from 54.32%/58.05% to 56.76%/59.72%, and that the stacked decoder can further improve ExpRate to 57.72%/61.38% on the CROHME 2016/2019 test sets.

Keywords: encoder-decoder; Multi-head attention; Stacked decoder; Transformer; Mathematical expression recognition

Table Structure Recognition Using CoDec Encoder-Decoder

16th IAPR International Conference on Document Analysis and Recognition (ICDAR)

Authors: Pegu, Bhanupriya; Singh, Maneet; Agarwal, Aakash; Mitra, Aniruddha; Singh, Karamjit (Mastercard AI Garage, Gurgaon, India)

ISBN: (print) 9783030861599; 9783030861582

Automated document analysis and parsing has been a focus of research for a long time. An important component of document parsing revolves around understanding tabular regions with respect to their structure identification, followed by precise information extraction. While substantial effort has gone into table detection and information extraction from documents, table structure recognition remains a long-standing task demanding dedicated attention. Identifying the table structure enables the extraction of structured information from tabular regions, which can then be used in further applications. To this end, this research proposes a novel table structure recognition pipeline consisting of row identification and column identification modules. The column identification module uses a novel Column Detector encoder-decoder model (termed the CoDec encoder-decoder), which is trained via a novel loss function to predict the column mask for a given input image. Experiments have been performed to analyze the different components of the proposed pipeline, supporting their inclusion for enhanced performance. The proposed pipeline has been evaluated on the challenging ICDAR 2013 table structure recognition dataset, where it demonstrates state-of-the-art performance.

Keywords: Table structure recognition; encoder-decoder; Document analysis

Multi-scale Salient Instance Segmentation based on Encoder-Decoder

13th Asian Conference on Machine Learning (ACML)

Authors: Chen, Houru; Shi, Caijuan; Li, Wei; Duan, Changyu; Yan, Jinwei (North China Univ Sci & Technol, Tangshan 063210, Hebei, Peoples R China)

Salient instance segmentation refers to segmenting noticeable instance objects in images. When faced with multi-scale salient instances and overlapping instances, existing salient instance segmentation methods have significant limitations, including inaccurate detection of large-scale instances, missed detection of small-scale instances, and wrong segmentation of overlapping instances. To solve these problems, a new multi-scale salient instance segmentation network (MSISNet) based on the encoder-decoder is proposed. First, we design a receptive field encoder (RFE), which adopts serial dilated convolution instead of parallel dilated convolution and uses some common tricks to achieve better precision and speed. RFE can alleviate the problems of inaccurate detection of large-scale instances, missed detection of small-scale instances, and especially wrong segmentation of overlapping instances. Then, a pyramid decoder (PD) for the detection branch is designed to further alleviate the problem of inaccurate detection of large-scale instances and the difficulty in locating small-scale instances. Finally, a multi-stage decoder (MSD) is designed to improve the quality of the segmentation mask. To sufficiently evaluate the generalizability of our method, experiments are conducted not only on the Salient Instance Segmentation-1K (SIS-1K) dataset but also on the Salient Objects in Clutter (SOC) dataset. The results show that the proposed MSISNet is superior to existing salient instance segmentation methods and to some recently proposed non-salient instance segmentation methods on mAP0.5.

Keywords: Salient instance segmentation; encoder-decoder; Multi-scale; Receptive field; Feature fusion
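A general property of serial versus parallel dilated convolutions (not a claim about the RFE design specifically) is how the receptive field grows. The sketch below applies the standard formula for stride-1 layers; the kernel size and dilation rates are hypothetical and are not taken from the paper.

```python
def serial_receptive_field(kernel_size, dilations):
    """Receptive field of stride-1 dilated convolutions stacked one after another."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d  # each layer widens the field by (k - 1) * d
    return rf

def parallel_receptive_field(kernel_size, dilations):
    """Receptive field when the same layers run as parallel branches and are fused."""
    return max(1 + (kernel_size - 1) * d for d in dilations)

rates = [1, 2, 4]  # hypothetical dilation rates
serial = serial_receptive_field(3, rates)      # 1 + 2 * (1 + 2 + 4)
parallel = parallel_receptive_field(3, rates)  # 1 + 2 * 4
print(serial, parallel)
```

With the same three 3x3 layers, the serial stack covers a wider context than the parallel branches, because the dilations accumulate instead of being limited by the largest one.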

Effective improvement of multi-step-ahead flood forecasting accuracy through encoder-decoder with an exogenous input structure

JOURNAL OF HYDROLOGY, 2022, Vol. 609

Authors: Cui, Zhen; Zhou, Yanlai; Guo, Shenglian; Wang, Jun; Xu, Chong-Yu (Wuhan Univ, State Key Lab Water Resources & Hydropower Engn S, Wuhan 430072, Peoples R China; Univ Oslo, Dept Geosci, POB 1047 Blindern, N-0316 Oslo, Norway)

Accurate and reliable multi-step-ahead flood forecasting is beneficial for reservoir operation and water resources management. The encoder-decoder (ED), which can tackle sequence-to-sequence problems, is suitable for multi-step-ahead flood forecasting. This study proposes a novel ED with an exogenous input (EDE) structure for multi-step-ahead flood forecasting. The exogenous input can be the outputs of process-based hydrological models. This study constructs four multi-step-ahead flood forecasting approaches: the Xinanjiang (XAJ) hydrological model, the single-output long short-term memory (LSTM) neural network with recursive strategies, the recursive ED combined with the LSTM neural network (LSTM-RED), and the LSTM-EDE model. The performance of these four models is evaluated and compared using long-term 3 h hydrologic data series of the Lushui and Jianxi basins in China. The results show that the LSTM-RED model, which integrates recursive strategies into the training process of neural networks, is more advantageous than the LSTM model. Compared with the LSTM-RED model, the proposed LSTM-EDE model can overcome the exposure bias problem, simplify the model structure, increase computational efficiency in the validation process, and improve multi-step-ahead flood forecasting accuracy.

Keywords: Flood forecasting; Multi-step-ahead; Neural network; Deep learning; encoder-decoder; Exogenous input
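The recursive strategy mentioned for the single-output baseline can be sketched as follows: a one-step model is applied repeatedly, and each prediction is fed back as an input for the next step. The `one_step_model` here is a hypothetical stand-in (the average of the last two values), not an LSTM and not the XAJ, LSTM-RED, or LSTM-EDE models from the paper.

```python
def recursive_forecast(history, one_step_model, steps, window):
    """Multi-step-ahead forecast by feeding each prediction back as input."""
    buffer = list(history[-window:])
    forecasts = []
    for _ in range(steps):
        nxt = one_step_model(buffer)
        forecasts.append(nxt)
        buffer = buffer[1:] + [nxt]  # slide the window over the new prediction
    return forecasts

def one_step_model(window_values):
    """Hypothetical one-step predictor: mean of the two most recent values."""
    return (window_values[-1] + window_values[-2]) / 2.0

# Hypothetical flow series; forecast three steps ahead.
forecasts = recursive_forecast([1.0, 2.0, 4.0], one_step_model, steps=3, window=2)
print(forecasts)
```

Because later steps consume earlier predictions rather than observations, any one-step error is carried forward through the forecast horizon.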

CUFD: An encoder-decoder network for visible and infrared image fusion based on common and unique feature decomposition

COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, Vol. 218

Authors: Xu, Han; Gong, Meiqi; Tian, Xin; Huang, Jun; Ma, Jiayi (Wuhan Univ, Elect Informat Sch, Wuhan 430072, Peoples R China)

In this paper, we propose a novel method for visible and infrared image fusion by decomposing feature information, termed CUFD. It adopts two pairs of encoder-decoder networks to implement feature map extraction and decomposition, respectively. On the one hand, the shallow features of the image contain abundant information, while the deep features focus more on extracting the thermal targets. Thus, we use an encoder-decoder network to extract both shallow and deep features. Unlike existing methods, both the shallow and deep features are used for fusion and reconstruction with different emphases. On the other hand, the infrared and visible features of the same layer have both similarities and differences. Therefore, we train the other encoder-decoder network to decompose the feature maps into common and unique information based on their similarities and differences. After that, we apply different fusion rules according to the flexible requirements. This operation helps retain the significant feature information in the fusion results. Qualitative and quantitative experiments on the publicly available TNO and RoadScene datasets demonstrate the superiority of our CUFD over the state of the art.

Keywords: Image fusion; Infrared; Visible; Feature map; encoder-decoder
