检索结果-内蒙古大学图书馆

Automatic Crack Detection on Road Pavements Using encoder-decoder Architecture

MATERIALS 2020年第13期13卷 2960页

作者： Fan, Zhun Li, Chong Chen, Ying Wei, Jiahong Loprencipe, Giuseppe Chen, Xiaopeng Di Mascio, Paola Shantou Univ Coll Engn Dept Elect & Informat Engn Key Lab Digital Signal & Image Proc Guangdong Pro Shantou 515063 Peoples R China Sapienza Univ Rome Dept Civil Construct & Environm Engn I-00184 Rome Italy Pusan Natl Univ Dept Ind Engn Busan 609735 South Korea

Automatic crack detection from images is an important task that is adopted to ensure road safety and durability for Portland cement concrete (PCC) and asphalt concrete (AC) pavement. Pavement failure depends on a number of causes including water intrusion, stress from heavy loads, and all the climate effects. Generally, cracks are the first distress that arises on road surfaces and proper monitoring and maintenance to prevent cracks from spreading or forming is important. Conventional algorithms to identify cracks on road pavements are extremely time-consuming and high cost. Many cracks show complicated topological structures, oil stains, poor continuity, and low contrast, which are difficult for defining crack features. Therefore, the automated crack detection algorithm is a key tool to improve the results. Inspired by the development of deep learning in computer vision and object detection, the proposed algorithm considers an encoder-decoder architecture with hierarchical feature learning and dilated convolution, named U-Hierarchical Dilated Network (U-HDN), to perform crack detection in an end-to-end method. Crack characteristics with multiple context information are automatically able to learn and perform end-to-end crack detection. Then, a multi-dilation module embedded in an encoder-decoder architecture is proposed. The crack features of multiple context sizes can be integrated into the multi-dilation module by dilation convolution with different dilatation rates, which can obtain much more cracks information. Finally, the hierarchical feature learning module is designed to obtain a multi-scale features from the high to low- level convolutional layers, which are integrated to predict pixel-wise crack detection. Some experiments on public crack databases using 118 images were performed and the results were compared with those obtained with other methods on the same images. The results show that the proposed U-HDN method achieves high performance because it can

关键词： pavement cracking automatic crack detection encoder-decoder deep learning U-net hierarchical feature dilated Convolution

来源：评论

学校读者我要写书评

暂无评论

Lightweight encoder-decoder model for automatic skin lesion segmentation

引用

Informatics in Medicine Unlocked 2021年 25卷

作者： Wibowo, Adi Purnama, Satriawan Rasyid Wirawan, Panji Wisnu Rasyidi, Hanif Department of Computer Science Diponegoro University Informatics Semarang Indonesia College of Engineering & Computer Science Australian National University Canberra Australia

Accurate skin lesion segmentation (SLS) is an important step in computer-aided diagnosis of melanoma. Automatic detection of skin lesions in dermoscopy images is challenging because of the presence of artifacts and as lesions can have heterogeneous texture, color, and shape with fuzzy or indistinct boundaries. In this study, automatic SLS was performed using a lightweight encoder-decoder, MobileNetV3-UNet, which can achieve high accuracy with low resources. A comprehensive analysis was performed to improve the accuracy of the method in SLS. The semantic segmentation method consists of an encoder-decoder architecture, data augmentation, learning schemes, and post-processing methods. To enhance the SLS, we modified the decoder with the bidirectional ConvLSTM layer from the BCDU-Net and separable blocks from the separable-UNet architecture. Random augmentation was used to improve image diversity in the training dataset to avoid overfitting. Furthermore, a learning scheme based on stochastic weight averaging (SWA) was used to obtain better generalization by averaging multiple local optima. Our method was evaluated using three publicly available datasets, such as ISIC-2017, ISIC-2018, and PH2. We obtained dice coefficient and Jaccard index of 87.74%, 80.25%;91.01%, 83.44%;and 95.18%, 91.08% for ISIC-2017, ISIC-2018, and PH2, respectively. The experimental results proved that the modified MobileNetV3-UNet method can outperform several state-of-the-art methods. © 2021 The Authors

关键词： encoder-decoder MobileNet Random augmentation Skin lesion segmentation Stochastic weight averaging U-net

来源：评论

学校读者我要写书评

暂无评论

An encoder-decoder based grapheme-to-phoneme converter for Bangla speech synthesis

引用

ACOUSTICAL SCIENCE AND TECHNOLOGY 2019年第6期40卷 374-381页

作者： Ahmad, Arif Selim, Mohammad Reza Iqbal, Muhammed Zafar Rahman, Mohammad Shahidur Shahjalal Univ Sci & Technol Dept Comp Sci & Engn Sylhet 3114 Bangladesh

This paper proposes an encoder-decoder based sequence-to-sequence model for Grapheme-to-Phoneme (G2P) conversion in Bangla (Exonym: Bengali). G2P models are key components in speech recognition and speech synthesis systems as they describe how words are pronounced. Traditional, rule-based models do not perform well in unseen contexts. We propose to adopt a neural machine translation (NMT) model to solve the G2P problem. We used gated recurrent units (GRU) recurrent neural network (RNN) to build our model. In contrast to joint-sequence based G2P models, our encoder-decoder based model has the flexibility of not requiring explicit grapheme-to-phoneme alignment which are not straight forward to perform. We trained our model on a pronunciation dictionary of (approximately) 135,000 entries and obtained a word error rate (WER) of 12.49% which is a significant improvement from the existing rule-based and machine-learning based Bangla G2P models.

关键词： encoder-decoder Sequence-to-sequence GRU-RNN NMT

来源：评论

学校读者我要写书评

暂无评论

State-of-charge sequence estimation of lithium-ion battery based on bidirectional long short-term memory encoder-decoder architecture

引用

JOURNAL OF POWER SOURCES 2020年 449卷 227558-000页

作者： Bian, Chong He, Huoliang Yang, Shunkun Huang, Tingting Beihang Univ Sch Automat Sci & Elect Engn Beijing 100191 Peoples R China Beihang Univ Sch Reliabil & Syst Engn Beijing 100191 Peoples R China

State-of-charge (SOC) estimation of lithium-ion batteries based on deep learning techniques has been receiving considerable attention. However, most deep-learning-based methods focus on SOC estimation at fixed ambient temperatures and cannot provide useful indications for battery state in real-world scenarios because batteries usually experience varying temperatures during operation. In this study, an encoder-decoder with bidirectional long short-term memory (LSTM) is proposed for estimating the SOC at different temperature conditions. This end-to-end model can learn sequential information from the measurement sequences to characterize battery dynamics for sequence estimation. Introducing the bidirectional LSTMs into the encoder-decoder enables the model to capture the long-term dependencies of the measurement sequences from both past and future directions to increase the estimation accuracy. The proposed method is evaluated on public battery datasets under dynamic loading profiles. Validation with an experimental dataset shows that this method of considering the sequential contexts and bidirectional dependencies of battery measurement data can accurately estimate the SOC at different ambient temperatures. In particular, the mean absolute errors are as low as 1.07% at varying temperatures. The proposed method can improve the reliability and availability of battery management systems for monitoring the battery state under varying ambient conditions.

关键词： Lithium-ion battery encoder-decoder Bidirectional long short-term memory State-of-charge sequence estimation

来源：评论

学校读者我要写书评

暂无评论

Robust Cultivated Land Extraction Using encoder-decoder

引用

Journal of New Media 2020年第4期2卷 149-155页

作者： Aziguli Wulamu Jingyue Sang Dezheng Zhang and Zuxian Shi Department of Computer School of Computer and Communication EngineeringUniversity of Science and Technology Beijing(USTB)Beijing100083China Beijing Key Laboratory of Knowledge Engineering for Materials Science Beijing100083China 不详

Cultivated land extraction is essential for sustainable development and *** this paper,the network we propose is based on the encoder-decoder structure,which extracts the semantic segmentation neural network of cultivated land from satellite images and uses it for agricultural automation *** encoder consists of two part:the first is the modified Xception,it can used as the feature extraction network,and the second is the atrous convolution,it can used to expand the receptive field and the context information to extract richer feature *** decoder part uses the conventional upsampling operation to restore the original *** addition,we use the combination of BCE and Loves-hinge as a loss function to optimize the Intersection over Union(IoU).Experimental results show that the proposed network structure can solve the problem of cultivated land extraction in Yinchuan City.

关键词： Semantic segmentation encoder-decoder cultivated land extraction atrous convolution

来源：评论

学校读者我要写书评

暂无评论

A Novel Deep Learning-Based encoder-decoder Model for Remaining Useful Life Prediction

A Novel Deep Learning-Based Encoder-Decoder Model for Remain...

引用

International Joint Conference on Neural Networks (IJCNN)

作者： Liu, Hui Liu, Zhenyu Jia, Weiqiang Lin, Xianke Zhejiang Univ State Key Lab CAD & CG Hangzhou Peoples R China Univ Ontario Inst Technol Dept Mech Engn Oshawa ON Canada

ISBN: (纸本)9781728119854

A novel encoder-decoder model based on deep neural networks is proposed for the prediction of remaining useful life (RUL) in this work. The proposed model consists of an encoder and a decoder. In the encoder, the Bi-directional Long Short-Term Memory Networks (Bi-LSTM) and Convolutional Neural Networks (CNN) are used to capture the long-term temporal dependencies and important local features from the sequential data, respectively. Besides, single 1*1 convolution filter in the last convolutional layer is used for dimensionality reduction. In the decoder, the fully connected networks are employed to decode the feature information to predict RUL. In addition, the proposed data-driven method can achieve end-to-end prediction, which does not need feature engineering. To evaluate the proposed model, experimental verification is carried out on a commonly used aero-engine C-MAPSS dataset. Compared with other state-of-the-art approaches on the same dataset, the effectiveness and superiority of the proposed framework are demonstrated. For example, the scoring function value of the second subset is reduced by up to 64.99% compared with the best existing result.

关键词： remaining useful life encoder-decoder end-to-end data-driven

来源：评论

学校读者我要写书评

暂无评论

Automatic Generation of Chinese Couplets with Attention Based encoder-decoder Model 2

Automatic Generation of Chinese Couplets with Attention Base...

引用

2nd IEEE International Conference on Multimedia Information Processing and Retrieval (MIPR)

作者： Yuan, Shengqiong Zhong, Luo Li, Lin Zhang, Rui Wuhan Univ Technol Sch Comp Sci & Technol Wuhan Hubei Peoples R China

ISBN: (纸本)9781728111988

Chinese couplets, as one of the traditional Chinese culture, is the treasure of Chinese civilization and the inheritance of Chinese history. Given a sentence (namely an antecedent clause), people reply with another sentence (namely a subsequent clause) equal in length. Because of the complexity of the semantic and grammatical rules of couplet, it is not easy to create a suitable couplet that meets the requirements of sentence pattern, context, and flatness. In this paper, given the issued antecedent clause, we can automatically generate the subsequent clause by encoder-decoder model. Moreover, to satisfy special characteristics of couplets, we incorporate the attention mechanism into the encoding-decoding process, which greatly improves the accuracy of couplets generated automatically.

关键词： Deep learning Recurrent Neural Network attention encoder-decoder couplet

来源：评论

学校读者我要写书评

暂无评论

Fully Convolutional encoder-decoder Architecture (FCEDA) for Skin Lesions Segmentation 11th

Fully Convolutional Encoder-Decoder Architecture (FCEDA) for...

引用

11th International Conference on Computational Collective Intelligence (ICCCI)

作者： Adegun, Adekanmi Viriri, Serestina Univ KwaZulu Natal Sch Math Stat & Comp Sci ZA-4000 Durban South Africa

ISBN: (纸本)9783030283773;9783030283766

Segmentation which is identification of regions of interest (ROIs) in medical images is a very important step for image analysis in computer-aided diagnosis systems. Accurate segmentation of skin lesions images plays a vital role in efficient diagnosis of melanoma skin cancer. Diagnosis of melanoma cancer through the segmentation of skin lesions is a challenging task due to possible presence of noise and artefacts such as hairs, air or oil bubbles on the skin lesion images. Skin lesions images are also sometimes characterized with weak edges, irregular and fuzzy borders, marks, dark corners, skin lines and blood vessels on skin lesions. Recently, segmentation methods based on Fully Convolutional encoder-decoder Architecture (FCEDA) have achieved great success in medical images. This work presents automatic skin lesion segmentation method that is based on Fully Convolutional encoder-decoder Architecture. Two types of FCEDA namely U-Net and SegNet architectures, have been examined and utilized for segmentation of skin lesion images. The performance analysis of the two architectures have been conducted. Evaluation and comparison of these two architectures were also carried out. This work finds out and proposes possible improvements of these methods on the segmentation of skin lesions. It is also a systematic comparison of U-Net and SegNet models on the segmentation of skin lesion images. The paper discovers how deep learning methods can be explored using a supervised approach to get accurate results with less complexity possible. The models were evaluated on skin lesion challenge dataset in ISIC 2018 dermoscopic images archives.

关键词： Melanoma U-Net Deep learning FCEDA encoder-decoder SegNet Segmentation

来源：评论

学校读者我要写书评

暂无评论

An Effective encoder-decoder Network for Neural Cell Bodies and Cell Nucleus Segmentation of EM Images 41

An Effective Encoder-Decoder Network for Neural Cell Bodies ...

引用

41st Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)

作者： Jiang, Yi Xiao, Chi Li, Linlin Chen, Xi Shen, Lijun Han, Hua Chinese Acad Sci Inst Automat Beijing 100190 Peoples R China Chinese Acad Sci Inst Automat Natl Lab Pattern Recognit Beijing 100190 Peoples R China Chinese Acad Sci Ctr Excellence Brain Sci & Intelligence Technol Shanghai 200031 Peoples R China Univ Chinese Acad Sci Sch Future Technol Beijing 101408 Peoples R China

ISBN: (纸本)9781538613115

Neural systems are complicated networks connected by a large number of neurons through gap junctions and synapse. At present, for electron microscopy connectomics research, neuron structure recognition algorithms mostly focus on synapses, dendrites, axons and mitochondria, etc. However, effective methods for automatic recognition of neuronal cell bodies are rare. In this paper, we proposed an effective encoder-decoder network, which extracted segmentation features of neural cell bodies and cell nucleus by the modified residual network and pyramid module. The framework is capable of merging multi-scale contextual information and generating efficient segmentation results by integrating multilevel features. We applied this proposed network on two segmentation tasks for electron microscope (EM) images and compared it with other promising methods as U-Net and deeplab v3+. The results demonstrated that our method achieved the state-of-the-art performance on quality metrics. Finally, we visualized two intact neural cell bodies and cell nucleus to provide a close look into these fine structures.

关键词： encoder-decoder Electron Microscopy Neural Cell Bodies Cell Nucleus Image Segmentation

来源：评论

学校读者我要写书评

暂无评论

A method of face repair based on encoder-decoder and dual discrimination network

A method of face repair based on encoder-decoder and dual di...

引用

第三十九届中国控制会议

作者： Cui Can Zhao Jun Xiong Xingzhong Nathaniel O.Edwards Sichuan University of Science ＆ Engineering

Neural networks have made significant achievements in the field of image restoration. To efficiently repair facial images with large areas damaged, a decoder-encoder structured convolutional neural network is used as a generative model and skip-connection is added between some of its layers to enhance the structure prediction ability of the generated model and well suppressed the problem that the repair network is easy to over-fitting. The global discrimination network mostly uses the image’s edge structure and feature information to ensure that the repaired image, which is the output from the repair network, conforms to visual connectivity, while the local discriminators, not only recognize local consistency but also optimize more details. The network structure proposed in this paper combines the encoder-decoder, skip-connection, and dual discriminator networks to improve the effect of face completion. The experimental results on the CelebA show that the proposed method is superior to other methods in repairing images with large areas of damage.

关键词： Generative Adversarial Network（GAN） face inpainting skip-connection encoder-decoder global and local discriminators

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：