Search Results - Inner Mongolia University Library

16th International Conference on Signal-Image Technology and Internet-Based Systems (SITIS)

Authors: Su, Qiqi; Iliadou, Eleftheria. Affiliations: Univ London Dept Comp Sci City London England; Natl & Kapodistrian Univ Dept Otorhinolaryngol 1 Athens Med Sch Athens Greece

ISBN: (print) 9781665464956

Understanding the factors that contribute to optimal hearing aid fitting and hearing aid user experiences is crucial in order to increase the satisfaction and quality of life of hearing loss patients, as well as reduce societal and financial burdens. This work proposes a novel framework that uses an encoder-decoder with an attention mechanism (attn-ED) for predicting future hearing aid usage and SHAP to explain the factors contributing to this prediction. Experiments demonstrate that attn-ED performs well at predicting future hearing aid usage, and that SHAP can be utilized to calculate the contribution of different factors affecting hearing aid usage. This framework aims to establish confidence that AI models can be utilized in the medical domain with the use of XAI methods. Moreover, the proposed framework can also assist clinicians in determining the nature of interventions.

Keywords: XAI Hearing Loss encoder-decoder Attention Mechanism Hearing Aid Usage

A Nested Residual encoder-decoder Network for Overhead Contact System Fastener Anomaly Detection

IEEE ACCESS, 2021, vol. 9, pp. 74959-74968

Authors: Wei, Tiantian; Guo, Qifan; Zhang, Xuewu; Zhang, Cheng; Jing, Wenfeng. Affiliations: Xi An Jiao Tong Univ Sch Math & Stat Xian 710049 Peoples R China; China Railway First Survey & Design Inst Grp Co L Xian 710043 Peoples R China

An overhead contact system (OCS) is key to providing power to high-speed railways. OCS detection is an important measure to ensure the safe operation of a high-speed railway. At present, OCS anomaly detection mainly relies on the manual analysis of the images regularly collected by the 4C system, which is very inefficient and can easily miss anomalies. Although some classification and object detection methods based on deep learning can be used for OCS anomaly detection, the small number of anomalous OCS image samples makes it difficult to train deep networks effectively. Considering that most OCS faults are abnormal fasteners, we propose an anomaly detection method based on normal images, called the nested residual encoder-decoder network (NRE-Net). This network consists of two nested encoder-decoder networks, where the encoder is the shared part, and a residual structure is added to the encoding and decoding branches to enhance the feature expression ability. The experimental results show that the method can greatly improve the accuracy of anomaly detection for the CIFAR-10 dataset and OCS fastener dataset. Compared with the previous state-of-the-art approaches, the F1 score of the proposed method for the two fastener classes in the OCS fastener dataset has increased by 10.8% and 11.9%, respectively.

Keywords: OCS anomaly detection encoder-decoder distance measurement

Robust encoder–decoder learning framework for offline handwritten mathematical expression recognition based on a multi-scale deep neural network

Science China (Information Sciences), 2021, vol. 64, no. 3, pp. 220-222

Authors: Guangcun SHAN; Hongyu WANG; Wei LIANG; Kai CHEN. Affiliations: School of Instrumentation Science and Optoelectronics Engineering, Beihang University; Institute of Electronics, Chinese Academy of Sciences; School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences; Boheng Technology (Hangzhou) Co., Ltd

Dear editor, Mathematical expressions have been widely employed in scientific research, finance, and statistics, and play a significant role in educational activities. For example, if a computer can recognize teachers' handwritten expressions as standard printed mathematical expressions, this will undoubtedly help improve the effectiveness of lectures. Thus, the question of how to make computers automatically recognize mathematical expressions is highly significant.

Keywords: encoder-decoder Multi-Scale Attention Handwritten mathematical expression recognition Attention recurrent neural network Dense Network

An Effective Classification Method for Hyperspectral Image With Very High Resolution Based on encoder-decoder Architecture

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2021, vol. 14, pp. 1509-1519

Authors: Zhang, Zhen; Jiang, Tao; Liu, Chenxi; Zhang, Linjing. Affiliations: Shandong Univ Sci & Technol Coll Geodesy & Geomat Qingdao 266590 Peoples R China

Hyperspectral images with very high resolution (VHR-HSI) have become considerably valuable due to their abundant spectral and spatial details. Classification of hyperspectral images (HSIs) is a basic and important procedure for diverse applications. However, low interclass spectral variability and high intraclass spectral variability in VHR-HSI, together with shadows, pedestrians, and a low signal-to-noise ratio, increase the fuzziness of different categories. To address the known challenges of VHR-HSI classification, an effective classification method based on encoder-decoder architecture is proposed. The proposed algorithm is an object-level contextual convolutional neural network based on an improved residual network backbone with 3-D convolution, which fully considers the spatial-spectral and contextual features of HSIs. Two aerial HSIs with different spatial resolutions are used as experimental data. The results show that the overall accuracy of the proposed method is improved by 7.42% and 18.82%, respectively, compared to the pixelwise convolutional neural network and the DeepLabv3 algorithm, indicating that the method is highly suitable for HSI classification with very high spatial resolution.

Keywords: encoder-decoder hyperspectral image (HSI) with high spatial resolution image classification 3-D convolution residual network

Long-Term Traffic Prediction Based on LSTM encoder-decoder Architecture

IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, vol. 22, no. 10, pp. 6561-6571

Authors: Wang, Zhumei; Su, Xing; Ding, Zhiming. Affiliations: Beijing Univ Technol Coll Comp Sci Beijing Adv Innovat Ctr Future Internet Technol Beijing 100124 Peoples R China; Chinese Acad Sci Inst Software Beijing 100190 Peoples R China; Beijing Univ Technol Coll Comp Sci Beijing 100124 Peoples R China

Accurate traffic flow prediction is becoming increasingly important for successful transportation planning, control, management, and information services. Numerous existing models focus on short-term traffic forecasts, but effective long-term forecasting of traffic flows has become a challenging issue in recent years. To solve this problem, this paper proposes a deep learning architecture consisting of two parts: the long short-term memory encoder-decoder structure at the bottom and the calibration layer at the top. In the encoder-decoder model, we propose a hard attention mechanism based on learning similar patterns to enhance neuronal memory and reduce the accumulation of error propagation. To correct some of the missing details, we design a control gate in the calibration layer to learn the predicted data in groups according to different forms. The proposed method is evaluated on real-world datasets and compared with other state-of-the-art methods. It is verified that our model can accurately learn local features and long-term dependencies, and has better accuracy and stability in long-term sequence prediction.

Keywords: Predictive models Forecasting Deep learning Calibration Market research Neural networks Prediction algorithms Freeway traffic flow long-term prediction encoder-decoder similar pattern attention

Multi-Granularity Sequence Alignment Mapping for encoder-decoder Based End-to-End ASR

IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, vol. 29, pp. 2816-2828

Authors: Tang, Jian; Zhang, Jie; Song, Yan; McLoughlin, Ian; Dai, Li-Rong. Affiliations: Univ Sci & Technol China USTC Natl Engn Lab Speech & Language Informat Proc Hefei 230026 Peoples R China

Encoder-decoder based automatic speech recognition (ASR) methods are increasingly popular due to their simplified processing stages and low reliance on prior knowledge. Conventional encoder-decoder based approaches usually learn a sequence-to-sequence mapping function from the source speech to target units (e.g., subwords, characters) in an end-to-end manner. However, it is still unclear how to choose the optimal target unit, or granularity of multiple units. Because increasing the information available for learning sequence-to-sequence mapping functions can generally improve modeling effectiveness, we propose a multi-granularity sequence alignment (MGSA) approach. This aims to enhance cross-sequence interactions between different granularity units for both modeling and inference stages in encoder-decoder based ASR. Specifically, a decoder module is designed to generate multi-granularity sequence predictions. We then exploit the latent alignment mapping among units having different levels of granularity, by utilizing the decoded multi-level sequences as input for model prediction. The cross-sequence interaction can also be employed to re-calibrate output probabilities in the proposed post-inference algorithm. Experimental results on both WSJ-80 hrs and Switchboard-300 hrs datasets show the superiority of the proposed method compared to traditional multi-task methods as well as to single granularity baseline systems.

Keywords: Speech processing Multi-granularity sequence alignment end-to-end ASR encoder-decoder post-inference deep learning

Automated Bridge Crack Detection Based on Improving encoder-decoder Network and Strip Pooling

JOURNAL OF INFRASTRUCTURE SYSTEMS, 2023, vol. 29, no. 2, 04023004

Authors: Li, Gang; Fang, Zhongyuan; Mohammed, Al Mahbashi; Liu, Tong; Deng, Zhihao. Affiliations: Changan Univ Sch Energy & Elect Engn Xian 710064 Shaanxi Peoples R China; Changan Univ Sch Elect & Control Engn Xian 710064 Shaanxi Peoples R China

The detection of bridge cracks is an important task in bridge maintenance. It can also reflect the health of the bridge. However, cracks are usually in the form of strips, which are different from the concrete surface. Most crack detection algorithms cannot adapt to this situation well. In this paper, original images of bridge cracks are collected and a data set is obtained through image processing. A bridge crack detection method based on an improved encoder-decoder and a mixed pooling module is proposed. The basic features of the crack images are extracted by an encoder with dilated convolution. In this way, the resolution of the feature map can be preserved, and a large receptive field can be obtained. The feature map then passes through the mixed pooling module, which helps to capture long-range context information and establish long-range dependencies. Finally, the decoder restores the image to its original size and integrates the original features. In a comparison experiment under the same experimental conditions, we compared our method with classic image segmentation methods such as PSPNet, U-Net, FCN, and DeepLabv3+. The results show that our method achieves 98.3%, 97.3%, 97.6%, and 84.5% in precision, recall, F1-score, and MIoU, respectively, indicating that it has certain advantages in the field of crack detection and segmentation.

Keywords: Crack detection Strip pooling encoder-decoder Mixed pooling module

An encoder-decoder Architecture within a Classical Signal-Processing Framework for Real-Time Barcode Segmentation

SENSORS, 2023, vol. 23, no. 13, 6109

Authors: Gomez-Cardenes, Oscar; Marichal-Hernandez, Jose Gil; Son, Jung-Young; Jimenez, Rafael Perez; Rodriguez-Ramos, Jose Manuel. Affiliations: Univ La Laguna Dept Ind Engn San Cristobal la Laguna 38200 Spain; Konyang Univ Biomed Engn Dept Nonsan Si 320711 South Korea; Univ Las Palmas Gran Canaria Inst Technol Dev & Innovat Commun Las Palmas Gran Canaria 35017 Spain; Wooptix SL Res & Dev Dept San Cristobal la Laguna 38204 Spain

In this work, two methods are proposed for solving the problem of one-dimensional barcode segmentation in images, with an emphasis on augmented reality (AR) applications. These methods take the partial discrete Radon transform as a building block. The first proposed method uses overlapping tiles for obtaining good angle precision while maintaining good spatial precision. The second one uses an encoder-decoder structure inspired by state-of-the-art convolutional neural networks for segmentation while maintaining a classical processing framework, thus not requiring training. It is shown that the second method's processing time is lower than the video acquisition time with a 1024 x 1024 input on a CPU, which had not been previously achieved. The accuracy it obtained on datasets widely used by the scientific community was almost on par with that obtained using the most-recent state-of-the-art methods using deep learning. Beyond the challenges of those datasets, the method proposed is particularly well suited to image sequences taken with short exposure and exhibiting motion blur and lens blur, which are expected in a real-world AR scenario. Two implementations of the proposed methods are made available to the scientific community: one for easy prototyping and one optimised for parallel implementation, which can be run on desktop and mobile phone CPUs.

Keywords: Radon transform scale-space methods multiscale DRT barcodes encoder-decoder pixelwise segmentation classical signal processing

Displacement prediction model for high arch dams using long short-term memory based encoder-decoder with dual-stage attention considering measured dam temperature

ENGINEERING STRUCTURES, 2023, vol. 280, no. 1

Authors: Huang, Ben; Kang, Fei; Li, Junjie; Wang, Feng. Affiliations: Dalian Univ Technol Fac Infrastruct Engn Sch Hydraul Engn Dalian 116024 Peoples R China; Hohai Univ Coll Water Conservancy & Hydropower Engn Nanjing 210098 Peoples R China; China Three Gorges Univ Coll Hydraul & Environm Engn Yichang 443002 Peoples R China

Structural health monitoring methods can provide important information for evaluating the operational status of concrete dams, by establishing accurate models to predict concrete dam behavior from monitored data. This study proposes a model using an encoder-decoder based on a long short-term memory network with a dual-stage attention mechanism (DALSTM) to predict the displacement of concrete arch dams. The encoder-decoder based on a long short-term memory network is a deep learning technique that can perform time series prediction, and the dual-stage attention mechanism focuses on the key information in the dam displacement series to improve performance. The effectiveness and accuracy of the proposed prediction model are analyzed on a high arch dam using measured temperature in the dam body instead of seasonal functions to represent the thermal effect. Compared with traditional stepwise regression, multiple linear regression models, radial basis function networks, and other deep learning models, the proposed approach is more accurate and robust for dam health monitoring.

Keywords: Structural health monitoring Dam behavior prediction High arch dam encoder-decoder Long short-term memory networks Dual-stage attention mechanism

Using Machine Learning to Identify Hydrologic Signatures With an encoder-decoder Framework

WATER RESOURCES RESEARCH, 2023, vol. 59, no. 3, e2022WR033091

Authors: Botterill, Tom E.; McMillan, Hilary K. Affiliations: San Diego State Univ Dept Geog San Diego CA 92182 USA

Hydrologic signatures are quantitative metrics that describe a streamflow time series. Examples include annual maximum flow, baseflow index and recession shape descriptors. In this paper, we use machine learning (ML) to learn encodings that are optimal ML equivalents of hydrologic signatures, and that are derived directly from the data. We compare the learned signatures to classical signatures, interpret their meaning, and use them to build rainfall-runoff models in otherwise ungauged watersheds. Our model has an encoder-decoder structure. The encoder is a convolutional neural net mapping historical flow and climate data to a low-dimensional vector encoding, analogous to hydrological signatures. The decoder structure includes stores and fluxes similar to a classical hydrologic model. For each timestep, the decoder uses current climate data, watershed attributes and the encoding to predict coefficients that distribute precipitation between stores and store outflow coefficients. The model is trained end-to-end on the U.S. CAMELS watershed data set to minimize streamflow error. We show that learned signatures can extract new information from streamflow series, because using learned signatures as input to the process-informed model improves prediction accuracy over benchmark configurations that use classical signatures or no signatures. We interpret learned signatures by correlation with classical signatures, and by using sensitivity analysis to assess their impact on modeled store dynamics. Learned signatures are spatially correlated and relate to streamflow dynamics including seasonality, high and low extremes, baseflow and recessions. We conclude that process-informed ML models and other applications using hydrologic signatures may benefit from replacing expert-selected signatures with learned signatures.

Keywords: hydrologic signature flow metric flow indices machine learning deep learning encoder-decoder
