检索结果-内蒙古大学图书馆

CVAE-DF: A hybrid deep learning framework for fertilization status detection of pre-incubation duck eggs based on VIS/NIR spectroscopy

引用

SPECTROCHIMICA ACTA PART A-MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2024年 320卷 124569页

作者： Wang, Dongqiao Wang, Qiaohua Chen, Zhuoting Guo, Juncai Li, Shijun Huazhong Agr Univ Coll Engn Wuhan 430070 Peoples R China Minist Agr & Rural Agr Key Lab Agr Equipment Midlower Yangtze River Wuhan 430070 Peoples R China Wuhan Univ Sch Comp Sci Wuhan 430072 Peoples R China Huazhong Agr Univ Coll Anim Sci & Vet Med Key Lab Agr Anim Genet Breeding & Reprod Minist Educ Wuhan 430070 Peoples R China

Unfertilized duck eggs not removed prior to incubation will deteriorate quickly, posing a risk of contaminating the normally fertilized duck eggs. Thus, detecting the fertilization status of breeding duck eggs as early as possible is a meaningful and challenging task. Most existing work usually focus on the characteristics of chicken eggs during mid-term hatching. However, little attention has been paid to the detection for duck eggs prior to incubation. In this paper, we present a novel hybrid deep learning detection framework for the fertilization status of pre -incubation duck eggs, termed CVAE-DF, based on visible/near-infrared (VIS/NIR) transmittance spectroscopy. The framework comprises the encoder of a convolutional variational autoencoder (CVAE) and an improved deep forest (DF) model. More specifically, we first collected transmittance spectral data (400 -1000 nm) of 255 duck eggs before hatching. The multiplicative scatter correction (MSC) method was then used to eliminate noise and extraneous information of the raw spectral data. Two efficient data augmentation methods were adopted to provide sufficient data. After that, CVAE was applied to extract representative features and reduce the feature dimension for the detection task. Finally, an improved DF model was employed to build the classification model on the enhanced feature set. The CVAE-DF model achieved an overall accuracy of 95.94 % on the test dataset. These experimental results in terms of four metrics demonstrate that our CVAE-DF method outperforms the traditional methods by a significant margin. Furthermore, the results also indicate that CVAE holds great promise as a novel feature extraction method for the VIS/NIR spectral analysis of other agricultural products. It is extremely beneficial to practical engineering.

关键词： Fertilization Pre-incubation duck eggs Visible/near-infrared spectroscopy convolutional variational autoencoder Improved deep forest

来源：评论

学校读者我要写书评

暂无评论

A data-driven methodology for bridge indirect health monitoring using unsupervised computer vision

引用

MECHANICAL SYSTEMS AND SIGNAL PROCESSING 2024年 210卷

作者： Hurtado, A. Calderon Alamdari, M. Makki Atroshchenko, E. Chang, K. C. Kim, C. W. Univ New South Wales Ctr Infrastruct Engn & Safety Sch Civil & Environm Engn Sydney NSW 2052 Australia Kyoto Univ Grad Sch Engn Dept Civil & Earth Resources Engn Kyoto 6158540 Japan

In recent years, researchers have extensively explored the application of drive -by inspection technology for bridge damage assessment. This approach involves using the response of a sensing vehicle to identify damage. However, many existing methods rely on data collected from both healthy and damaged bridge conditions, which may not always be available. Therefore, this study introduces a fully unsupervised computer vision -based methodology for bridge structural health monitoring (BSHM) using drive -by inspection. It analyzes the time-frequency domain of a two -axle vehicle's response by deriving a novel formulation for the contact point response from vehicle axles. The axles signals are then processed through subtraction, filtering, and decomposition using empirical Fourier decomposition with an improved segmentation approach based on the Savitzky-Golay filter (SGEFD). Relevant Intrinsic Mode Functions (IMF) are extracted as features representing damage, and the Wavelet Synchro-squeezed transform (WSST) is obtained from these features and used as input for the damage assessment algorithm. The performance of two state-of-the-art unsupervised generative machine learning methods, namely convolutional variational autoencoders (CVAE) and convolutional adversarial autoencoders (CAAE), is compared for the damage assessment task. These methods are trained solely with the residual WSST obtained from the vehicle responses when traversing a bridge in its reference state. A damage index (DI) is defined based on the measured error between the original and reconstructed images, and a damage threshold is calculated from the DI distribution of samples from the benchmark bridge state. During testing, the error between the original and reconstructed WSST is compared to the damage threshold, enabling the classification of new samples as healthy or damaged. The methodology is evaluated using both numerical and experimental vehicle-bridge interaction (VBI) models, considering various

关键词： Indirect SHM Deep learning Empirical Fourier decomposition convolutional variational autoencoder convolutional adversarial autoencoder Bridge monitoring Damage assessment

来源：评论

学校读者我要写书评

暂无评论

Characterization of discrete fracture networks with deep-learning based hydrogeophysical inversion

引用

JOURNAL OF HYDROLOGY 2024年 631卷

作者： Deng, Yaping Kang, Xueyuan Ma, Haichun Qian, Jiazhong Ma, Lei Luo, Qiankun Hefei Univ Technol Sch Resources & Environm Engn Hefei 230009 Peoples R China Nanjing Univ Sch Earth Sci & Engn Key Lab Surficial Geochem Minist Educ Nanjing 210023 Peoples R China

Characterization of fracture network is essential for understanding groundwater flow and solute transport, as well as waste storage. Deep -learning based ensemble smoother methods have proven to be effective in estimating hydraulic parameters in porous media. Compared to porous media, fracture fields are highly heterogeneous and typically non -Gaussian distributed, making the estimation of the fracture field from sparse borehole data extremely difficult. In this paper, we developed a joint hydrogeophysical inversion framework to improve the characterization of fracture networks. We first trained a convolutional variational autoencoder (CVAE) network to parameterize the fracture field, and then integrated with the ensemble smoother with multiple data assimilation (ESMDA) method to infer the fracture distribution by incorporating multiple datasets. Two numerical cases with different complexity were considered to assess the ability of the proposed joint inversion framework. The results show that the proposed hydrogeophyscial inversion framework can capture the main features of the fracture field. By integrating both the pressure and SP data, the fracture field can be reconstructed with an improved accuracy and reduced estimation uncertainty.

关键词： Fracture network Hydrogeophysical inversion Deep learning convolutional variational autoencoder Data assimilation

来源：评论

学校读者我要写书评

暂无评论

Unsupervised Pre-Training of Imbalanced Data for Identification of Wafer Map Defect Patterns

引用

IEEE ACCESS 2021年 9卷 52352-52363页

作者： Shon, Ho Sun Batbaatar, Erdenebileg Cho, Wan-Sup Choi, Seong Gon Chungbuk Natl Univ Res Inst Comp & Informat Commun Cheongju 28644 South Korea Chungbuk Natl Univ Sch Elect Comp Engn Cheongju 28644 South Korea Chungbuk Natl Univ Dept Management Informat Syst Cheongju 28644 South Korea Chungbuk Natl Univ Sch Informat & Commun Engn Cheongju 28644 South Korea

Visual defect inspection and classification are significant steps of most manufacturing processes in the semiconductor and electronics industries. Known and unknown defects on wafer maps tend to cluster, and these spatial patterns provide valuable process information for supporting manufacturing in determining the root causes of abnormal processes. In previous studies, data augmentation-based deep learning (DL) techniques were most commonly used for the identification of wafer map defect patterns (WMDP). Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations were manually designed for the WMDP problem. In this study, we propose a DL-based method with automatic data augmentation for the WMDP task. Basically, it focuses on learning effective discriminative features, from wafer maps, through a deep network structure. The network consists of a convolution-based variational autoencoder (CVAE) sequentially. First, we pre-trained the CVAE on large training data in an unsupervised manner. Second, we fine-tuned the encoder of the CVAE, which was followed by a neural network (NN) classifier, in a supervised manner. Additionally, we describe a simple procedure for automatically searching for improved data augmentation policies. The policy mainly consists of five image processing functions: rotation, flipping, shifting, shearing range, and zooming. The effectiveness of the proposed method was demonstrated through experimental results obtained from a simulation dataset and a real-world wafer map dataset (WM-811K). This study provides guidance for the application of deep learning in semiconductor manufacturing processes to improve product quality and yield.

关键词： Feature extraction Fabrication Semiconductor device modeling Deep learning Visualization convolutional codes Training Classification convolutional variational autoencoder deep learning imbalanced data neural network unsupervised pre-training variational autoencoder wafer map defect patterns

来源：评论

学校读者我要写书评

暂无评论

Comparison of an Automatic Classification of Partial Dischage Patterns for Large Hydrogenerator

Comparison of an Automatic Classification of Partial Dischag...

引用

IEEE International Conference on Prognostics and Health Management (ICPHM)

作者： Kokoko, Olivier Hudon, Claude Levesque, Melanie Amyot, Normand Zemouri, Ryad IREQ Hydro Quebec Varennes PQ Canada 1IREQ Hydro Quebec Varennes PQ Canada CNAM CEDRIC Lab Paris France

ISBN: (纸本)9781665419703

More and more scientific disciplines are using deep techniques for the automatic classification of massive high dimensionality unlabeled data. Among these disciplines, the classification of partial discharge (PD) patterns is one that represents major challenges in the field of hydrogenerator diagnosis. This paper proposes a method of comparison of five classification topology based on a single convolutional variational autoencoder (CVAE) and ten classifiers. The comparison is based on five cases exploiting all the same database, but using five different feature extraction rules to create the input vectors of the neural networks. These feature extraction rules are based on the expert judgement and are automatically computed in the preprocessing stage. Analysis of the output of all classifiers for each topology suggests that the accuracy level of the classification can be significantly improved by refining the feature extraction rules. Moreover, the visualization of the 2D latent space from the CVAE also suggests that the accuracy level can be even further improved if the whole dataset is considered instead of a smaller reference dataset randomly selected. Results raise many questions about the performance of feature extraction rules and the possibilities to better handling classification of large databases such as the one of PD measurement files used for hydrogenerator diagnosis.

关键词： convolutional variational autoencoder diagnosis partial discharges deep neural networks expert knowledge feature extraction hydro-generators latent space partial discharge pattern

来源：评论

学校读者我要写书评

暂无评论

Learning Latent Representation of Freeway Traffic Situations from Occupancy Grid Pictures Using variational autoencoder

引用

ENERGIES 2021年第17期14卷 5232-5232页

作者： Rakos, Oliver Becsi, Tamas Aradi, Szilard Gaspar, Peter Budapest Univ Technol & Econ Dept Control Transportat & Vehicle Syst H-1111 Budapest Hungary Inst Comp Sci & Control Syst & Control Lab H-1111 Budapest Hungary

Several problems can be encountered in the design of autonomous vehicles. Their software is organized into three main layers: perception, planning, and actuation. The planning layer deals with the sort and long-term situation prediction, which are crucial for intelligent vehicles. Whatever method is used to make forecasts, vehicles' dynamic environment must be processed for accurate long-term forecasting. In the present article, a method is proposed to preprocess the dynamic environment in a freeway traffic situation. The method uses the structured data of surrounding vehicles and transforms it to an occupancy grid which a convolutional variational autoencoder (CVAE) processes. The grids (2048 pixels) are compressed to a 64-dimensional latent vector by the encoder and reconstructed by the decoder. The output pixel intensities are interpreted as probabilities of the corresponding field is occupied by a vehicle. This method's benefit is to preprocess the structured data of the dynamic environment and represent it in a lower-dimensional vector that can be used in any further tasks built on it. This representation is not handmade or heuristic but extracted from the database patterns in an unsupervised way.

关键词： NGSIM occupancy grid convolutional variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

Normal Appearance autoencoder for Lung Cancer Detection and Segmentation 22nd

Normal Appearance Autoencoder for Lung Cancer Detection and ...

引用

10th International Workshop on Machine Learning in Medical Imaging (MLMI) / 22nd International Conference on Medical Image Computing and Computer-Assisted Intervention (MICCAI)

作者： Astaraki, Mehdi Toma-Dasu, Iuliana Smedby, Orjan Wang, Chunliang KTH Royal Inst Technol Dept Biomed Engn & Hlth Syst Halsovagen 11C S-14157 Huddinge Sweden Karolinska Univ Sjukhuset Karolinska Inst Dept Oncol Pathol S-17176 Solna Sweden

ISBN: (纸本)9783030322267;9783030322250

One of the major differences between medical doctor training and machine learning is that doctors are trained to recognize normal/healthy anatomy first. Knowing the healthy appearance of anatomy structures helps doctors to make better judgement when some abnormality shows up in an image. In this study, we propose a normal appearance autoencoder (NAA), that removes abnormalities from a diseased image. This autoencoder is semi-automatically trained using another partial convolutional in-paint network that is trained using healthy subjects only. The output of the autoencoder is then fed to a segmentation net in addition to the original input image, i.e. the latter gets both the diseased image and a simulated healthy image where the lesion is artificially removed. By getting access to knowledge of how the abnormal region is supposed to look, we hypothesized that the segmentation network could perform better than just being shown the original slice. We tested the proposed network on the LIDC-IDRI dataset for lung cancer detection and segmentation. The preliminary results show the NAA approach improved segmentation accuracy substantially in comparison with the conventional U-Net architecture.

关键词： Lung nodule segmentation Anomaly detection convolutional variational autoencoder

来源：评论

学校读者我要写书评

暂无评论

DEEP variational FILTER LEARNING MODELS FOR SPEECH RECOGNITION 44

DEEP VARIATIONAL FILTER LEARNING MODELS FOR SPEECH RECOGNITI...

引用

44th IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)

作者： Agrawal, Purvi Ganapathy, Sriram Indian Inst Sci Elect Engn LEAP Lab Bangalore Karnataka India

ISBN: (纸本)9781479981311

We present a novel approach to derive robust speech representations for automatic speech recognition (ASR) systems. The proposed method uses an unsupervised data-driven modulation filter learning approach that preserves the key modulations of speech signal in spectro-temporal domain. This is achieved by a deep generative modeling framework to learn modulation filters using convolutional variational autoencoder (CVAE). A skip connection based CVAE enables the learning of multiple irredundant modulation filters in the time and frequency modulation domain using temporal and spectral trajectories of input spectrograms. The learnt filters are used to process the spectrogram features for ASR training The ASR experiments are performed on Aurora-4 (additive noise with channel artifact) and CHiME-3 (additive noise with reverberation) databases. The results show significant improvements for the proposed CVAE model over the baseline features as well as other robust front-ends (average relative improvements of 9% in word error rate over base-line features on Aurora-4 database and 23% on CHiME-3 database). In addition, the performance of the proposed features is highly beneficial for semi-supervised training of ASR when reduced amounts of labeled training data are available (average relative improvements of 29% over baseline features on Aurora-4 database with 30% of the labeled training data).

关键词： Unsupervised filter learning convolutional variational autoencoder skip connections modulation filtering robust speech recognition

来源：评论

学校读者我要写书评

暂无评论

Unsupervised Raw Waveform Representation Learning for ASR 20

Unsupervised Raw Waveform Representation Learning for ASR

引用

Interspeech Conference

作者： Agrawal, Purvi Ganapathy, Sriram Indian Inst Sci Learning & Extract Acoust Patterns LEAP Lab Dept Elect Engn Bengaluru 560012 India

In this paper, we propose a deep representation learning approach using the raw speech waveform in an unsupervised learning paradigm. The first layer of the proposed deep model performs acoustic filtering while the subsequent layer performs modulation filtering. The acoustic filterbank is implemented using cosine-modulated Gaussian filters whose parameters are learned. The modulation filtering is performed on log transformed outputs of the first layer and this is achieved using a skip connection based architecture. The outputs from this two layer filtering are fed to the variational autoencoder model. All the model parameters including the filtering layers are learned using the VAE cost function. We employ the learned representations (second layer outputs) in a speech recognition task. Experiments are conducted on Aurora-4 (additive noise with channel artifact) and CHiME-3 (additive noise with reverberation) databases. In these experiments, the learned representations from the proposed framework provide significant improvements in ASR results over the baseline filterbank features and other robust front-ends (average relative improvements of 16% and 6% in word error rate over baseline features on clean and multi-condition training, respectively on Aurora-4 dataset, and 21% over the baseline features on CHiME-3 database).

关键词： Unsupervised representation learning raw speech waveform convolutional variational autoencoder cosine-modulated Gaussian filterbank speech recognition

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：