Fabric defect detection plays an increasingly important role in the industrial automation application for fabric production, but how to detect defects rapidly and accurately is still challenging. In this study, we pro...
详细信息
Fabric defect detection plays an increasingly important role in the industrial automation application for fabric production, but how to detect defects rapidly and accurately is still challenging. In this study, we propose a powerful fabric defect detection method using a hybrid of convolutional neural network (CNN) and variational autoencoder (VAE). The convolutional layers are used for extracting fabric image pattern features and the variational autoencoder is used for modeling the latent characteristics and inferring a reconstruction. The defect positions can be detected by the differences between the original image and the reconstruction image. The proposed method is validated on public patterned fabric datasets. The experimental results demonstrate that the proposed model can achieve outstanding performance in both image level and pixel level defect detection.
Ultraviolet (UV)-curable thermoset shape memory polymers (TSMPs) with high recovery stress but mild glass transition temperature (T-g) are highly desired for 3D/4D printing lightweight load-bearing structures and devi...
详细信息
Ultraviolet (UV)-curable thermoset shape memory polymers (TSMPs) with high recovery stress but mild glass transition temperature (T-g) are highly desired for 3D/4D printing lightweight load-bearing structures and devices. However, a bottleneck is that high recovery stress usually means high T-g. For a few TSMPs with high recovery stress, their T-g values are close to the decomposition temperature, and thus, the shape memory effect cannot be triggered safely and effectively. While machine learning (ML) has served as a useful tool to discover new materials and drugs, the grand challenge of using ML to discover new TSMPs persists in the very limited data available. Here, we report an enhanced ML approach by combining the transfer learning-variational autoencoder with a weighted-vector combination method. By learning a large data set with drug molecules in a pretraining process, we were able to effectively map the TSMPs to a hidden space that is much closer to a Gaussian distribution. Through this approach, we created a large compositional space and were able to discover five new types of UV-curable TSMPs with desired properties, one of which was validated by the experiments. Our contribution includes (1) representing the features of TSMPs by drug molecules to overcome the barrier of a limited training data set and (2) developing a ML framework that is able to overcome the barrier of mapping the molar ratio information. It is shown that this approach can effectively learn TSMP features by utilizing the relatedness between the data-scarce (and biased) TSMP target and data-abundant drug source, and the result is much more accurate and more robust than the benchmark set by the support vector machine method using direct label encoding and Morgan encoding. Therefore, it is believed that this framework is a state-of-the-art study in the TSMP field. This study opens new opportunities for discovering not only new TSMPs but also other thermoset polymers.
The regression based deep neural networks have achieved state-of-the-arts performance on depth 3D hand pose estimation task. This paper focuses on improving the regression mapping between features and pose joints. Ins...
详细信息
The regression based deep neural networks have achieved state-of-the-arts performance on depth 3D hand pose estimation task. This paper focuses on improving the regression mapping between features and pose joints. Inspired by the distribution modeling ability of variational autoencoders, we introduce an auxiliary variable into the regression network. During training, the auxiliary variable is modeled by an inference distribution that learns the underlying structural kinematics of human hand. Different with other regression methods on hand poses, our network estimates the pose joints from input depth features and the learned auxiliary variable as well. We show that by introducing the auxiliary variable, the regression is benefited from 1) regularization modeled by inference distribution;and 2) prior information carried by the auxiliary model. The effectiveness of the proposed regression method is evaluated with extensively self-comparative experiments and in comparison with other regression methods on hand pose datasets. The proposed network is easy to train in an end-to-end manner and can work with various feature extraction methods. We apply the proposed regression method to an existing hand pose estimation system, and improves the estimation accuracy by 18.35% and 16.65% on public hand pose datasets.
In powder metallurgy, iron -based powders are compacted to so-called green bodies, which undergo a subsequent sintering process. The microstructure of the green bodies affects the sintering process and the resulting p...
详细信息
In powder metallurgy, iron -based powders are compacted to so-called green bodies, which undergo a subsequent sintering process. The microstructure of the green bodies affects the sintering process and the resulting properties of the produced component. Assessing the microstructure experimentally is time consuming and costly, while simulation approaches rely on simplifications that significantly affect relevant microstructural information. This work investigates the suitability of deep generative models to synthetically generate realistic micrographs of green bodies as an alternative to experiments and simulations. For that purpose experiments comparing Generative Adversarial Neural Networks and variational autoencoders were conducted. The two deep learning frameworks were used to train models which produce synthetic micrographs in dependence of two parameters, namely powder particle size and green body porosity. The data utilized to train and evaluate the models was acquired from compacted water -atomized Astaloy 85Mo via scanning electron microscopy. Besides visual inspection, the trained models were evaluated via quantitative metrics. Distributions of relevant microstructural parameters such as pore perimeter or minimum and maximum pore feret diameter of the synthetically generated micrographs were compared to those of experimentally acquired micrographs. The results indicate that Generative Adversarial Networks represent a promising approach to train models, which produce synthetic micrographs that capture relevant properties of green body micrographs in dependence of given parameters such as powder particle size. In contrast, the variational autoencoders trained for this study did not capture microstructural properties well.
With the rise of cloud computing, many applications have been implemented into microservices to fully utilize cloud computing for scalability and maintainability purposes. However, there are some traditional monolith ...
详细信息
ISBN:
(数字)9781665475341
ISBN:
(纸本)9781665475341
With the rise of cloud computing, many applications have been implemented into microservices to fully utilize cloud computing for scalability and maintainability purposes. However, there are some traditional monolith applications that developers would like to partition into microservices. Unfortunately, it is difficult to find a solution when considering multiple factors (i.e., the strong dependency in each cluster and how often different microservices communicate with each other). Further, because we allow duplications of classes in multiple microservices to reduce the communications between them, the number of duplicated classes is also another important factor for maintainability. Therefore, we need to use machine learning algorithms to approximate a good solution due to the infeasibility of finding the optimal solution. We apply the variational autoencoder to extract features of classes and use the fuzzy c means to group the classes into microservices according to their extracted features. As a result, our approach outperforms the other baselines in some significant metrics. Also, when we allow duplication, we find that it is helpful in terms of reducing the overhead of communications between microservices.
Infectious keratitis is a major cause of visual impairment and a common blinding eye disease. Deep learning based prior researches mainly regard infectious keratitis diagnosis as a classification task on the slit-lamp...
详细信息
ISBN:
(数字)9781665496209
ISBN:
(纸本)9781665496209
Infectious keratitis is a major cause of visual impairment and a common blinding eye disease. Deep learning based prior researches mainly regard infectious keratitis diagnosis as a classification task on the slit-lamp images of single-visit. However, in real clinical applications, it is critical to analyze the lesion evolution characteristics represented by time-varying features over multiple-visits. To bridge this gap, in this paper, we focus on the problem with sequential clinical images of patients, and propose a novel disentangled sequential auto-encoder (DSLC-VAE) algorithm to separate the time-varying pathological features from the time-invariant ones for infectious keratitis diagnosis. Specifically, a inference model is exploited to generate time series of the shape and appearance of corneal lesions to represent keratitis progression, which are combined with location-related features to identify keratitis pathogen. Moreover, we construct a local consistent regularizer with a self-supervised task to enhance the consistency of the time-varying features across different infectious keratitis. Extensive experiments on real world dataset demonstrate superiority of our DSLC-VAE on both representation disentanglement and diagnosis accuracy.
This study addresses the optimization of the Vehicle Routing Problem (VRP) with prioritized customers by introducing and comparing two advanced solution approaches: a metaheuristic-based hyperheuristic framework and a...
详细信息
This study addresses the optimization of the Vehicle Routing Problem (VRP) with prioritized customers by introducing and comparing two advanced solution approaches: a metaheuristic-based hyperheuristic framework and a variational autoencoder (VAE)-based hyperheuristic. The VRP with prioritized customers introduces additional complexity by requiring efficient routing while ensuring high-priority customers receive service within strict constraints. To tackle this challenge, the proposed metaheuristic-based hyperheuristic dynamically selects and adapts low-level heuristics using Simulated Annealing (SA) and Ant Colony Optimization (ACO), enhancing search efficiency and solution quality. In contrast, the VAE-based approach leverages deep learning to model historical routing patterns and autonomously generate new heuristics tailored to problem-specific characteristics. Through extensive computational experiments on benchmark VRP instances, our results reveal that both approaches significantly enhance routing efficiency, with the VAE-based method demonstrating superior generalization across varying problem structures. Specifically, the VAE-based approach reduces total travel costs by an average of 8% and improves customer priority satisfaction by 95% compared to traditional hyperheuristic methods. Moreover, a comparative analysis with recent state-of-the-art algorithms highlights the competitive performance of our approaches in balancing computational efficiency and solution quality. These findings underscore the potential of integrating metaheuristics with machine learning in complex routing problems and provide valuable insights for real-world logistics and transportation planning. Future research will explore the generalization of these methodologies to dynamic and large-scale routing scenarios.
IoT sensors are becoming increasingly important supplement to traditional monitoring systems, particularly for in-situ based monitoring. Data collected using IoT sensors are often plagued with missing values occurring...
详细信息
IoT sensors are becoming increasingly important supplement to traditional monitoring systems, particularly for in-situ based monitoring. Data collected using IoT sensors are often plagued with missing values occurring as a result of sensor faults, network failures, drifts and other operational issues. Missing data can have substantial impact on in-field sensor calibration methods. The goal of this research is to achieve effective calibration of sensors in the context of such missing data. To this end, two objectives are presented in this paper. 1) Identify and examine effective imputation strategy for missing data in IoT sensors. 2) Determine sensor calibration performance using calibration techniques on data set with imputed values. Specifically, this paper examines the performance of variational autoencoder (VAE), Neural Network with Random Weights (NNRW), Multiple Imputation by Chain Equations (MICE), Random Forest-based Imputation (missForest) and K-Nearest Neighbour (KNN) for imputation of missing values on IoT sensors. Furthermore, the performance of sensor calibration via different supervised algorithms trained on the imputed dataset were evaluated. The analysis showed VAE technique to outperform the other methods in imputing the missing values at different proportions of missingness on two real-world datasets. Experimental results also showed improved calibration performance with imputed dataset.
Laughter is one of the most important nonverbal sound that human generates. It is a means for expressing his emotions. The acoustic and contextual features of this specific sound are different from those of speech and...
详细信息
Laughter is one of the most important nonverbal sound that human generates. It is a means for expressing his emotions. The acoustic and contextual features of this specific sound are different from those of speech and many difficulties arise during their modeling process. During this work, we propose an audio laughter generation system based on unsupervised generative models: the autoencoder (AE) and its variants. This procedure is the association of three main sub-process, (1) the analysis which consist of extracting the log magnitude spectrogram from the laughter database, (2) the generative models training, (3) the synthesis stage which incorporate the involvement of an intermediate mechanism: the vocoder. To improve the synthesis quality, we suggest two hybrid models (LSTM-VAE, GRU-VAE and CNN-VAE) that combine the representation learning capacity of variational autoencoder (VAE) with the temporal modelling ability of a long short-term memory RNN (LSTM) and the CNN ability to learn invariant features. To figure out the performance of our proposed audio laughter generation process, objective evaluation (RMSE) and a perceptual audio quality test (listening test) were conducted. According to these evaluation metrics, we can show that the GRU-VAE outperforms the other VAE models.
Noise from automobiles, such as buzzing, squeaking, and rattling (BSR) noises, is a key factor in automobile quality assessment. It is necessary to classify these noises for appropriate handling and prevention. Althou...
详细信息
Noise from automobiles, such as buzzing, squeaking, and rattling (BSR) noises, is a key factor in automobile quality assessment. It is necessary to classify these noises for appropriate handling and prevention. Although many researchers have conducted studies to classify noise, they suffer from several problems: difficulty in extracting appropriate features, insufficient data to train a classifier, and weak robustness to surrounding noise. This paper proposes a method called latent semantic controlling generative adversarial networks (LSC-GAN) to solve these problems. To capture the features of data, a variational autoencoder (VAE), an autoencoder with approximate inference in a latent Gaussian model, learns the data representation by projecting them into the latent space according to their features and reconstructing the projected data. Because the generator and the discriminator of the LSC-GAN are trained simultaneously, the capacity to extract the characteristics of the data is improved and a knowledge space of classifiable data is also expanded with insufficient data. While data are generated by the generator, the encoder projects them back to the latent space according to their characteristics to advance the ability to extract features. Finally, the encoder is trained to the classifier, which is trained to classify BSR noises. The proposed classifier outperforms other models and achieves an accuracy of 96.68%. We confirm using a confusion matrix that the proposed model classifies the types of insufficient class better than other models. Our proposed model classifies data with accuracy of 94.68%, even if the data contains surrounding noise, which means it is more robust to BSR with surrounding noise than other models. (c) 2020 Elsevier B.V. All rights reserved.
暂无评论