Carbon-fiber-reinforced plastic (CFRP) is a composite material whose base material is plastic and reinforcement material is carbon fibers. CFRP is widely used in various fields for laminating prepregs. The laminated p...
详细信息
Carbon-fiber-reinforced plastic (CFRP) is a composite material whose base material is plastic and reinforcement material is carbon fibers. CFRP is widely used in various fields for laminating prepregs. The laminated plate tends to sustain damage, such as delamination, fiber breakage, and base material breakage;hence, we must conduct high-precision and efficient nondestructive testing (NDT). Examples of NDT are ultrasonic examination, X-ray tomography, and infrared stress analysis. With most NDT methods, it is difficult to easily obtain detailed correct information of defects, such as their depth, position, and size. To solve this problem, we develop a machine-learning-aided inverse analysis model that predicts the spatial information of defects from the sum of the principal stresses on the surface calculated from the temperature change measured by infrared analysis, and we propose it as an alternative method to the existing damage analysis. Applying the proposed method to the simulated stress distributions of quasi-isotropic CFRP laminates with defects, the results showed over 99% success to recognize the detail information of defects. Additionally, we examine the properties of the dataset using a forward analysis model and a variational autoencoder. Our method with a convolutional neural network enables us to successfully estimate the information of defects at high speed. Experimental data can be applicable as well as the simulation results to our proposed method, and we believe our method will be a powerful supporting tool for the current NDT for CFRPs.
The task of emotional voice conversion (EVC) aims to convert speech from one emotional state into another, while keeping linguistic content, speaker identity and other emotion-independent information unchanged. Becaus...
详细信息
The task of emotional voice conversion (EVC) aims to convert speech from one emotional state into another, while keeping linguistic content, speaker identity and other emotion-independent information unchanged. Because previous studies were limited to a specific set of emotions, it is challenging to realize the conversion of emotions never seen in training stage. In this paper, we propose a one-shot emotional voice conversion model based on feature separation. The proposed method could control emotional characteristics with Global Emotion Embeddings (GEEs), and introduce activation guidance (AG) and mutual information (MI) minimization to reduce the correlations between emotion embedding and emotion-independent representation. At run-time conversion, it could produce the desired emotional utterance from a single pairwise utterance without any emotion labels, whether the target emotion appears in the training set or not. The subjective and objective evaluations validate the effectiveness of our proposed model for both the degree of feature separation and emotion expression, even it could achieve unseen emotion conversion.
dB is a web-based interface that serves as a "drummer bot" for exploring interactive groove-making experiences with an AI percussion system. This system, leveraging variational autoencoders (VAEs), transform...
详细信息
Consideration of subgroups or domains within medical image datasets is crucial for the development and evaluation of robust and generalizable machine learning systems. To tackle the domain identification problem, we e...
详细信息
ISBN:
(纸本)9783031439926;9783031439933
Consideration of subgroups or domains within medical image datasets is crucial for the development and evaluation of robust and generalizable machine learning systems. To tackle the domain identification problem, we examine deep unsupervised generative clustering approaches for representation learning and clustering. The variational Deep Embedding (VaDE) model is trained to learn lower-dimensional representations of images based on a Mixture-of-Gaussians latent space prior distribution while optimizing cluster assignments. We propose the Conditionally Decoded variational Deep Embedding (CDVaDE) model which incorporates additional variables of choice, such as the class labels, as conditioning factors to guide the clustering towards subgroup structures in the data which have not been known or recognized previously. We analyze the behavior of CDVaDE on multiple datasets and compare it to other deep clustering algorithms. Our experimental results demonstrate that the considered models are capable of separating digital pathology images into meaningful subgroups. We provide a general-purpose implementation of all considered deep clustering methods as part of the open source Python package DomId (https://***/DIDSR/DomId).
This paper presents a neural architecture search scheme for chip layout hotspot detection. In this work, hotspot detectors, in the form of neural networks, are modeled as weighted directed acyclic graphs. A variationa...
详细信息
ISBN:
(纸本)9798350396249
This paper presents a neural architecture search scheme for chip layout hotspot detection. In this work, hotspot detectors, in the form of neural networks, are modeled as weighted directed acyclic graphs. A variational autoencoder maps the discrete graph topological space into a continuous embedding space. Bayesian Optimization performs neural architecture search in this embedding space, where an architecture performance predictor is employed to accelerate the search process. Experimental studies on ICCAD 2012 and ICCAD 2019 Contest benchmarks demonstrate that, the proposed scheme significantly improves the agility of previous neural architecture search schemes, and generates hotspot detectors with competitive detection accuracy, false alarm rate, and inference time.
Multivariate time series anomaly detection is a crucial area of research in several domains, including finance, logistics, and manufacturing. Successfully identifying abnormal behaviors or events can help prevent disr...
详细信息
ISBN:
(数字)9789819947522
ISBN:
(纸本)9789819947515;9789819947522
Multivariate time series anomaly detection is a crucial area of research in several domains, including finance, logistics, and manufacturing. Successfully identifying abnormal behaviors or events can help prevent disruptions, but the high false positive rate in this field is a significant challenge that affects detection accuracy. In this paper, we propose a novel method, mTranAD, which improves upon the TranAD algorithm by leveraging the benefits of Transformer and variational autoencoder (VAE) in multivariate unsupervised anomaly detection. Specifically, mTranAD replaces TranAD's autoencoder structure with a VAE and trains it using the VAE's loss function. The incorporation of latent variables in the VAE model enables accurate reconstruction of data and mapping of data to a lower dimensional latent space, allowing for a more efficient description of input data complexity with fewer parameters. By utilizing these latent variables, the model can effectively handle high-dimensional, complex data and exhibit greater flexibility when generating new data. We conduct experiments on four public datasets (NAB, MBA, SMAP and WADI) and compare mTranAD's performance with 11 other state-of-the-art methods, including TranAD, MERLIN, LSTM-NDT, OmniAnomaly, USAD, and DAGMM. The experimental results demonstrate that mTranAD outperforms these methods in terms of performance, accuracy, and reliability. The primary purpose of this paper is to improve the TranAD algorithm and enhance the accuracy of multivariate time series anomaly detection by reducing the false positive rate.
In this work, we propose an approach to generate whole-slide image (WSI) tiles by using deep generative models infused with matched gene expression profiles. First, we train a variational autoencoder (VAE) that learns...
详细信息
In this work, we propose an approach to generate whole-slide image (WSI) tiles by using deep generative models infused with matched gene expression profiles. First, we train a variational autoencoder (VAE) that learns a latent, lower-dimensional representation of multi-tissue gene expression profiles. Then, we use this representation to infuse generative adversarial networks (GANs) that generate lung and brain cortex tissue tiles, resulting in a new model that we call RNA-GAN. Tiles generated by RNA-GAN were preferred by expert pathologists compared with tiles generated using traditional GANs, and in addition, RNA-GAN needs fewer training epochs to generate high-quality tiles. Finally, RNA-GAN was able to generalize to gene expression profiles outside of the training set, showing imputation capabilities. A web-based quiz is available for users to play a game distinguishing real and synthetic tiles: https://***/, and the code for RNA-GAN is available here: https://***/gevaertlab/RNA-GAN.
The computer vision community is increasingly interested in exploring hyperbolic space for image representation, as hyperbolic approaches have demonstrated outstanding results in efficiently representing data with an ...
详细信息
ISBN:
(纸本)9798350320107
The computer vision community is increasingly interested in exploring hyperbolic space for image representation, as hyperbolic approaches have demonstrated outstanding results in efficiently representing data with an underlying hierarchy. This interest arises from the intrinsic hierarchical nature among images. However, despite the hierarchical nature of remote sensing (RS) images, the investigation of hyperbolic spaces within the RS community has been relatively limited. The objective of this study is therefore to examine the relevance of hyperbolic embeddings of RS data, focusing on scene embedding. Using a variational Auto-Encoder, we project the data into a hyperbolic latent space while ensuring numerical stability with a feature clipping technique. Experiments conducted on the NWPU-RESISC45 image dataset demonstrate the superiority of hyperbolic embeddings over the Euclidean counterparts in a classification task. Our study highlights the potential of operating in hyperbolic space as a promising approach for embedding RS data.
Visual surveillance has become indispensable in the evolution of Intelligent Transportation Systems (ITS). Video object trajectories are key to many of the visual surveillance applications. Classifying varying length ...
详细信息
Visual surveillance has become indispensable in the evolution of Intelligent Transportation Systems (ITS). Video object trajectories are key to many of the visual surveillance applications. Classifying varying length time series data such as video object trajectories using conventional neural networks, can be challenging. In this paper, we propose trajectory classification and anomaly detection using a hybrid Convolutional Neural Network (CNN) and variational autoencoder (VAE) architecture. First, we introduce a high level features for varying length object trajectories using color gradient representation. In the next stage, a semi-supervised way to annotate moving object trajectories extracted using Temporally Incremental Gravitational Model (TIGM) is used for class labeling. For training, anomalous trajectories are identified using t-Distributed Stochastic Neighbor Embedding (t-SNE). Finally, a hybrid CNN-VAE architecture has been proposed for trajectory classification and anomaly detection. The results obtained using publicly available surveillance video datasets reveal that the proposed method can successfully identify traffic anomalies such as violations in lane driving, sudden speed variations, abrupt termination of vehicle during movement, and vehicles moving in wrong directions. The accuracy of trajectory classification improves by a margin of 1-6% against popular neural networks-based classifiers across various datasets using the proposed high-level features. The gradient representation also improves the anomaly detection accuracy significantly (30-35%). Code and dataset can be found at https://***/santhoshkelathodi/CNN-VAE.
Evacuation planning is important for reducing casualties in toxic gas leak incidents. However, most evacuation plans are too qualitative to be applied to unexpected practical situations. Here, we suggest an evacuation...
详细信息
Evacuation planning is important for reducing casualties in toxic gas leak incidents. However, most evacuation plans are too qualitative to be applied to unexpected practical situations. Here, we suggest an evacuation route proposal system based on a quantitative risk evaluation that provides the safest route for individual evacuees by predicting dynamic gas dispersion with a high accuracy and short calculation time. Detailed evacuation scenarios, including weather conditions, leak intensity, and evacuee information, were considered. The proposed system evaluates the quantitative risk in the affected area using a deep neural network surrogate model to determine optimal evacuation routes by integer programming. The surrogate model was trained using data from computational fluid dynamics simulations. A variational autoencoder was applied to extract the geometric features of the affected area. The predicted risk was combined with linearized integer programming to determine the optimal path in a predefined road network. A leak scenario of an ammonia gas pipeline in a petrochemical complex was used for the case study. The results show that the developed model offers the safest route within a few seconds with minimum risk. The developed model was applied to a sensitivity analysis to determine variable influences and safe shelter locations.
暂无评论