The development of modern manufacturing has raised greater demands on the accuracy, response speed, and operating cost of industrial accident warnings. Compared to conventional contact sensors, surveillance cameras ca...
详细信息
The development of modern manufacturing has raised greater demands on the accuracy, response speed, and operating cost of industrial accident warnings. Compared to conventional contact sensors, surveillance cameras can contactlessly capture spatial-temporal information of the open workspace with stable data quality, widely used in industrial process monitoring. However, due to the scarcity of industrial video datasets and the rarity and diversity of abnormal events, existing video -based anomaly detection models perform poorly in manufacturing scenarios. In this regard, we collect two datasets from typical industrial sites and propose a memory -enhanced spatial-temporal encoding (MSTE) framework for automatic industrial anomaly detection. The proposed MSTE framework learns spatial and temporal normality as well as spatial-temporal correlations with parallel structures and simultaneously measures deviations in appearance, motion, and consistency to respond to complex industrial anomalies accurately. Experimental results on public benchmarks and realworld industrial videos show that our method outperforms existing methods and achieves accurate temporal localization of various spatial-temporal anomalies, which helps to improve the safety and reliability of intelligent manufacturing.
Modern power machinery is inherently complex and operates under dynamic operating conditions, so they demand advanced solutions based on deep learning to diagnose bearing faults inside rotating equipment that cause un...
详细信息
Modern power machinery is inherently complex and operates under dynamic operating conditions, so they demand advanced solutions based on deep learning to diagnose bearing faults inside rotating equipment that cause unplanned downtime and safety issues, leading to operational challenges. However, most deep learning approaches aim to improve performance by incorporating hybrid neural networks that rely on multiple convolutional and temporal units, often overlooking optimizing the large number of hyperparameters that define the structure and performance of hybrid models along with the associated computational constraints. To address this gap, this study presents an innovative approach for the detection and classification of bearing faults by integrating an optimized sparse deep autoencoder (DAE) with a Bidirectional Long Short-Term Memory model (BiLSTM). The optimal network structure and hyperparameters are determined through Bayesian optimization (BO) with parallel settings, which automatically searches for network configurations that improve the feature extraction ability of the DAE and the generalization ability of the Bi-LSTM for more efficient fault classification in rolling bearings. Parallel optimization accelerates network structure and hyperparameter tuning by evaluating multiple configurations at once. It leverages the full potential of available multi-core Central Processing Units (CPUs)/Graphics Processing Units (GPUs) in conjunction with a lightweight BO surrogate model. This autonomous and user-friendly framework generates inputs from principal component analysis for linear and BO-DAE for non-linear feature extraction and selection, which are then used to train a BO-enhanced Bi-LSTM. This threestage optimized method effectively captures spatial and temporal dependencies in vibrational signals, achieving superior efficiency, accuracy, and reliability compared to shallow and deep learning models. Evaluation metrics, including macro precision (99.50 %), reca
Cybersecurity has become a key component of national strategy in recent years. Traditional cybersecurity technology such as network traffic-based intrusion detection and threatening intelligence sensing are designed t...
详细信息
Cybersecurity has become a key component of national strategy in recent years. Traditional cybersecurity technology such as network traffic-based intrusion detection and threatening intelligence sensing are designed to focus on the traffic features of network, which are no doubt effective defense technologies. However, these methods required decent amount of domain knowledge and massive training data, which brought a significant barrier for cybersecurity research. In this work, we propose a novel residual autoencoder and support vector machine combined approach (RAE-SVM) for webpage tamper-resistant detection using high-level webpage image features. This method, inspired by the Chinese proverb "mend the fold after the sheep have been stolen." The web crawler technology is used for website screenshot within limited domain names, and input them into autoencoder architecture and SVM for feature extraction and invaded webpage detection. This method combines the advantages of deep residual network, convolutional autoencoder and SVM, and the interdisciplinary intersection between cybersecurity and high-level image features. The experimental results demonstrate that the proposed method achieves an accuracy of 95%, significantly higher than other models, which proves the validity of the proposed method.
Mineral potential mapping (MPM) can recognise irregular patterns of mineralization-related indicator features and proxies. It serves as an anomaly detection technique, given that mineralization itself is a rare geolog...
详细信息
Mineral potential mapping (MPM) can recognise irregular patterns of mineralization-related indicator features and proxies. It serves as an anomaly detection technique, given that mineralization itself is a rare geological event. In this regard, unsupervised anomaly detection (UAD) algorithms could be effective in identifying high potential zones of mineralization accounting for irregular pattern recognition. The main advantage of these algorithms lies in their ability toexploit geo-datasets without requiring any form of annotation. In this study, eight evidence layers were first created based on the conceptual model of mineral deposits to build a model of Fe prospectivity in the Esfordi region of Yazd province, located in east-central Iran. Then, three unsupervised anomaly detection algorithms, namely deep autoencoder (DAE), one-class support vector machine (OC-SVM), and isolation forest (IForest) were employed to assess Fe prospectivity in the area. The prediction-area (P-A) plot was subsequently used to evaluate the efficacy of the three prospectivity models. Finding indicate that the deep autoencoder outperforms the other adopted machine learning methods in identifying high potential areas of Fe mineralization. Considering the significance of hyperparameters in the implementation of these algorithms, we also investigate the application of the P-A plot to identify optimal hyperparameter values, thereby enhancing the performance of the Fe prospectivity model. The results demonstrate that in IForest and DAE, and to some extent OC-SVM, experts can adjust hyperparameters without relying on labelled data, achieving a commendable level of detection performance. This innovative approach and workflow are broadly applicable to regional-scale mineral exploration across diverse metallogenic provinces globally.
In the current network environment, the original network data present the characteristics of multiple features and high dimensions. Furthermore, in the existing element extraction methods based on deep neural networks...
详细信息
In the current network environment, the original network data present the characteristics of multiple features and high dimensions. Furthermore, in the existing element extraction methods based on deep neural networks, the loss of feature information layer by layer increases as the data dimensions decrease, which greatly affects the retention of network data information elements and brings a huge challenge to effective network security protection. This paper refers to a residual neural network to improve the deep autoencoder (DAE) and then utilizes them to propose a novel element extraction method named the layer-by-layer loss compensation deep autoencoder (LC-DAE) based on the sparrow search algorithm (SSA). In the proposed method, a loss compensation module is added to each encoding layer of the DAE. Specifically, this module first restores the data by using the decoding layer corresponding to the encoding layer. Then, the loss value of the calculated characteristic information is compensated using the output of the corresponding encoding layer. Subsequently, the SSA is used to optimize the LC-DAE in the training process. Finally, the experimental results show that compared with the existing methods, this method retains more sufficient element information, and significantly improves the classification performance of data using neural networks.
Machine learning techniques for network-based intrusion detection often assume that network traffic does not change over time or that model updates can be easily performed. This paper proposes a novel, reminiscent int...
详细信息
ISBN:
(纸本)9781728181042
Machine learning techniques for network-based intrusion detection often assume that network traffic does not change over time or that model updates can be easily performed. This paper proposes a novel, reminiscent intrusion detection model based on deep autoencoders and transfer learning to ease the model update burden in a twofold implementation. First, a deep autoencoder is used as an additional feature extraction stage to obtain a historical feature representation of network traffic. Second, at model updates, the deep autoencoder parameters are updated through a transfer learning procedure, thus, significantly decreasing the amount of needed labeled training data and the computational costs. Experiments performed on a 8TB dataset containing real and valid network traffic ranging for one year have shown that approaches in the literature cannot handle with the network traffic behavior changes over time, requiring impractical amounts of labeled data to be provided during model training tasks. In addition, if no model updates are performed, the proposed scheme can improve the true-negative rate by up to 23.9%. If done so, it can provide similar accuracy rates of traditional techniques while demanding only 22% of labeled training data and 28% of computational costs.
Objective: Autism spectrum disorder (ASD) affects nearly 1 in 44 children younger than 8 years old in the United States, and the situation may be even worse in remote areas of the world. However, it is difficult to ut...
详细信息
Objective: Autism spectrum disorder (ASD) affects nearly 1 in 44 children younger than 8 years old in the United States, and the situation may be even worse in remote areas of the world. However, it is difficult to utilize existing approaches to screen patients with ASD in remote areas due to the lack of professionals and high-tech instruments. Therefore, we develop a fast and accurate scalable method for screening children with ASD. Methods: A deep weakly supervised artificial intelligence model is proposed for ASD screening based on the dynamic viewing patterns (DVP) over viewing time and visual stimuli. In training, we utilized a long short-term memory (LSTM) network to learn the mapping between the autoencoder-based encoded dynamic patterns and the labels. In testing, we fed the encoded DVP of each undiagnosed child into the trained network and predicted the diagnosis category based on the score on all stimuli. Results: Based on the multi-center evaluation on 165 subjects (95 typically developing children and 70 children with ASD) aged 3-6 years from different areas of China, our method achieves an average recognition accuracy of 96.73% (sensitivity 96.85% and specificity 96.63%). Conclusion: The DVP is a discriminating attribute to identify the atypical performance of ASD. The DVP-based model is an effective platform for enhancing auxiliary ASD screening accuracy. Significance: We validated the importance of dynamic information on between-group differences and classification. Additionally, the evaluation results suggest that the proposed model can provide an objective and accessible tool for scalable ASD screening applications.
Crowds often appear in surveillance videos in public places, from which anomaly detection is of great importance to public safety. Since the abnormal cases are rare, variable and unpredictable, autoencoders with encod...
详细信息
Crowds often appear in surveillance videos in public places, from which anomaly detection is of great importance to public safety. Since the abnormal cases are rare, variable and unpredictable, autoencoders with encoder and decoder structures using only normal samples have become a hot topic among various approaches for anomaly detection. However, since autoencoders have excessive generalization ability, they can sometimes still reconstruct abnormal cases very well. Recently, some researchers construct memory modules under normal conditions and use these normal memory items to reconstruct test samples during inference to increase the reconstruction errors for anomalies. However, in practice, the errors of reconstructing normal samples with the memory items often increase as well, which makes it still difficult to distinguish between normal and abnormal cases. In addition, the memory-based autoencoder is usually available only in the specific scene where the memory module is constructed and almost loses the prospect of cross-scene applications. We mitigate the overgeneralization of autoencoders from a different perspective, namely, by reducing the prediction errors for normal cases rather than increasing the prediction errors for abnormal cases. To this end, we propose an autoencoder based on hybrid attention and motion constraint for anomaly detection. The hybrid attention includes the channel attention used in the encoding process and spatial attention added to the skip connection between the encoder and decoder. The hybrid attention is introduced to reduce the weight of the feature channels and regions representing the background in the feature matrix, which makes the autoencoder features more focused on optimizing the representation of the normal targets during training. Furthermore, we introduce motion constraint to improve the autoencoder's ability to predict normal activities in crowded scenes. We conduct experiments on real-world surveillance videos, UCSD, CU
Discriminative dictionary learning has been extensively used for pattern classification tasks. By incorporating different kinds of label information into the dictionary learning framework, a dictionary can be attained...
详细信息
Discriminative dictionary learning has been extensively used for pattern classification tasks. By incorporating different kinds of label information into the dictionary learning framework, a dictionary can be attained that represents the original signal with discriminative reconstruction. The previous works learn the dictionary in the original space which limits the dictionary learning performance. In this paper, we propose an approach, namely deep Discriminative Dictionary Pair Learning ((DPL)-P-3) for image classification. The input of (DPL)-P-3 is not the matrix collected by original gray images or hand-crafted features but the relatively deeper features derived from autoencoders. Then, a structured dictionary is designed based on the discriminative contributions across different classes to reconstruct the deep feature. In addition, the associated structured projective dictionary is learned as well to guarantee the decoders updating towards the minimal error of deconvolution operator. By leveraging the discriminative-dictionary-learning-based loss function and the autoencoder loss function, (DPL)-P-3 can simultaneously learn the deep potential feature and the corresponding dictionary pair. In the testing phase of (DPL)-P-3, the minimum error between the deep feature and the structured projective component with regard to different classes can directly indicate the label by a basic matrix multiplication operation. Experimental results on challenging Extended Yale B, AR, UMIST, COIL20, Scene 15, and Caltech101 datasets demonstrate that the proposed (DPL)-P-3 outperforms the prominent dictionary learning methods.
Prediction of Student performance through a machine predicts a student's future success. It can be considered an essential procedure to determine the students' academic excellence and identify them at high ris...
详细信息
Prediction of Student performance through a machine predicts a student's future success. It can be considered an essential procedure to determine the students' academic excellence and identify them at high risk for academic performance. Prediction of student performance also provides universities with a high reputation and ranking. The evaluation of 'What students can do with their learning' is still a tedious task. There are many challenging factors to solve this problem, mainly owing to the enormous amount of data collected from students. Most of the research works have focused on developing new methodologies for student performance prediction. But all the existing work has some performance limitations. Here, a new model called transient search capsule network based on the deep autoencoder (TSCNDE) is introduced to detect student performance. The TSCNDE method is implemented with the help of the PYTHON tool. The performance prediction process has been completed with the help of the OULA dataset. The obtained results are assessed on accuracy (99.2%), precision, (99.8%), specificity (98.7%), and sensitivity (98.9%) parameters. The results obtained showed that the TSCNDE method is about 99.2% more accurate than the other related method. Also, the obtained results are compared with some existing deep learning and machine learning methods.
暂无评论