AbstractParkinson’s disease (PD) is a neurodegenerative disorder that occurs as a result of a decrease in the chemical called dopamine in the brain. There is no definitive treatment for PD, but some medications used ...
详细信息
AbstractParkinson’s disease (PD) is a neurodegenerative disorder that occurs as a result of a decrease in the chemical called dopamine in the brain. There is no definitive treatment for PD, but some medications used to control symptoms in the early stages have a critical effect on the progression of the disease. Approximately 90% of patients with PD have vocal problems, and although voice disorders seen in the early stages are not apparent in the patient’s speech, they can be detected by acoustic analysis. In this study, a decision support system was proposed for the diagnosis of PD utilizing the feature extraction power of autoencoder (AE) & long short-term memory (LSTM) models by using speech signals as input data. Firstly, simple (SAE), convolutional (CAE), and recurrent (RAE) AE models were created for the ablation analysis. Then, the effect of hybridization and deepening of these models with LSTM layers on the classification performance was observed. Within the scope of the study, RAE achieved the highest accuracy among the base models while CAE & LSTM hybrid model provided the highest performance among all models with 95.79% accuracy for PD diagnosis based on audio signals. It was concluded that hybridization of the AE and LSTM models significantly improved the performance of simple and convolutional AE, and deepening the network to a certain extent improves the classification performance according to the type of AE.
The imbalanced data classification is a challenging issue in many domains including medical intelligent diagnosis and fraudulent transaction analysis. The performance of the conventional classifier degrades due to the...
详细信息
The imbalanced data classification is a challenging issue in many domains including medical intelligent diagnosis and fraudulent transaction analysis. The performance of the conventional classifier degrades due to the imbalanced class distribution of the training data set. Recently, machine learning and deep learning techniques are used for imbalanced data classification. Data preprocessing approaches are also suitable for handling class imbalance problem. Data augmentation is one of the preprocessing techniques used to handle skewed class distribution. Synthetic Minority Oversampling Technique (SMOTE) is a promising class balancing approach and it generates noise during the process of creation of synthetic samples. In this paper, autoencoder is used as a noise reduction technique and it reduces the noise generated by SMOTE. Further, Deep one-dimensional Convolutional Neural Network is used for classification. The performance of the proposed method is evaluated and compared with existing approaches using different metrics such as Precision, Recall, Accuracy, Area Under the Curve and Geometric Mean. Ten data sets with imbalance ratio ranging from 1.17 to 577.87 and data set size ranging from 303 to 284807 instances are used in the experiments. The different imbalanced data sets used are Heart-Disease, Mammography, Pima Indian diabetes, Adult, Oil-Spill, Phoneme, Creditcard, BankNoteAuthentication, Balance scale weight & distance database and Yeast data sets. The proposed method shows an accuracy of 96.1%, 96.5%, 87.7%, 87.3%, 95%, 92.4%, 98.4%, 86.1%, 94% and 95.9% respectively. The results suggest that this method outperforms other deep learning methods and machine learning methods with respect to G-mean and other performance metrics.
Unmanned surface vehicle(USV)is currently a hot research topic in maritime communication network(MCN),where denoising and semantic segmentation of maritime images taken by USV have been rarely *** former has recently ...
详细信息
Unmanned surface vehicle(USV)is currently a hot research topic in maritime communication network(MCN),where denoising and semantic segmentation of maritime images taken by USV have been rarely *** former has recently researched on autoencoder model used for image denoising,but the existed models are too complicated to be suitable for real-time detection of *** this paper,we proposed a lightweight autoencoder combined with inception module for maritime image denoising in different noisy environments and explore the effect of different inception modules on the denoising ***,we completed the semantic segmentation task for maritime images taken by USV utilizing the pretrained U-Net model with tuning,and compared them with original U-Net model based on different ***,we compared the semantic segmentation of noised and denoised maritime images respectively to explore the effect of image noise on semantic segmentation *** studies are provided to prove the feasibility of our proposed denoising and segmentation ***,a simple integrated communication system combining image denoising and segmentation for USV is shown.
Abnormal detection plays an important role in video surveillance. LSTM encoder-decoder is used to learn representation of video sequences and applied for detecting abnormal event in complex environment. The learned re...
详细信息
Abnormal detection plays an important role in video surveillance. LSTM encoder-decoder is used to learn representation of video sequences and applied for detecting abnormal event in complex environment. The learned representation of LSTM encoder-decoder is learned from encoder, and it is crucial for decoder. However, LSTM encoder-decoder generally fails to account for the global context of the learned representation with a fixed dimension representation. In this paper, we explore a hybrid autoencoder architecture, which not only extracts better spatio-temporal context, but also improves the extrapolate capability of the corresponding decoder by the shortcut connection. The experiment shows that the hybrid model performs better than the state-of-the-art anomaly detection methods in both qualitative and quantitative ways on benchmark datasets.
Electricity theft is considered one of the most significant reasons of the non technical losses (NTL). It negatively influences the utilities in terms of the power supply quality, grid's safety, and economic loss....
详细信息
Electricity theft is considered one of the most significant reasons of the non technical losses (NTL). It negatively influences the utilities in terms of the power supply quality, grid's safety, and economic loss. Therefore, it is necessary to effectively deal with the electricity theft problem. For detecting electricity theft in smart grids (SGs), an efficient and state-of-the-art approach is designed in the underlying work based on autoencoder and bidirectional gated recurrent unit (AE-BiGRU). The proposed approach consists of six components: (1) data collection, (2) data preparation, (3) data balancing, (4) feature extraction, (5) classification and (6) performance evaluation. Moreover, bidirectional gated recurrent unit (BiGRU) is used for the identification of the anomalies in electricity consumption (EC) patterns caused due to factors like family formation changes, holidays, parties, and so on, which are referred as non-theft factors. The proposed autoencoder-bidirectional gated recurrent unit (AE-BiGRU) model employs the EC data acquired from state grid corporation of China (SGCC) for simulations. Furthermore, it is visualized from the simulation results that 90.1% accuracy and 10.2% false positive rate (FPR) are obtained by the proposed model. The results are better than different existing classifiers, i.e., logistic regression (LR), decision tree (DT), extreme gradient boosting (XGBoost), gated recurrent unit (GRU), etc.
As a powerful soft computing tool, fuzzy cognitive maps (FCMs) have been successfully employed for time-series modeling and forecasting problems. However, both the rapid time variation and the trends are still open pr...
详细信息
As a powerful soft computing tool, fuzzy cognitive maps (FCMs) have been successfully employed for time-series modeling and forecasting problems. However, both the rapid time variation and the trends are still open problems when processing univariate non-stationary time-series forecasting problems via FCM-based models. In this paper, we propose a time-series forecasting model by composing FCMs, gated recurrent unit network (GRU), and autoencoder network (AE). The model is termed GAE-FCM. Firstly, a scheme based on gated recurrent unit networks and autoencoder networks is designed to learn the potential representations and capture the long-term trend of non-stationary time series while decomposing these univariate time series into a group of multivariate feature vectors. Then, the obtained multivariate feature vectors are modeled as a fuzzy cognitive map in which quantifying its connection matrix is regarded as a convex optimization problem. Finally, the time-series trend is predicted by the optimized fuzzy cognitive map and corresponding modeling mechanism. The performance of the proposed model has been validated by comparison with several representative methods on five non-stationary time-series datasets.
To enhance the accuracy of identifying water sources in mine inrush incidents, this study, taking the Shengquan coal mine in Shandong, China, as a case study, proposes a novel water source identification model based o...
详细信息
To enhance the accuracy of identifying water sources in mine inrush incidents, this study, taking the Shengquan coal mine in Shandong, China, as a case study, proposes a novel water source identification model based on an improved autoencoder-the "Masked autoencoder-based Classifier" model. This model, through a unique autoencoder framework and a custom 'masked_loss' loss function, achieves semi-supervised learning and dimensionality reduction of groundwater sample ionic data. By configuring the hidden layers, the classifier component of the model directly receives data processed by the encoder component. This not only improves the model's performance but also optimizes its complexity. Through an evaluation of the model's fitting effectiveness, our model achieved an average accuracy of 88.8% across 20 runs, with precision, recall, F1-score, and MCC reaching 88.1%, 80.6%, 0.827, and 0.833, respectively, significantly outperforming other classic models. The model successfully identified the sources of three sets of inrush water samples, with a high number of successful runs and clear average probabilities. This work contributes not only to the field of mine water inrush source identification but also offers a new perspective for the broader field of machine learning.
Cross-modal retrieval has gained much attention in recent years. As the research mainstream, most of existing approaches learn projections for data from different modalities into a common space where data can be compa...
详细信息
Cross-modal retrieval has gained much attention in recent years. As the research mainstream, most of existing approaches learn projections for data from different modalities into a common space where data can be compared directly. However, they neglect the preservation of feature and semantic information, so they are unable to obtain satisfactory results as expected. In this paper, we propose a two-stage learning method to learn multi-modal mappings that project multi-modal data to low dimensional embeddings that preserve both feature and semantic information. In the first stage, we combine both low-level feature and high-level semantic information to learn feature-aware semantic code vectors. In the second stage, we use encoder-decoder paradigm to learn projections. The encoder projects feature vectors to code vectors, and the decoder projects code vectors back to feature vectors. The encoder-decoder paradigm guarantees the embeddings to preserve both feature and semantic information. An alternating minimization procedure is developed to solve the multi-modal semantic autoencoder optimization problem. Extensive experiments on three benchmark datasets demonstrate that the proposed method outperforms state-of-the-art cross-modal retrieval methods. (C) 2018 Elsevier B.V. All rights reserved.
As one of the key operations in Wireless Sensor Networks(WSNs), the energy-efficient data collection schemes have been actively explored in the literature. However, the transform basis for sparsifing the sensed data i...
详细信息
As one of the key operations in Wireless Sensor Networks(WSNs), the energy-efficient data collection schemes have been actively explored in the literature. However, the transform basis for sparsifing the sensed data is usually chosen empirically, and the transformed results are not always the sparsest. In this paper, we propose a Data Collection scheme based on Denoising autoencoder(DCDA) to solve the above problem. In the data training phase, a Denoising autoencoder(DAE) is trained to compute the data measurement matrix and the data reconstruction matrix using the historical sensed data. Then, in the data collection phase, the sensed data of whole network are collected along a data collection tree. The data measurement matrix is utilized to compress the sensed data in each sensor node, and the data reconstruction matrix is utilized to reconstruct the original data in the ***, the data communication performance and data reconstruction performance of the proposed scheme are evaluated and compared with those of existing schemes using real-world sensed data. The experimental results show that compared to its counterparts, the proposed scheme results in a higher data compression rate, lower energy consumption, more accurate data reconstruction, and faster data reconstruction speed.
Hyperspectral (HS) data comprises of continuous spectral responses of hundreds of narrow spectral bands with very fine spectral resolution or bandwidth, which offer feature identification and classification with high ...
详细信息
Hyperspectral (HS) data comprises of continuous spectral responses of hundreds of narrow spectral bands with very fine spectral resolution or bandwidth, which offer feature identification and classification with high accuracy. In the present study, Mutual Information (MI) based Segmented Stacked autoencoder (S-SAE) approach for spectral-spatial classification of the HS data is proposed to reduce the complexity and computational time compared to Stacked autoencoder (SAE) based feature extraction. A non-parametric dependency measure (MI) based spectral segmentation is proposed instead of linear and parametric dependency measure to take care of both linear and nonlinear inter-band dependency for spectral segmentation of the HS bands. Then morphological profiles are created corresponding to segmented spectral features to assimilate the spatial information in the spectral-spatial classification approach. Two non-parametric classifiers, Support Vector Machine (SVM) with Gaussian kernel and Random Forest (RF) are used for classification of the three most popularly used HS datasets. Results of the numerical experiments carried out in this study have shown that SVM with a Gaussian kernel is providing better results for the Pavia University and Botswana datasets whereas RF is performing better for Indian Pines dataset. The experiments performed with the proposed methodology provide encouraging results compared to numerous existing approaches. (C) 2018 International Society for Photogrammetry and Remote Sensing, Inc. (ISPRS). Published by Elsevier B.V. All rights reserved.
暂无评论