In this paper, a novel Hybrid deep Ensemble (HDE) is proposed for automatic speech disfluency classification on a sparse speech dataset. Categorizations of speech disfluencies for diagnosis of speech disorders have so...
详细信息
In this paper, a novel Hybrid deep Ensemble (HDE) is proposed for automatic speech disfluency classification on a sparse speech dataset. Categorizations of speech disfluencies for diagnosis of speech disorders have so long relied on sophisticated deep learning models. Such a task can be accomplished by a straightforward approach with high accuracy by the proposed model which is an optimal combination of diverse machine learning and deep learning algorithms in a hierarchical arrangement which includes a deep autoencoder that yields the compressed latent features. The proposed model has shown considerable improvement in downgrading processing time overcoming the issues of cumbersome hyper-parameter tuning and huge data demand of the deep learning algorithms with high classification accuracy. Experimental results show that the proposed Hybrid deep Ensemble has superior performance compared to the individual base learners, and the deep neural network as well. The proposed model and the baseline models were evaluated in terms of Cohen's kappa coefficient, Hamming loss, Jaccard score, F-score and classification accuracy.
Graph embedding, also known as network embedding and network representation learning, is a useful technique which helps researchers analyze information networks through embedding a network into a low-dimensional space...
详细信息
Graph embedding, also known as network embedding and network representation learning, is a useful technique which helps researchers analyze information networks through embedding a network into a low-dimensional space. However, existing graph embedding methods are all node-based, which means they can just directly map the nodes of a network to low-dimensional vectors while the edges could only be mapped to vectors indirectly. One important reason is the computational cost, because the number of edges is always far greater than the number of nodes. In this article, considering an important property of social networks, i.e., the network is sparse, and hence the average degree of nodes is bounded, we propose an edge-based graph embedding (edge2vec) method to map the edges in social networks directly to low-dimensional vectors. Edge2vec takes both the local and the global structure information of edges into consideration to preserve structure information of embedded edges as much as possible. To achieve this goal, edge2vec first ingeniously combines the deep autoencoder and Skip-gram model through a well-designed deep neural network. The experimental results on different datasets show edge2vec benefits from the direct mapping in preserving the structure information of edges.
Most existing subspace clustering methods focus on learning a meaningful (e.g., sparse or low-rank) representation of the data. However, they have the following two problems which greatly limit the performance: 1) The...
详细信息
Most existing subspace clustering methods focus on learning a meaningful (e.g., sparse or low-rank) representation of the data. However, they have the following two problems which greatly limit the performance: 1) They neglect the intrinsic local geometrical structures within the data to result in locality preserving property be missing. 2) They neglect the feature learning of the raw data which is usually so complex that the learned representation coefficient is not an optimal graph for clustering. This paper addresses the above problems and proposes a novel nonlinear subspace clustering model via adaptive graph regularized autoencoder (NSC-AGA). This model unifies feature learning, locality preserving, and representation matrix learning into a framework, and a new adaptive graph regularizer is introduced, which takes the representation coefficient matrix as a learnable similarity graph imposed on the Euclidean distance matrix of the deep features. Two matrices interact with each other to make the representation coefficient matrix reflect both the global linear correlation and the local geometric distance relationship. A number of experimental results on the five public image database demonstrate that the proposed NSC-AGA model achieves superior clustering performance compared with the state-of-the-art methods.
As an important category of deep models, deep generative model has attracted more and more attention with the proposal of deep Belief Networks (DBNs) and the fast greedy training algorithm based on restricted Boltzman...
详细信息
As an important category of deep models, deep generative model has attracted more and more attention with the proposal of deep Belief Networks (DBNs) and the fast greedy training algorithm based on restricted Boltzmann machines (RBMs). In the past few years, many different deep generative models are proposed and used in the area of Artificial Intelligence. In this paper, three important deep generative models including DBNs, deep autoencoder, and deep Boltzmann machine are reviewed. In addition, some successful applications of deep generative models in image processing, speech recognition and information retrieval are also introduced and analysed.
We consider the problem of index tracking whose goal is to construct a portfolio that minimizes the tracking error between the returns of a benchmark index and the tracking portfolio. This problem carries significant ...
详细信息
We consider the problem of index tracking whose goal is to construct a portfolio that minimizes the tracking error between the returns of a benchmark index and the tracking portfolio. This problem carries significant importance in financial economics as the tracking portfolio represents a parsimonious index that facilitates a practical means to trade the benchmark index. For this reason, extensive studies from various optimization and machine learning-based approaches have ensued. In this paper, we solve this problem through the latest developments from deep learning. Specifically, we associate a deep latent representation of asset returns, obtained through a stacked autoencoder, with the benchmark index's return to identify the assets for inclusion in the tracking portfolio. Empirical results indicate that to improve the performance of previously proposed deep learning-based index tracking, the deep latent representation needs to be learned in a strictly hierarchical manner and the relationship between the returns of the index and the assets should be quantified by statistical measures. Various deep learning-based strategies have been tested for the stock market indices of the S&P 500, FTSE 100 and HSI, and it is shown that our proposed methodology generates the best index tracking performance.
A priori knowledge-incorporating method based on time resolved fluorescence was successfully developed for the determination of polycyclic aromatic hydrocarbons in edible vegetable oils. Specifically, fluorescence dec...
详细信息
A priori knowledge-incorporating method based on time resolved fluorescence was successfully developed for the determination of polycyclic aromatic hydrocarbons in edible vegetable oils. Specifically, fluorescence decay functions of polycyclic aromatic hydrocarbons at characteristic emission wavelengths were used as the priori models and incorporated into the deep-autoencoder. The priori model-incorporating deep-autoencoder models were shown to be effective for the determination of polycyclic aromatic hydrocarbons in edible vegetable oils and root-mean-square errors of prediction lower than 2% were achieved. The influence of analyte, matrix and proportion of priori model were characterized. Increasing the proportion of priori model appropriately was beneficial to the performance of models and 16% was shown to be the best incorporated proportion.
A model was developed to allow a mobile robot to label the areas of a typical domestic room, using raw sequential visual and motor data, no explicit information on location was provided, and no maps were constructed. ...
详细信息
ISBN:
(纸本)9783319686004;9783319685991
A model was developed to allow a mobile robot to label the areas of a typical domestic room, using raw sequential visual and motor data, no explicit information on location was provided, and no maps were constructed. The model comprised a deep autoencoder and a recurrent neural network. The model was demonstrated to (1) learn to correctly label areas of different shapes and sizes, (2) be capable of adapting to changes in room shape and rearrangement of items in the room, and (3) attribute different labels to the same area, when approached from different angles. Analysis of the internal representations of the model showed that a topological structure corresponding to the room structure was self-organized as the trajectory of the internal activations of the network.
In the field of an unmanned aerial vehicle (UAV), the navigation algorithm with high precision and easy implementation is a hot topic of research, and the key of UAV control is to obtain accurate and real-time attitud...
详细信息
ISBN:
(纸本)9781728103778
In the field of an unmanned aerial vehicle (UAV), the navigation algorithm with high precision and easy implementation is a hot topic of research, and the key of UAV control is to obtain accurate and real-time attitude information. In this paper, a feature fusion algorithm based on unsupervised deep autoencoder (DAE) is proposed. It is used for data fusion of multiple sensors. The experimental results show that the unsupervised feature fusion algorithm can effectively improve the accuracy and has the potential to be applied to the data fusion of UAV sensors.
The insider threat is a significant security concern for both organizations and government sectors. Traditional machine learning-based insider threat detection approaches usually rely on domain focused feature enginee...
详细信息
ISBN:
(纸本)9783030369385;9783030369378
The insider threat is a significant security concern for both organizations and government sectors. Traditional machine learning-based insider threat detection approaches usually rely on domain focused feature engineering, which is expensive and impractical. In this paper, we propose an autoencoder-based approach aiming to automatically learn the discriminative features of the insider behaviours, thus alleviating security experts from tedious inspection tasks. Specifically, a Word2vec model is trained with a corpus transformed from various security logs to generate event representations. Instead of manually selecting Word2vec model parameters, we develop an autoencoder-based "parameter tuner" for the model to produce an optimal feature set. Then, the detection is undertaken by examining the reconstruction error of an autoencoder for each transformed event using the Carnegie Mellon University (CMU) CERT Programs insider threat database. Experimental results demonstrate that our proposed approach could achieve an extremely low false-positive rate (FPR) with all malicious events identified.
In order to increase the autonomy of the modern, high complexity robots with multiple degrees of freedom, it is necessary for them to be able to learn and adapt their skills, for example, using reinforcement learning ...
详细信息
In order to increase the autonomy of the modern, high complexity robots with multiple degrees of freedom, it is necessary for them to be able to learn and adapt their skills, for example, using reinforcement learning (RL). However, RL performance greatly depends on the task dimensionality. Methods for reducing the task dimensionality, such as deep autoencoder neural networks, are often employed. Such neural network based dimensionality reduction approaches require a large example database for training, but obtaining such a database for a real robot is a complex and tedious process. This paper proposes a method of obtaining a database for the training of a deep autoencoder network, which serves for the dimensionality reduction of robot learning, and thus accelerates the robot's ability to adapt to the real world. The presented method is based on a few real-world examples and statistical generalization. A comparison to using a simulated-only database on the use-case of robot throwing shows that the proposed approach achieves better real-world performance.
暂无评论