Since the works by Specht, the probabilistic neuralnetworks (PNNs) have attracted researchers due to their ability to increase training speed and their equivalence to the optimal Bayesian decision of classification t...
详细信息
Deep neuralnetworks have become a veritable alternative to classic speaker recognition and clustering methods in recent years. However, while the speech signal clearly is a time series, and despite the body of litera...
详细信息
ISBN:
(纸本)9783319999784;9783319999777
Deep neuralnetworks have become a veritable alternative to classic speaker recognition and clustering methods in recent years. However, while the speech signal clearly is a time series, and despite the body of literature on the benefits of prosodic (suprasegmental) features, identifying voices has usually not been approached with sequence learning methods. Only recently has a recurrent neural network (RNN) been successfully applied to this task, while the use of convolutional neuralnetworks (CNNs) (that are not able to capture arbitrary time dependencies, unlike RNNs) still prevails. In this paper, we show the effectiveness of RNNs for speaker recognition by improving state of the art speaker clustering performance and robustness on the classic TIMIT benchmark. We provide arguments why RNNs are superior by experimentally showing a "sweet spot" of the segment length for successfully capturing prosodic information that has been theoretically predicted in previous work.
The hypothesis is that in the lowest hidden layers of biological systems "local subnetworks" are smoothing an input signal. The smoothing accuracy may serve as a feature to feed the subsequent layers of the ...
详细信息
In this work we investigate the use of deep neuralnetworks for object detection in floor plan images. Object detection is important for understanding floor plans and is a preliminary step for their conversion into ot...
详细信息
ISBN:
(纸本)9783319999784;9783319999777
In this work we investigate the use of deep neuralnetworks for object detection in floor plan images. Object detection is important for understanding floor plans and is a preliminary step for their conversion into other representations. In particular, we evaluate the use of object detection architectures, originally designed and trained to recognize objects in images, for recognizing furniture objects as well as doors and windows in floor plans. Even if the problem is somehow easier than the original one in the case of this research the datasets available are extremely small and therefore the training of deep architectures can be problematic. In addition to the use of object detection architectures for floor plan images, another contribution of this paper is the creation of two datasets that have been used for performing the experiments covering different types of floor plans with different peculiarities.
The gray-scale morphological Hit-or-Miss transform is theoretically invariant to vertical translation of the input function, which is analogous to gray-value shift of the input images. Designing optimal structuring el...
详细信息
Deep neuralnetworks (DNNs) became very popular for learning abstract high-level representations from raw data. This lead to improvements in several classification tasks including emotion recognition in speech. Beside...
详细信息
ISBN:
(纸本)9783319461823;9783319461816
Deep neuralnetworks (DNNs) became very popular for learning abstract high-level representations from raw data. This lead to improvements in several classification tasks including emotion recognition in speech. Besides the use as feature learner a DNN can also be used as classifier. In any case it is a challenge to determine the number of hidden layers and neurons in each layer for such networks. In this work the architecture of a DNN is determined by a restricted grid-search with the aim to recognize emotion in human speech. Because speech signals are essentially time series the data will be transformed in an appropriate format to use it as input for deep feed forwardneuralnetworks without losing much time dependent information. Furthermore the Elman-Net will be examined. The results shows that by maintaining time dependent information in the data better classification accuracies can be achieved with deep architectures.
This paper introduces a new class of sign-based training algorithms for neuralnetworks that combine the sign-based updates of the Rprop algorithm with the composite nonlinear Jacobi method. The theoretical foundation...
详细信息
This paper introduces a new class of sign-based training algorithms for neuralnetworks that combine the sign-based updates of the Rprop algorithm with the composite nonlinear Jacobi method. The theoretical foundations of the class are described and a heuristic Rprop-based Jacobi algorithm is empirically investigated through simulation experiments in benchmark pattern classification problems. Numerical evidence shows that this new modification of the Rprop algorithm exhibits improved learning speed in all cases tested, and compares favorably against the Rprop and a recently proposed modification, the improved Rprop. (c) 2005 Elsevier B.V. All rights reserved.
This paper compares the performance of multilayer perceptron (MLP) networks trained with conventional bipolar target vectors (CBVs) and orthogonal bipolar new target vectors (OBVs) for biometric patternrecognition. T...
详细信息
In the context of machine learning on graph data, graph deep learning has captured the attention of many researcher. Due to the promising results of deep learning models in the most diverse fields of application, grea...
详细信息
ISBN:
(纸本)9783031206498;9783031206504
In the context of machine learning on graph data, graph deep learning has captured the attention of many researcher. Due to the promising results of deep learning models in the most diverse fields of application, great efforts have been made to replicate these successes when dealing with graph data. In this work, we propose a novel approach for processing graphs, with the intention of exploiting the already established capabilities of Convolutional neuralnetworks (CNNs) in image processing. To this end we propose a new representation for graphs, called GrapHisto, in the form of unique tensors encapsulating the features of any given graph to then process the new data using the CNN paradigm.
Kernel based clustering methods allow to unsupervised partition samples in feature space but have a. quadratic computation time O(n(2)) where n are the number of samples. Therefore these methods are generally ineligib...
详细信息
ISBN:
(纸本)9783642121586
Kernel based clustering methods allow to unsupervised partition samples in feature space but have a. quadratic computation time O(n(2)) where n are the number of samples. Therefore these methods are generally ineligible for large datasets. In this paper we propose a meta-algorithm that performs parallelized clusterings of subsets of the samples and merges them repeatedly. The algorithm is able to use many Kernel based clustering methods where we mainly emphasize on Kernel Fuzzy C-Means and Relational neural Gas. We show that the computation time of this algorithm is basicly linear, i.e. O (n). Further we statistically evaluate the performance of this meta-algorithm on a real-life dataset, namely the Enron Emails.
暂无评论