the proceedings contain 27 papers. the special focus in this conference is on Learning Algorithms, Architectures and Applications. the topics include: A spiking neural network for personalised modelling of electrogast...
ISBN:
(纸本)9783319461816
the proceedings contain 27 papers. the special focus in this conference is on Learning Algorithms, Architectures and Applications. the topics include: A spiking neural network for personalised modelling of electrogastrography;improving generalization abilities of maximal average margin classifiers;finding small sets of random Fourier features for shift-invariant kernel approximation;incremental construction of low-dimensional data representations;soft-constrained nonparametric density estimation withartificialneuralnetworks;interpretable classifiers in precision medicine;on the evaluation of tensor-based representations for optimum-path forest classification;on the harmony search using quaternions;learning parameters in deep belief networksthrough firefly algorithm;towards effective classification of imbalanced data with convolutional neuralnetworks;on CPU performance optimization of restricted Boltzmann machine and convolutional RBM;comparing incremental learning strategies for convolutional neuralnetworks;approximation of graph edit distance by means of a utility matrix;time series classification in reservoir- and model-space;objectness scoring and detection proposals in forward-looking sonar images with convolutional neuralnetworks;background categorization for automatic animal detection in aerial videos using neuralnetworks;predictive segmentation using multichannel neuralnetworks in Arabic OCR system;quad-tree based image segmentation and feature extraction to recognize online handwritten bangla characters;using radial basis function neuralnetworks for continuous and discrete pain estimation from bio-physiological signals;emotion recognition in speech with deep learning architectures and on gestures and postural behavior as a modality in ensemble methods.
EGG records the resultant body surface potential of gastric slow waves (electrical activity);while slow waves regulate contractions of gastric muscles, it is the electrical activity we are recording, not movement (lik...
详细信息
ISBN:
(纸本)9783319461823;9783319461816
EGG records the resultant body surface potential of gastric slow waves (electrical activity);while slow waves regulate contractions of gastric muscles, it is the electrical activity we are recording, not movement (like ECG records the cardiac electrical activity, but not the contractions of the heart, even the two are essentially related).
In the last decade, Convolutional neuralnetworks (CNNs) have shown to perform incredibly well in many computer vision tasks such as object recognition and object detection, being able to extract meaningful high-level...
详细信息
ISBN:
(纸本)9783319461823;9783319461816
In the last decade, Convolutional neuralnetworks (CNNs) have shown to perform incredibly well in many computer vision tasks such as object recognition and object detection, being able to extract meaningful high-level invariant features. However, partly because of their complex training and tricky hyper-parameters tuning, CNNs have been scarcely studied in the context of incremental learning where data are available in consecutive batches and retraining the model from scratch is unfeasible. In this work we compare different incremental learning strategies for CNN based architectures, targeting real-word applications.
In this work we present extensions for Radial Basis Function networks to improve their ability for discrete and continuous pain intensity estimation. Besides proposing a mid-level fusion scheme, the use of standardiza...
详细信息
ISBN:
(纸本)9783319461823;9783319461816
In this work we present extensions for Radial Basis Function networks to improve their ability for discrete and continuous pain intensity estimation. Besides proposing a mid-level fusion scheme, the use of standardization and unconventional loss functions are covered. We show that RBF networks can be improved in this way and present extensive experimental validation to support our findings on a multi-modal dataset.
this article offers an open vocabulary Arabic text recognition system using two neuralnetworks, one for segmentation and another one for characters recognition. the problem of words segmentation in Arabic language, l...
详细信息
ISBN:
(纸本)9783319461823;9783319461816
this article offers an open vocabulary Arabic text recognition system using two neuralnetworks, one for segmentation and another one for characters recognition. the problem of words segmentation in Arabic language, like many cursive languages, presents a challenge to the OCR systems. this paper presents a multichannel neural network to solve offline segmentation of machine-printed Arabic documents. the segmented characters are then used as input to a convolutional neural network for Arabic characters recognition. the accuracy of the segmentation model using one font is 98.9%, while four-font model showed 95.5% accuracy. the accuracy of characters recognition on Arabic Transparent font of size 18 pt from APTI data set is 94.8%.
the estimation of probability density functions (pdf) from unlabeled data samples is a relevant (and, still open) issue in patternrecognition and machine learning. Statistical parametric and nonparametric approaches ...
详细信息
ISBN:
(纸本)9783319461823;9783319461816
the estimation of probability density functions (pdf) from unlabeled data samples is a relevant (and, still open) issue in patternrecognition and machine learning. Statistical parametric and nonparametric approaches present severe drawbacks. Only a few instances of neuralnetworks for pdf estimation are found in the literature, due to the intrinsic difficulty of unsupervised learning under the necessary integral-equals-one constraint. In turn, also such neuralnetworks do suffer from serious limitations. the paper introduces a soft-constrained algorithm for training a multilayer perceptron (MLP) to estimate pdfs empirically. A variant of the Metropolis-Hastings algorithm (exploiting the very probabilistic nature of the MLP) is used to satisfy numerically the constraint on the integral of the function learned by the MLP. the preliminary outcomes of a simulation on data drawn from a mixture of Fisher-Tippett pdfs are reported on, and compared graphically withthe estimates yielded by statistical techniques, showing the viability of the approach.
Deep neuralnetworks (DNNs) became very popular for learning abstract high-level representations from raw data. this lead to improvements in several classification tasks including emotion recognition in speech. Beside...
详细信息
ISBN:
(纸本)9783319461823;9783319461816
Deep neuralnetworks (DNNs) became very popular for learning abstract high-level representations from raw data. this lead to improvements in several classification tasks including emotion recognition in speech. Besides the use as feature learner a DNN can also be used as classifier. In any case it is a challenge to determine the number of hidden layers and neurons in each layer for such networks. In this work the architecture of a DNN is determined by a restricted grid-search withthe aim to recognize emotion in human speech. Because speech signals are essentially time series the data will be transformed in an appropriate format to use it as input for deep feed forward neuralnetworks without losing much time dependent information. Furthermore the Elman-Net will be examined. the results shows that by maintaining time dependent information in the data better classification accuracies can be achieved with deep architectures.
Class imbalance in machine learning is a problem often found with real-world data, where data from one class clearly dominates the dataset. Most neural network classifiers fail to learn to classify such datasets corre...
详细信息
ISBN:
(纸本)9783319461823;9783319461816
Class imbalance in machine learning is a problem often found with real-world data, where data from one class clearly dominates the dataset. Most neural network classifiers fail to learn to classify such datasets correctly if class-to-class separability is poor due to a strong bias towards the majority class. In this paper we present an algorithmic solution, integrating different methods into a novel approach using a class-to-class separability score, to increase performance on poorly separable, imbalanced datasets using Cost Sensitive neuralnetworks. We compare different cost functions and methods that can be used for training Convolutional neuralnetworks on a highly imbalanced dataset of multi-channel time series data. Results show that, despite being imbalanced and poorly separable, performance metrics such as G-Mean as high as 92.8% could be reached by using cost sensitive Convolutional neuralnetworks to detect patterns and correctly classify time series from 3 different datasets.
Knowledge about the users emotional state is important to achieve human like, natural HCI in modern technical systems. Humans rely on body gestures and posture when communicating. We investigate the relation between g...
详细信息
ISBN:
(纸本)9783319461823;9783319461816
Knowledge about the users emotional state is important to achieve human like, natural HCI in modern technical systems. Humans rely on body gestures and posture when communicating. We investigate the relation between gestures and human emotion, specifically when completing tasks. the main focus of this work lies on discriminating between mental overload and mental underload, which can e.g. be useful in an e-tutorial system. Mental underload is a new term used to describe the state a person is in when completing a dull or boring task. It will be shown how to select suited features, such as gestures, movement and postural behavior. Furthermore those features will be investigated regarding their discriminative power. After features are selected, a multiple classifier system will be designed, trained and evaluated.
Tensor-based representations have been widely pursued in the last years due to the increasing number of high-dimensional datasets, which might be better described by the multilinear algebra. In this paper, we introduc...
详细信息
ISBN:
(纸本)9783319461823;9783319461816
Tensor-based representations have been widely pursued in the last years due to the increasing number of high-dimensional datasets, which might be better described by the multilinear algebra. In this paper, we introduced a recent patternrecognition technique called Optimum-Path Forest (OPF) in the context of tensor-oriented applications, as well as we evaluated its robustness to space transformations using Multilinear Principal Component Analysis in both face and human action recognition tasks considering image and video datasets. We have shown OPF can obtain more accurate recognition rates in some situations when working on tensor-oriented feature spaces.
暂无评论