Financial information extraction from big financial reports is a tedious task. This paper speaks about page-wise feature generation and applying learning algorithms for identifying financial information (balance sheet...
详细信息
ISBN:
(纸本)9781509036967
Financial information extraction from big financial reports is a tedious task. This paper speaks about page-wise feature generation and applying learning algorithms for identifying financial information (balance sheets, cash flows, and income statements) in Form 10-K or annual reports of companies. Balance sheets, cash flows, and income statements have some structure in them and are semi-structured information. This approach employs selection of unigrams and bigrams based on frequency of occurrence and expert advice, generation of page wise features, and applying learning models for identifying patterns of specific financial information. Different supervised learning models are applied yielding results with very high accuracy (greater than 99%).
Feature selection forms an important aspect of machine learning and character recognition. It is a process of selecting the most important features (attributes) from the dataset. Accurate feature selection results in ...
详细信息
ISBN:
(纸本)9789380544199
Feature selection forms an important aspect of machine learning and character recognition. It is a process of selecting the most important features (attributes) from the dataset. Accurate feature selection results in significant reduction in the number of irrelevant (constant, redundant) attributes thereby, reducing the processing time and increasing the accuracy of the model without any loss of information. This paper focuses on the impact of feature selection and engineering in the classification of handwritten text by identifying and extracting those attributes of the training dataset that will contribute most towards the classification task using classifiers like J48, NaiveBayes and Sequential Minimal Optimization (SMO). This results in improved accuracy of the classifiers as compared to the work reported earlier. Further, a comparative performance evaluation of the classifiers used for OCR and patternrecognition is done. Initial classification performance of all the classifiers listed above was recorded on the raw dataset. Finally, the dataset was transformed after performing relevant feature selection and engineering on its attributes. The same classifiers were again trained on the transformed dataset and their accuracy was recorded. This paper uses the widely used MNIST dataset of handwritten digits for training the classifiers.
The proceedings contain 27 papers. The special focus in this conference is on Learning Algorithms, Architectures and Applications. The topics include: A spiking neural network for personalised modelling of electrogast...
ISBN:
(纸本)9783319461816
The proceedings contain 27 papers. The special focus in this conference is on Learning Algorithms, Architectures and Applications. The topics include: A spiking neural network for personalised modelling of electrogastrography;improving generalization abilities of maximal average margin classifiers;finding small sets of random Fourier features for shift-invariant kernel approximation;incremental construction of low-dimensional data representations;soft-constrained nonparametric density estimation with artificial neural networks;interpretable classifiers in precision medicine;on the evaluation of tensor-based representations for optimum-path forest classification;on the harmony search using quaternions;learning parameters in deep belief networks through firefly algorithm;towards effective classification of imbalanced data with convolutional neural networks;on CPU performance optimization of restricted Boltzmann machine and convolutional RBM;comparing incremental learning strategies for convolutional neural networks;approximation of graph edit distance by means of a utility matrix;time series classification in reservoir- and model-space;objectness scoring and detection proposals in forward-looking sonar images with convolutional neural networks;background categorization for automatic animal detection in aerial videos using neural networks;predictive segmentation using multichannel neural networks in Arabic OCR system;quad-tree based image segmentation and feature extraction to recognize online handwritten bangla characters;using radial basis function neural networks for continuous and discrete pain estimation from bio-physiological signals;emotion recognition in speech with deep learning architectures and on gestures and postural behavior as a modality in ensemble methods.
We present a novel approach for abnormal breast mass classification from digitized mammography images. The proposed framework exploits rotation invariant uniform Local Binary pattern (LBP) as texture feature. These fe...
详细信息
A combinatorial algorithm to find a shortest triangular path (STP) between two points inside a digital object imposed on triangular grid is designed having (equation found) time complexity, n being the number of pixel...
详细信息
The proceedings contain 26 papers. The topics discussed include: a Malay text summarizer using pattern-growth method with sentence compression rules;using concatenation cost for unit selection of homosonic segments in...
ISBN:
(纸本)9781509029549
The proceedings contain 26 papers. The topics discussed include: a Malay text summarizer using pattern-growth method with sentence compression rules;using concatenation cost for unit selection of homosonic segments in concatenative sound synthesis;prosodic breaks on Malay speech corpus: evaluation of pitch, intensity and duration;information extraction: evaluating named entity recognition from classical Malay documents;ontology-based information retrieval for historical documents;graph-based text representation for Malay translated Hadith text;fuzzy logic on reading recommendation system;content-based image retrieval system for marine invertebrates;document level assessment for pairwise system evaluation;and query refinement for ontology information extraction.
The representation of emotion in human brain is an important question in the cognitive neuroscience. However, it was remained unclear to what extent the connections between different brain regions contribute to the em...
详细信息
The representation of emotion in human brain is an important question in the cognitive neuroscience. However, it was remained unclear to what extent the connections between different brain regions contribute to the emotion recognition. The present study mainly focused on the emotion decoding based on the functional connectivity patterns. We designed the experiment and collected the neural activities while participants viewed emotion stimuli using the functional magnetic resonance imaging(f MRI) technology. We constructed the whole-brain functional connectivity patterns for each emotion, and performed emotion classification using multivariate pattern analysis combined with machine learning algorithms. We found that emotions could be successfully decoded from the whole-brain functional connectivity patterns. These results provide new evidence that large-scale functional connectivity patterns contain rich emotion information and contribute to the emotion recognition. Our study extends exist f MRI studies on emotion perception and may further our understanding of how human beings achieve easy and quick recognition of emotions.
The effective factographic information retrieval problem has been investigated in the paper. The primary objective of this paper is to outline factographic information retrieval and new Lemma related to properties of ...
详细信息
ISBN:
(纸本)9781467393799
The effective factographic information retrieval problem has been investigated in the paper. The primary objective of this paper is to outline factographic information retrieval and new Lemma related to properties of factographic information retrieval. The special curriculum has been developed for the set competences. The curriculum contains specialized disciplines. These disciplines use factographic information retrieval. Factographic information retrieval has effectiveness indicators. New Lemma about effectiveness and properties of factographic information retrieval was proved.
The proceedings contain 34 papers. The special focus in this conference is on Combinatorial Tools, Discretization, Discrete Tomography and Combinatorial Topology. The topics include: Convergent geometric estimators wi...
ISBN:
(纸本)9783319323596
The proceedings contain 34 papers. The special focus in this conference is on Combinatorial Tools, Discretization, Discrete Tomography and Combinatorial Topology. The topics include: Convergent geometric estimators with digital volume and surface integrals;discrete calculus, optimisation and inverse problems in imaging;number of words characterizing digital balls on the triangular tiling;generation of digital planes using generalized continued-fractions algorithms;a tomographical interpretation of a sufficient condition on H-graphical sequences;geometrical characterization of the uniqueness regions under special sets of three directions in discrete tomography;a comparison of some methods for direct 2d reconstruction from discrete projected views;homotopic thinning in 2d and 3d cubical complexes based on critical kernels;p-simple points and general-simple deletion rules;two measures for the homology groups of binary volumes;shape classification according to LBP persistence of critical points;signature of a shape based on its pixel coverage representation;computation of the normal vector to a digital plane by sampling significant points;finding shortest triangular path in a digital object;digital surfaces of revolution made simple;on functionality of quadraginta octants of naive sphere with application to circle drawing;encoding specific 3d polyhedral complexes using 3d binary images;interactive curvature tensor visualization on digital surfaces;construction of digital ellipse by recursive integer intervals;digitization of partitions and tessellations;bijective rigid motions of the 2D Cartesian grid and adaptive tangential cover for noisy digital contours.
We build upon the work developed in [4] in which we presented a method to "locally repair" the cubical complex Q(I) associated to a 3D binary image I, to obtain a "well-composed" polyhedral complex...
详细信息
暂无评论