In the field of Document Image Analysis (DIA), it is common to find great heterogeneity in terms of the possible graphic domains. In this sense, it is interesting to build neural models that can be sequentially adapte...
详细信息
ISBN:
(数字)9781665490627
ISBN:
(纸本)9781665490627
In the field of Document Image Analysis (DIA), it is common to find great heterogeneity in terms of the possible graphic domains. In this sense, it is interesting to build neural models that can be sequentially adapted to new domains without losing the knowledge from the domains already learned. this learning paradigm is known as Continual (or Lifelong) Learning (CL). Although the adaptation comes along with a training set of the new domain, neural networks suffer what is known as "catastrophic forgetting". therefore, assuming the constraint of not keeping data from the domains already addressed, this paradigm represents a challenge yet to be solved. this work presents an approach for CL in document image binarization, one of the most considered tasks within the DIA field. Our results report that it is indeed feasible to address CL in this field, given that the approach is successfully implemented and outperforms the baseline by a wide margin in most of the analyzed scenarios.
this paper presents a novel classification framework combining Biomimetic patternrecognition (BPR) with Sparse Representation (SR) for Brain Computer Interface based on motor imagery. this framework can work well whe...
详细信息
ISBN:
(纸本)9783642318368
this paper presents a novel classification framework combining Biomimetic patternrecognition (BPR) with Sparse Representation (SR) for Brain Computer Interface based on motor imagery. this framework can work well when encountering the overlap coverage problem of BPR by introducing the idea of SR. Using Common Spatial pattern to extract the rhythm features of EEG data, we evaluate the performance of the proposed method in the datasets from previous BCI Competitions. By making comparison withthose of LDA, SVM and original BPR, our proposed method shows the better classification accuracy.
this paper addresses the problem of end-to-end self-supervised forecasting of depth and ego motion. Given a sequence of raw images, the aim is to forecast boththe geometry and ego-motion using a self supervised photo...
详细信息
ISBN:
(数字)9781665490627
ISBN:
(纸本)9781665490627
this paper addresses the problem of end-to-end self-supervised forecasting of depth and ego motion. Given a sequence of raw images, the aim is to forecast boththe geometry and ego-motion using a self supervised photometric loss. the architecture is designed using both convolution and transformer modules. this leverages the benefits of both modules: Inductive bias of CNN, and the multi-head attention of transformers, thus enabling a rich spatio-temporal representation that enables accurate depth forecasting. Prior work attempts to solve this problem using multi-modal input/output with supervised ground-truth data which is not practical since a large annotated dataset is required. Alternatively to prior methods, this paper forecasts depth and ego motion using only self-supervised raw images as input. the approach performs significantly well on the KITTI dataset benchmark with several performance criteria being even comparable to prior non-forecasting self-supervised monocular depth inference methods.
the KES-IDT-2016 proceedings give an excellent insight into recent research, boththeoretical and applied, in the field of intelligent decision making. the range of topics explored is wide, and covers methods of group...
ISBN:
(数字)9783319396279
ISBN:
(纸本)9783319396262
the KES-IDT-2016 proceedings give an excellent insight into recent research, boththeoretical and applied, in the field of intelligent decision making. the range of topics explored is wide, and covers methods of grouping, classification, prediction, decision support, modelling and many more in such areas as finance, linguistics, medicine, management and transportation. this proceedings contain several sections devoted to specific topics, such as: Specialized Decision Techniques for Data Mining, Transportation and Project Management patternrecognition for Decision Making Systems New Advances of Soft computing in Industrial and Management Engineering Recent Advances in Fuzzy Systems Intelligent Data Analysis and Applications Reasoning-based Intelligent Systems Intelligent Methods for Eye Movement Data Processing and Analysis Intelligent Decision Technologies for Water Resources Management Intelligent Decision Making for Uncertain Unstructured Big Data Decision Making theory for Economics Interdisciplinary Approaches in Business Intelligence Research and Practice patternrecognition in Audio and Speech Processing the KES-IDT conference is a well-established international annual conference, interdisciplinary in nature. these two volumes of proceedings form an excellent account of the latest results and outcomes of recent research in this leading-edge area.
the KES-IDT-2016 proceedings give an excellent insight into recent research, boththeoretical and applied, in the field of intelligent decision making. the range of topics explored is wide, and covers methods of group...
ISBN:
(数字)9783319396309
ISBN:
(纸本)9783319396293
the KES-IDT-2016 proceedings give an excellent insight into recent research, boththeoretical and applied, in the field of intelligent decision making. the range of topics explored is wide, and covers methods of grouping, classification, prediction, decision support, modelling and many more in such areas as finance, linguistics, medicine, management and transportation. this proceedings contain several sections devoted to specific topics, such as: Specialized Decision Techniques for Data Mining, Transportation and Project Management patternrecognition for Decision Making Systems New Advances of Soft computing in Industrial and Management Engineering Recent Advances in Fuzzy Systems Intelligent Data Analysis and Applications Reasoning-based Intelligent Systems Intelligent Methods for Eye Movement Data Processing and Analysis Intelligent Decision Technologies for Water Resources Management Intelligent Decision Making for Uncertain Unstructured Big Data Decision Making theory for Economics Interdisciplinary Approaches in Business Intelligence Research and Practice patternrecognition in Audio and Speech Processing the KES-IDT conference is a well-established international annual conference, interdisciplinary in nature. these two volumes of proceedings form an excellent account of the latest results and outcomes of recent research in this leading-edge area.
Adaptive Histogram Equalization(AHE) is a popular and effective algorithm for image contrast *** it's quite computationally expensive and time *** this paper,a fast implementation of AHE based on pure software tec...
详细信息
ISBN:
(纸本)0780397371
Adaptive Histogram Equalization(AHE) is a popular and effective algorithm for image contrast *** it's quite computationally expensive and time *** this paper,a fast implementation of AHE based on pure software techniques is *** accelerative techniques are combined to form the new fast AHE:First,local histogram is acquired by an iterative approach with a sliding window;Second,in computing cumulative histogram function,not more than half of the histogram is cumulated;third,by keep the block size W2 equal to the product of grey level number and integral power of 2,all the multiplication and division operations are replaced with fast bitwise *** theoretical analysis and experimental results demonstrate the proposed algorithm is effective.
the task of faults localization is discussed in a model-free setting. As a tool for its solution we consider a multiclass patternrecognition problem with a metric in the label space. then, this problem is approximate...
详细信息
ISBN:
(纸本)3540357483
the task of faults localization is discussed in a model-free setting. As a tool for its solution we consider a multiclass patternrecognition problem with a metric in the label space. then, this problem is approximately solved, providing hints on selecting appropriate RBF nets. It was shown that the approximate solution is the exact one in several important cases. Finally, we propose the algorithm for learning the proposed RBF net. the results of its testing are briefly reported.
pattern discovery is one of the fundamental tasks in bioinformatics and patternrecognition is a powerful technique for searching sequence patterns in the biological sequence databases. the significant increase in the...
详细信息
ISBN:
(纸本)9783319075815;9783319075808
pattern discovery is one of the fundamental tasks in bioinformatics and patternrecognition is a powerful technique for searching sequence patterns in the biological sequence databases. the significant increase in the number of DNA and protein sequences expands the need for raising the performance of pattern matching algorithms. For this purpose, heterogeneous architectures can be a good choice due to their potential for high performance and energy efficiency. In this paper we present an efficient implementation of Aho-Corasick (AC) and PFAC (Parallel Failureless Aho-Corasick) algorithm on a heterogeneous CPU/GPU architecture. We progressively redesigned the algorithms and data structures to fit on the GPU architecture. Our results on different protein sequence data sets show 15% speedup comparing to the original implementation of the PFAC algorithm.
Matching-based Semi-supervised video object segmentation (VOS) either resorts to non-local matching to retrieve and aggregate the spatiotemporal features of past frames or relies on template matching to learn similari...
详细信息
ISBN:
(数字)9781665490627
ISBN:
(纸本)9781665490627
Matching-based Semi-supervised video object segmentation (VOS) either resorts to non-local matching to retrieve and aggregate the spatiotemporal features of past frames or relies on template matching to learn similarity representation. Although achieving remarkable progress, they still suffer from considerable computation overhead and failure when confronting with large appearance changes, respectively. In this paper, we are motivated to address the above issues. Firstly, we propose a Context-aware Deformable Alignment (CDA) mechanism to align the spatio-temporal features of past frames more efficiently. To reduce computation complexity dramatically and retain the ability of modeling long-range spatio-temporal dependencies, the CDA mechanism that learns where to match in a deformable fashion belongs to local context-aware matching instead of nonlocal pixel-wise matching. Furthermore, we present a Dynamic Kernel Matching (DKM) technique to tackle the mismatches due to appearance and scale variations. DKM dynamically adapts the template feature to object appearance changes rather than fixing it, which improves the robustness for long-term VOS. Our framework dubbed CDANet is evaluated on popular benchmark sets, which achieves competitive performance compared with SOTA methods.
Variables selection is challenging task due mainly to huge search space. this study addresses the increasingly encountered challenge of variables selection. It addresses the application of machine learning techniques ...
详细信息
Variables selection is challenging task due mainly to huge search space. this study addresses the increasingly encountered challenge of variables selection. It addresses the application of machine learning techniques to the problem of variables selection. We detailed the various models of the variables selection and examined the basic steps that are used to select the cost-effective predictors. We also walked through the initial settings and all variables selection stages, including architecture configuration, strategy generation, learning, model induction, and scoring. Results from this study show that the cost and generalization were seen to improve significantly in terms of computing time and recognition accuracy when the proposed system is applied for medical diagnosis. Good comparisons with an experimental study demonstrate the multidisciplinary applications of our approach. 1877-0509 (C) 2017 the Authors. Published by Elsevier B.V.
暂无评论