image segmentation is one of the most significant and inevitable task in variety areas ranging from face/object/character recognition and medical imaging applications to robotic control and self-driving vehicular syst...
详细信息
ISBN:
(纸本)9781538676417
image segmentation is one of the most significant and inevitable task in variety areas ranging from face/object/character recognition and medical imaging applications to robotic control and self-driving vehicular systems. Accuracy and processing time of image segmentation processes are also prominent parameters for quality of such computervision systems. the proposed method incorporates three main pre-processing techniques such as Down Scaling/Sampling, Gamma Correction and Edge Preserving Smoothing so as to achieve accuracy and robustness of the segmentation. Pre-processing techniques are performed for both Fuzzy C-means (FCM) and K-means algorithm and all RGB information of image are taken into consideration while segmenting the image rather than using only gray scale. Performance analysis are performed on real-world images. Experiments show that, our method achieve higher accuracy levels and feasible processing time results compared to conventional FCM and K-means algorithms.
this talk deals with some fundamental aspects of biometrics and its applications. It basically includes the following subtopics: (1) Overvievv of Biometric Technology and Applications (2) Importance of Security: A Sce...
详细信息
ISBN:
(纸本)9789898111678
this talk deals with some fundamental aspects of biometrics and its applications. It basically includes the following subtopics: (1) Overvievv of Biometric Technology and Applications (2) Importance of Security: A Scenario of Terrorists Attack, (3) What are Biometric Technologies?, (4) Biometrics: Analysis vs Synthesis.(5) Analysis: Interactive Pattern Recognition Concept, (6) Concept of "Semantics" and "Ambiguitv", their Importance and Applications, (7)computervision (3D) and imageprocessing (2D), (8) imageprocessing & computergraphicsas Reverse Process, (9) thermal ImagingRecognition. (10) Synthesis in biometrics, (11) Modeling and Simulation, and (12) more Examples and Applications in Interactive Environment.
Recent approaches to image captioning typically follow an encoder-decoder architecture. the feature vectors extracted from the region proposals obtained from an object detector network serve as input to encoder. Witho...
详细信息
ISBN:
(纸本)9783031581809;9783031581816
Recent approaches to image captioning typically follow an encoder-decoder architecture. the feature vectors extracted from the region proposals obtained from an object detector network serve as input to encoder. Without any explicit spatial information about the visual regions, the caption synthesis model is limited to learn relationship from captions only. However, the structure between the semantic units in images and sentences is different. this work introduces a grid based spatial position encoding scheme to learn relationship from both domains. Furthermore, bi-linear pooling is used with attention for exploiting spatial and channel-wise attention distribution to capture second order interaction between multi-modal inputs. these are integrated within the Transformer architecture achieving a competitive CIDEr score.
Computed Tomography (CT) is one of the significant research areas in medical image analysis. One of the main aspects of CT that researchers remain focused, is on reducing the dosage as Xrays are generally harmful to h...
详细信息
ISBN:
(纸本)9783319681245;9783319681238
Computed Tomography (CT) is one of the significant research areas in medical image analysis. One of the main aspects of CT that researchers remain focused, is on reducing the dosage as Xrays are generally harmful to human bodies. In order to reduce radiation dosage, compressed sensing (CS) based methodologies appear to be promising. the basic premise is that medical images have inherent sparsity in some transformation domain. As a result, CS provides the possibility of recovering a high quality image from fewer projection data. In general, the sensing matrix in CT is generated from Radon projections by appropriately sampling the radial and angular parameters. In our work, by restricting the number of such parameters, we generate an under-determined linear system involving projection (Radon) data and a sparse sensing matrix, bringing thereby the problem into CS framework. Among various recent solvers, the Split-Bregman iterative scheme has of late become popular due to its suitability for solving a wide variety of optimization problems. Intending to exploit the underlying structure of sensing matrix, the present work analyzes its properties and finds a banded structure for an associated intermediate matrix. Using this observation, we simplify the Split-Bregman solver, proposing thereby a CT-specific solver of low complexity. We also provide the efficacy of proposed method empirically.
Brain tumour segmentation is a fundamental task in medical image analysis where each year a number of deep learning models are introduced to delineate the tumour regions with high precision. However, most of these wor...
详细信息
ISBN:
(纸本)9798400710759
Brain tumour segmentation is a fundamental task in medical image analysis where each year a number of deep learning models are introduced to delineate the tumour regions with high precision. However, most of these works rely on large number of parameters and higher computational cost, thus, rendering them ineffective in real world application. therefore, it is important to devise efficient models that can be easily deployed on resource-constrained devices and perform at par withthe existing large models. In this paper, we propose a semi-decoupled distillation technique which trains a lightweight "student" model using the features extracted from the decoder of the nnU-Net "teacher" model and its predictions on these features using a single point-wise convolution layer. the final classification layer remains the same for boththe models and is kept frozen while training the "student" network. Our approach follows a two stage training procedure where the tumour regions are detected and extracted in the first step, and then sent for second stage training to segment fine-grain classes, including edema, enhanced tumour and tumour core. Our extensive experimentation shows that a lightweight distilled model performs competitively with large models on brain tumour segmentation.
the proceedings contain 14 papers. the special focus in this conference is on Document Analysis and Recognition. the topics include: Word-Wise Handwriting Based Gender Identification Using Multi-Gabor Response Fusion;...
ISBN:
(纸本)9789811393600
the proceedings contain 14 papers. the special focus in this conference is on Document Analysis and Recognition. the topics include: Word-Wise Handwriting Based Gender Identification Using Multi-Gabor Response Fusion;a Secure and Light Weight User Authentication System Based on Online Signature Verification for Resource Constrained Mobile Networks;benchmark Datasets for Offline Handwritten Gurmukhi Script Recognition;benchmark Dataset: Offline Handwritten Gurmukhi City Names for Postal Automation;attributed Paths for Layout-Based Document Retrieval;textual Content Retrieval from Filled-in Form images;A Study on the Effect of CNN-Based Transfer Learning on Handwritten Indic and Mixed Numeral Recognition;symbol Spotting in Offline Handwritten Mathematical Expressions;online Handwritten Bangla Character Recognition Using Frechet Distance and Distance Based Features;an Efficient Multi Lingual Optical Character Recognition System for indian Languages through Use of Bharati Script;telugu Word Segmentation Using Fringe Maps;an Efficient Character Segmentation Algorithm for Connected Handwritten Documents.
this paper aims to discuss the implementation of phoneme based Manipuri Keyword Spotting System (MKWSS). Manipuri is a scheduled indian language of Tibeto- Burman origin. Around 5 hours of read speech are collected fr...
详细信息
A Abstract : Multi feature space representation is a common practise in computervision applications. Traditional features such as HOG, SIFT, SURF etc., individually encapsulates certain discriminative cues for visual...
详细信息
ISBN:
(纸本)9781450347532
A Abstract : Multi feature space representation is a common practise in computervision applications. Traditional features such as HOG, SIFT, SURF etc., individually encapsulates certain discriminative cues for visual classification. On the other hand, each layer of a deep neural network generates multi ordered representations. In this paper we present a novel approach for such multi feature representation learning using Adaptive Boosting (AdaBoost). General practise in AdaBoost [8] is to concatenate components of feature spaces and train base learners to classify examples as correctly/incorrectly classified. We posit that multi feature space learning should be viewed as a derivative of cooperative multi agent learning. To this end, we propose a mathematical framework to leverage performance of base learners over each feature space, gauge a measure of "difficulty" of training space and finally make soft weight updates rather than strict binary weight updates prevalent in regular AdaBoost. this is made possible by periodically sharing of response states by our learner agents in the boosting framework. theoretically, such soft weight update policy allows in finite combinations of weight updates on training space compared to only two possibilities in AdaBoost. this opens up the opportunity to identify 'more difficult' examples compared to 'less difficult' examples. We test our model on traditional multi feature representation of MNIST handwritten character dataset and 100-Leaves classification challenge. We consistently outperform traditional and variants of multi view boosting in terms of accuracy while margin analysis reveals that proposed method fosters formation of more con fident ensemble of learner agents. As an application of using our model in conjecture with deep neural network, we test our model on the challenging task of retinal blood vessel segmentation from fundus images of DRIVE dataset by using kernel dictionaries from layers of unsupervised trained stac
this book constitutes the refereed post-conference proceedings of the 5th and 6th International Workshops on Computational Forensics, IWCF 2012 and IWCF 2014, held in Tsukuba, Japan, in November 2010 and August 2014. ...
详细信息
ISBN:
(数字)9783319201252
ISBN:
(纸本)9783319201245
this book constitutes the refereed post-conference proceedings of the 5th and 6th International Workshops on Computational Forensics, IWCF 2012 and IWCF 2014, held in Tsukuba, Japan, in November 2010 and August 2014. the 16 revised full papers and 1 short paper were carefully selected from 34 submissions during a thorough review process. the papers are divided into three broad areas namely biometrics; document image inspection; and applications.
this paper present a novel attempt to introduce an HMM-based multi-resolution and multi-cue segmentation in combination withthe unscented Kalman filter tracking method. It combines multiple features distribution and ...
详细信息
暂无评论