Face detection and tracking is one of the emerging research areas in the image analysis and computer vision systems. This face detection and tracking helps local security forces to investigate crime incidents. This pa...
详细信息
ISBN:
(纸本)9781509010486
Face detection and tracking is one of the emerging research areas in the image analysis and computer vision systems. This face detection and tracking helps local security forces to investigate crime incidents. This paper describes a face tracking framework that is capable of tracking a face in real time rapidly frame by frame. Camshift algorithm and KLT algorithm implemented and a comparison study between these two algorithms has been described in this paper. A real time video is experimented using these two algorithms for face detection and tracking. The experimental results show that the KLT algorithm performance is better than the Camshift algorithm in detecting the face and tracking the face. In this article it has been shown how the KLT algorithm proved to be better tracking algorithm than Camshift algorithm.
Peak search for three-dimensional rotation electron diffraction image is almost the most important step in crystal structure determination. The difference of Gaussian method is the traditional approach for this task w...
详细信息
ISBN:
(纸本)9781509055227
Peak search for three-dimensional rotation electron diffraction image is almost the most important step in crystal structure determination. The difference of Gaussian method is the traditional approach for this task with the disadvantage that values of three tunable parameters need to be determined by users. To address this drawback, this paper presents a local gradient based peak search algorithm that needs only one tunable parameter. Experiments show that our proposed method is as effective as the difference of Gaussian method in peak detection task, but has obvious advantages in speed and convenience.
En route to an intraoperative imaging system for brain surgery, a thermographic camera associated with time-resolved data analysis is used for the identification of functional areas and the detection of brain tumours....
详细信息
ISBN:
(纸本)9781509029600
En route to an intraoperative imaging system for brain surgery, a thermographic camera associated with time-resolved data analysis is used for the identification of functional areas and the detection of brain tumours. In this contribution we propose a Cellular Nonlinear Network based imageprocessing system supporting massively-parallel and real-time processing of thermographic video data. In the current setup the system is implemented on a Xilinx Zynq FPGA and is used for the preprocessing of the raw data. The development and implementation of a digital low-pass filter for the compensation of local motion artefacts will be demonstrated as a typical application example of the freely programmable system.
As a kind of widely used form of two-dimension codes, the QR codes have been widely applied in various aspects of our life, but in the process of the use of the QR codes, the QR code images captured often suffer from ...
详细信息
ISBN:
(纸本)9781467388399
As a kind of widely used form of two-dimension codes, the QR codes have been widely applied in various aspects of our life, but in the process of the use of the QR codes, the QR code images captured often suffer from deformity because the QR code label is attached onto the object irregular surface (for example, a curve surface) or the camera and the QR code label are different in position, therefore, the QR code image recognition correction method based on the shape function has been put forward and verified, with the experimental results showing that the improved algorithm can restore the graphic effect.
Adaptive radiotherapy is a technique intended to increase the accuracy of radiotherapy. Currently, it is not clinically feasible due to the time required to process the images of patient anatomy. Hardware acceleration...
详细信息
Adaptive radiotherapy is a technique intended to increase the accuracy of radiotherapy. Currently, it is not clinically feasible due to the time required to process the images of patient anatomy. Hardware acceleration of imageprocessingalgorithms may allow them to be carried out in a clinically acceptable timeframe. This paper presents the experiences encountered using high-level synthesis tools to design an accelerated segmentation algorithm for computed tomography images targeted for implementation on a System on Chip. Hardware coprocessors and their interfaces for optimal threshold generation and 3D mean filter algorithms were synthesised from C++ functions. Hardware acceleration significantly outperformed the software only implementation. The high-level synthesis tools allowed the rapid exploration of different design options. However, hardware design knowledge was still necessary in order to interpret the results effectively.
Offline handwriting recognition systems require cropped text line images for both training and recognition. On the one hand, the annotation of position and transcript at line level is costly to obtain. On the other ha...
详细信息
ISBN:
(纸本)9781510838819
Offline handwriting recognition systems require cropped text line images for both training and recognition. On the one hand, the annotation of position and transcript at line level is costly to obtain. On the other hand, automatic line segmentation algorithms are prone to errors, compromising the subsequent recognition. In this paper, we propose a modification of the popular and efficient Multi-Dimensional Long Short-Term Memory Recurrent Neural Networks (MDLSTM-RNNs) to enable end-to-end processing of handwritten paragraphs. More particularly, we replace the collapse layer transforming the two-dimensional representation into a sequence of predictions by a recurrent version which can select one line at a time. In the proposed model, a neural network performs a kind of implicit line segmentation by computing attention weights on the image representation. The experiments on paragraphs of Rimes and LAM databases yield results that are competitive with those of networks trained at line level, and constitute a significant step towards end-to-end transcription of full documents.
The revolution of technologies (social networks, smartphones, GPS and Remote Sensing image) increase the volume of informations wich makes humanity in new need “Storage of huge volume of data”. the traditional strat...
详细信息
ISBN:
(纸本)9781467385275
The revolution of technologies (social networks, smartphones, GPS and Remote Sensing image) increase the volume of informations wich makes humanity in new need “Storage of huge volume of data”. the traditional strategy to store data become problem for humanity and then this need build new art to resolve this problems the “Spatial Big Data (SBD)” SBD store proncipally three types of data:vector data, raster data and network data. The complexity and nature of spatial databases make them ideal for applying parallel processing. This also emphasizes the need for developing new efficient geospatial analytic for analyzing spatial big data. So, we review the most used spatial data algorithms attracting human interests especially when the amount of satellite images continues to grow as more information becomes available. In this context, we propose a system based on Hadoop an open source system that implements the MapReduce programming model and that can improve the classification of large scale remote sensing image and benefit the power of spatial big data concept.
Optical coherence tomography (OCT) is the current very fast and accurate modality for noninvasive assessment of 3D retinal structure. Due to large amount of data acquired with this technique the resolution of 3D scans...
详细信息
ISBN:
(纸本)9781509026616
Optical coherence tomography (OCT) is the current very fast and accurate modality for noninvasive assessment of 3D retinal structure. Due to large amount of data acquired with this technique the resolution of 3D scans is limited. In this paper we present a new method for improving resolution of 3D macula scans while maintaining short acquisition time and robustness with respect to motion artifacts. Our approach is based on multi-frame super-resolution method applied to several 3D standard resolution OCT scans. Presented experiments where performed on volumetric data acquired from adult patients with the use of Avanti RTvue device. Each OCT cross-section (B-scan) was subjected to image denoising and retinal layers segmentation. The generated 3D super-resolution scans have significantly improved quality of the vertical cross-sections.
Iris recognition systems offer highly accurate personal identification both on small and very-large scale systems needed in government, forensic and commercial applications. The automatic segmentation of a noise-free ...
详细信息
ISBN:
(纸本)9781509041725
Iris recognition systems offer highly accurate personal identification both on small and very-large scale systems needed in government, forensic and commercial applications. The automatic segmentation of a noise-free iris region is imperative for optimal performance of the system. However, image characteristics such as brightness and contrast, the differing levels of pigmentation, occlusion by eyelashes and/or eyelids, coupled with varying sensor and environmental conditions, makes iris segmentation a huge and difficult task. This paper proposes an image pre-processing algorithm for robust iris segmentation of low contrast images, aimed at reducing mis-localization errors of basic curve-fitting algorithms. Similar to face detection, the algorithm performs iris detection with a k-NN classifier trained with features extracted by a rotation-invariant texture descriptor based on the co-occurrence of local binary patterns. The integration of the proposed algorithm into an existing open-source iris segmentation module offered a 40% improvement in execution time; a segmentation accuracy of 92% was also recorded over 1,898 low contrast eye images acquired from African subjects. The low contrast eye images were acquired to support diversity in iris recognition.
This paper describes an efficient edge detection algorithm that can be used as a plug-in for digital imageprocessingsystems. The proposed algorithm uses a method based on iterative clustering targeting a reduced num...
详细信息
This paper describes an efficient edge detection algorithm that can be used as a plug-in for digital imageprocessingsystems. The proposed algorithm uses a method based on iterative clustering targeting a reduced number of operations. The algorithm splits the image into two parts, background and foreground, and calculates the mean value for each of them. Based on these results, the new threshold value will be obtained and looped until the mean values remain unchanged. The only pixels affected by the change are the pixels with values between the previous two thresholds, so only they have to be redistributed to a new class. As a result, only few operations are needed in order to obtain the desired threshold. All the algorithms and results obtained in this paper are developed and tested using the C# programming language.
暂无评论