We propose a novel deep framework, TraCount, for highly overlapping vehicle counting in congested traffic scenes. TraCount uses multiple fully convolutional(FC) sub-networks to predict the density map for a given stat...
详细信息
ISBN:
(纸本)9781450347532
We propose a novel deep framework, TraCount, for highly overlapping vehicle counting in congested traffic scenes. TraCount uses multiple fully convolutional(FC) sub-networks to predict the density map for a given static image of a traffic scene. The different FC sub-networks provide a range in size of receptive fields that enable us to count vehicles whose perspective effect varies significantly in a scene due to the large visual field of surveillance cameras. The predictions of different FC sub-networks are fused by weighted averaging to obtain a final density map. We show that TraCount outperforms the state of the art methods on the challenging TRANCOS dataset that has a total of 46796 vehicles annotated across 1244 images.
Current denoising techniques use the classical orthonormal wavelets for decomposition of an image corrupted with additive white Gaussian noise, upon which various thresholding strategies are built. The use of availabl...
详细信息
ISBN:
(纸本)9781424442195
Current denoising techniques use the classical orthonormal wavelets for decomposition of an image corrupted with additive white Gaussian noise, upon which various thresholding strategies are built. The use of available biorthogonal wavelets in image denoising is less common because of their poor performance. hi this paper, we present a method to design image-matched biorthogonal wavelet bases and report on their potential for denoising. We have conducted experiments on various image datasets namely Natural, Satellite and Medical with the designed wavelets using two existing thresholding strategies. Test results front comparing the performance of matched and fixed biorthogonal wavelets show an average improvement of 35% in MSE for low SNR values (0 to 18db) in every dataset. This improvement was also seen in the PSNR and visual comparisons. This points to the importance of matching when using wavelet-based denoising.
image based methods are a new approach for solving problems in mobile robotics. Instead of building a metric (3D) model of the environment, these methods work directly in the sensor (image) space. The environment is r...
详细信息
ISBN:
(纸本)9781424442195
image based methods are a new approach for solving problems in mobile robotics. Instead of building a metric (3D) model of the environment, these methods work directly in the sensor (image) space. The environment is represented as a topological graph in which each node contains an image taken at some pose in the workspace, and edges connect poses between which a simple path exists. This type of representation is highly scalable and is also well suited to handle the data association problems that effect metric model based methods. In this paper we present an efficient, adaptive method for qualitative localization using content based image retrieval techniques. In addition, we demonstrate an algorithm which can convert this topological graph into a metric model of the environment by incorporating information about loop closures.
People suffering from autism have difficulty with recognizing other people's emotions and are therefore unable to react to it. Although there have been attempts aimed at developing a system for analyzing facial ex...
详细信息
ISBN:
(纸本)9781479915880
People suffering from autism have difficulty with recognizing other people's emotions and are therefore unable to react to it. Although there have been attempts aimed at developing a system for analyzing facial expressions for persons suffering from autism, very little has been explored for capturing one or more expressions from mixed expressions which are a mixture of two closely related expressions. This is essential for psychotherapeutic tool for analysis during counseling. This paper presents the idea of improving the recognition accuracy of one or more of the six prototypic expressions namely happiness, surprise, fear, disgust, sadness and anger from the mixture of two facial expressions. For this purpose a motion gradient based optical flow for muscle movement is computed between frames of a given video sequence. The computed optical flow is further used to generate feature vector as the signature of six basic prototypic expressions. Decision Tree generated rule base is used for clustering the feature vectors obtained in the video sequence and the result of clustering is used for recognition of expressions. The relative intensity of expressions for a given face present in a frame is measured. With the introduction of Component Based Analysis which is basically computing the feature vectors on the proposed regions of interest on a face, considerable improvement has been noticed regarding recognition of one or more expressions. The results have been validated against human judgement.
We present a novel eigenspace-based framework to model a dynamic hand gesture that incorporates both hand shape as well as trajectory information. We address the problem of choosing a gesture set that models an upper ...
详细信息
We present a novel eigenspace-based framework to model a dynamic hand gesture that incorporates both hand shape as well as trajectory information. We address the problem of choosing a gesture set that models an upper bound on gesture recognition efficiency. We show encouraging experimental results on a such a representative set. (c) 2006 Elsevier B.V. All rights reserved.
A nonlinear weighted averaging filter called bilateral filter depends mainly on two parameters. The spatial distance indicates the size, and the intensity distance indicates the contrast of the features to be preserve...
详细信息
Video based multimedia services are showing major growth in recent years. Though the video coding recommendation ITU-T H.265/HEVC is in operating state, the majority of real-time video applications including video con...
详细信息
ISBN:
(纸本)9781467385640
Video based multimedia services are showing major growth in recent years. Though the video coding recommendation ITU-T H.265/HEVC is in operating state, the majority of real-time video applications including video conferencing and live streaming of events rely mainly on less computationally intensive H.264/AVC coding standard which is well established among video industry. Rate control algorithms are indispensable for delivering superior quality video over limited bandwidth connections. In this paper, we propose a rate control technique which provides consistent quality video output over time in a limited bandwidth video conferencing scenario. Using the concept of video traffic prediction and linear complexity model, we compute the complexity of the video sequence in real time. We present a better estimate of quantization parameter at the GOP (group of picture) layer. Our proposed method maintains similar quality for reconstructed video when compared to rate control scheme of JM 19.0 implemented in real-time video conferencing system with restricted quality variation. On an average, 22% improvement has been observed in terms of standard deviation of frame PSNR values when compared to JM 19.0 rate control scheme implemented in high resolution video conferencing scenario.
Dopaminergic imaging using Single Photon Emission Computed Tomography (SPECT) with I-123-Ioflupane have shown to increase the diagnostic accuracy in Parkinson's Disease (PD). Studies show that around 10% of subjec...
详细信息
ISBN:
(纸本)9781479915880
Dopaminergic imaging using Single Photon Emission Computed Tomography (SPECT) with I-123-Ioflupane have shown to increase the diagnostic accuracy in Parkinson's Disease (PD). Studies show that around 10% of subjects who are clinically diagnosed as PD, have SPECT scans in the normal range and are called Scans Without Evidence of Dopaminergic Deficit (SWEDD) subjects. Subsequent follow-up on these subjects has indicated that they are unlikely to have PD. Detection and differentiation of PD and SWEDD is problematic in the early stages of the disease. Early and accurate diagnosis of PD and also SWEDD is crucial for early management, avoidance of unnecessary medical examinations and therapies;and their side-effects. We in our paper, use the SPECT images from 35 Normal, 36 PD and 38 SWEDD subjects as obtained from the Parkinson's Progression Markers Initiative (PPMI) database, to carry out intensity-based surface fitting using polynomial model. This is the first time that such kind of modeling is carried out on the SPECT images for the characterization of PD. Our results show that the surface profile in terms of model coefficients and goodness-of-fit parameters is different for Normal, Early PD and SWEDD subjects. Such kind of modeling may aid in the diagnosis of early PD and SWEDD from SPECT images.
The paper presents a hybrid thresholding approach for binarization and enhancement of degraded documents. Historical documents contain information of great cultural and scientific value. But such documents are frequen...
详细信息
ISBN:
(纸本)9781424442195
The paper presents a hybrid thresholding approach for binarization and enhancement of degraded documents. Historical documents contain information of great cultural and scientific value. But such documents are frequently degraded over time. Digitized degraded documents require specialized processing to remove different kinds of noise and to improve readability. The approach for enhancing degraded documents uses a combination of two thresholding algorithms. First, iterative global thresholding is applied to the smoothed degraded image until the stopping criteria is reached Then a threshold selection method from gray level histogram is used to binarize the image. The next step is detecting areas where noise still remains and applying iterative thresholding locally. A method to improve the quality of textual information in the document is also done as a post processing stage, thus making the approach efficient and better suited for character recognition applications.
We propose a novel physically based method to simulate explosions and other compressible fluid phenomena. The method uses compressible Navier Stokes equations for modeling the explosion with a Semi-Lagrangian integrat...
详细信息
ISBN:
(纸本)9781424442195
We propose a novel physically based method to simulate explosions and other compressible fluid phenomena. The method uses compressible Navier Stokes equations for modeling the explosion with a Semi-Lagrangian integration method. The proposed integration method addresses the issues of stability and larger timesteps. This is achieved by modifying the Semi-Lagrangian method to reduce dissipation and increase accuracy, using improved interpolation and an error correction method. The proposed method allows the rendering of related phenomena like a fireball, dust and smoke clouds, and the simulation of solid interaction like rigid fracture and rigid body simulation. Our method is flexible enough to afford substantial artistic control over the behavior of the explosion.
暂无评论