Many materials have distinct spectral profiles, which facilitates estimation of the material composition of a scene by processing its hyperspectral image (HSI). However, this process is inherently wasteful since high-...
详细信息
ISBN:
(纸本)9781728152301
Many materials have distinct spectral profiles, which facilitates estimation of the material composition of a scene by processing its hyperspectral image (HSI). However, this process is inherently wasteful since high-dimensional HSIs are expensive to acquire and only a set of linear projections of the HSI contribute to the classification task. This paper proposes the concept of programmable spectrometry for per-pixel material classification, where instead of sensing the HSI of the scene and then processing it, we optically compute the spectrally-filtered images. This is achieved using a computational camera with a programmable spectral response. Our approach provides gains both in terms of acquisition speed - since only the relevant measurements are acquired and in signal-to-noise ratio - since we invariably avoid narrowband filters that are light inefficient. Given ample training data, we use learning techniques to identify the bank of spectral profiles that facilitate material classification. We verify the method in simulations, as well as validate our findings using a lab prototype of the camera.
The significance of age recognition from faces in various domains, including minor security and online shopping, has drawn potential attention. It still faces a number of difficulties, though, particularly in an unres...
详细信息
ISBN:
(纸本)9798350351491;9798350351484
The significance of age recognition from faces in various domains, including minor security and online shopping, has drawn potential attention. It still faces a number of difficulties, though, particularly in an unrestricted setting, including anti-aging therapies, lighting circumstances, and filter effects. Advancements in computer vision offer promising avenues for more precise age prediction, particularly in facial analysis. However, challenges persist, including the need for robust datasets, standardized protocols, especially in computational approaches and generalization model's. This work provides an overview of the three methodologies employed in age estimation. The performance of their approach is evaluated by confusion matrix. The first test, which uses three datasets separately, can demonstrate that using just one dataset is insufficient to produce a high-quality confusion matrix. The second test overcome this problem by combine data from various sources, apply data augmentation technique and perform early stopping to achieve an excellent confusion matrix for age recognition. Obtained results validate the proposed approach.
In this paper, we propose a new camera calibration method for the 3D-based image synthesis and 3D reconstruction. We improve the problem as changing the principle point for obtaining the linear equation. According to ...
详细信息
作者:
Venkatesh, YVIndian Inst Sci
Dept Elect Engn Comp Vis & Artificial Intelligence Lab Bangalore 560012 Karnataka India
Generalized Hermite polynomials in two variables are employed for the reconstruction of images from a knowledge of their zero crossing contours. The problem of reconstruction of signals as functions of two variables i...
详细信息
ISBN:
(纸本)0818688211
Generalized Hermite polynomials in two variables are employed for the reconstruction of images from a knowledge of their zero crossing contours. The problem of reconstruction of signals as functions of two variables is not a mere extension of that of a single variable. This is a consequence of the fact that the spatial and spectral characteristics of two-variable functions are quite distinct from what one can expect from their separate projections on to the coordinate axes. One of the results of the paper is that we cannot guarantee uniqueness in reconstruction unless we impose certain constraints on, for instance, space-bandwidth products/ratios in the x - w(x), v - w(y) directions, of the unknown image. Further, a distinguishing feature of the proposed approach is that the standard assumption of bandlimitedness is not invoked. The proposed framework is believed to provide a more unified procedure for signal reconstruction (of uni- and multi-dimensional signals) from partial information than most of the results of the literature. For lack of space, only the main analytical and computational results are presented.
Single-molecule localization microscopy (SMLM) is a popular microscopic technique that achieves super resolution imaging by localizing individual blinking molecules in thousands of frames. Therefore, the reconstructed...
详细信息
ISBN:
(纸本)9781538636411
Single-molecule localization microscopy (SMLM) is a popular microscopic technique that achieves super resolution imaging by localizing individual blinking molecules in thousands of frames. Therefore, the reconstructed high-resolution image is a combination of millions of point sources. This particular computational reconstruction leads to the question of the estimation of the image resolution. Fourier-ring correlation (FRC) is the standard tool for assessing the resolution. It has been proposed for SMLM by computing a discrete correlation in the Fourier domain. In this work, we derive a closed-form expression to compute the continuous FRC. Our implementation provides an exact ERG and an alternative to compute a parameter-free FRC. In addition, it gives insights on the discrepancy of the discrete FRC and yields a rule to select its parameters such as the spatial sampling step or the width of the kernel used as density estimator.
Recent work on Capon beamforming suggest that it can provide increased lateral resolution when applied in a medical ultrasound imaging setting. In this paper, the high computational complexity of the Capon beamformer ...
详细信息
ISBN:
(纸本)9781467345620;9781467345613
Recent work on Capon beamforming suggest that it can provide increased lateral resolution when applied in a medical ultrasound imaging setting. In this paper, the high computational complexity of the Capon beamformer is targeted with the use of a Graphics processing Unit (GPU). In-vivo results with Capon beamforming applied on a full cardiac cycle in addition to simulations are presented. With the GPU we are able to process a 70 degrees sector cardiac image from a 64 element phased array at interactive frame rates using both spatial and temporal smoothing. For a typical cardiac ultrasound image of 80 x 400 pixels (70 degrees sector, 15 cm range) acquired using a 2.5 MHz, M=64 element phased array, we obtain 10 fps (subarray length L=M/2, temporal smoothing over 3 samples). If we perform a 2-element pre-beamforming, the channel count is reduced to 32, and frame rate is increased to 44 fps. For a 32 element phased array we need less beams to cover the sector (40 x 400 pixels), hence with the same parameters the frame rate increases to 87 fps. The target GPU was the Nvidias Quadro 6000, capable of 1 Tflops.
作者:
AnonIEEE
Industrial Electronics Soc New York NY USA IEEE Industrial Electronics Soc New York NY USA
The following topics are dealt with: automated inspection;robots;CAD;database systems;mobile robots;3-D sensors;pattern data capture;stereo vision;imageprocessing technology;object recognition;motion analysis;expert ...
详细信息
The following topics are dealt with: automated inspection;robots;CAD;database systems;mobile robots;3-D sensors;pattern data capture;stereo vision;imageprocessing technology;object recognition;motion analysis;expert systems;signalprocessing;image analysis;and feature detection. 63 papers are included in the proceedings.
Acne Vulgaris is the most common skin disease and affects 85% of population at some point in life, typically in adolescence. Objective evaluation of acne severity is necessary to asses the efficacy of medical treatmen...
详细信息
ISBN:
(纸本)9781538627266
Acne Vulgaris is the most common skin disease and affects 85% of population at some point in life, typically in adolescence. Objective evaluation of acne severity is necessary to asses the efficacy of medical treatment procedures. Traditionally acne evaluation is done by dermatologists manually counting the number of acne lesions through visual inspection or scanning acquired images of the patient's skin. This method is time consuming and requires excessive effort by the physician. In this paper a prototype application for automatic acne detection, lesion counting and reporting through the processing of distance picture taken by mobile devices is developed. The pipeline of the application is composed of body part detection, skin segmentation, heat-mapping, acne extraction and blob detection. Body part detection is accomplished using different Haar-Cascade classifiers;frontal face, right profile, left profile and torso are discriminated. Skin segmentation has been performed using an ensemble of random forest models trained on a augmented version of the FSD dataset. The set of features was engineered combining colour, texture, spatial, shape and unsupervised descriptors and selected by a feature importance step. The a* channel of the CIELab colour space was used for enhance the visual contrast between inflamed zones and healthy skin. Finally acne extraction and blob detection were accomplished through Adaptive Thresholding and Laplacian of Gaussian filtering. Reports are generated containing number, position and ray size of the detected spots.
One of the key challenges of deep learning based image retrieval remains in aggregating convolutional activations into one highly representative feature vector. Ideally, this descriptor should encode semantic, spatial...
详细信息
ISBN:
(纸本)9781728118178
One of the key challenges of deep learning based image retrieval remains in aggregating convolutional activations into one highly representative feature vector. Ideally, this descriptor should encode semantic, spatial and low level information. Even though off-the-shelf pre-trained neural networks can already produce good representations in combination with aggregation methods, appropriate fine tuning for the task of image retrieval has shown to significantly boost retrieval performance. In this paper we present a simple yet effective supervised aggregation method built on top of existing regional pooling approaches. In addition to the maximum activation of a given region, we calculate regional average activations of extracted feature maps. Subsequently, weights for each of the pooled feature vectors are learned to perform a weighted aggregation to a single feature vector. Furthermore, we apply our newly proposed NRA loss function for deep metric learning to fine tune the backbone neural network and to learn the aggregation weights. Our method achieves state-of-the-art results for the INRIA Holidays data set and competitive results for the Oxford Buildings and Paris data sets while reducing the training time significantly.
An elaborated parallel implementation of particle filter in GPGPU has been applied to track steering hands of a car driver with depth image sensor for extracting arm and hand regions. Use of depth image sensor can pro...
详细信息
ISBN:
(纸本)9781479931842
An elaborated parallel implementation of particle filter in GPGPU has been applied to track steering hands of a car driver with depth image sensor for extracting arm and hand regions. Use of depth image sensor can provide more clear extraction of arm and hand regions for left and right than conventional image sensor. Parallel implementation in GPGPU allows fast computation and use of significantly larger number of particles in particle filter algorithm. Experimental results show computational performance of the proposed method.
暂无评论