Detection of structural changes in images is one of the important tasks of remote sensing (RS) data thematic analysis. The effective way to solve it is applying the Pyt'ev's morphological projector to the pair...
Detection of structural changes in images is one of the important tasks of remote sensing (RS) data thematic analysis. The effective way to solve it is applying the Pyt'ev's morphological projector to the pair of images of the same scene acquired on different dates. The main advantage of this method is its invariance to global brightness transformations, which in the case of RS images correspond to different parameters of the atmosphere or the different values of the brightness-contrast ratio of the scene. However, the classical Pyt'ev's morphological projector and its regularized versions do not take into account the spatial connectivity of image samples. As a result, they ignore the textural features of images. In this article, we suggest the algorithm of structural changes detection based on superpixel segmentation and Pyt'ev's morphological projector that takes into account local characteristics of the image pixels. In the experimental research, we analyzed the accuracies of the proposed and classical Pyt'ev's structural change detection methods using simulated and real RS images. The comparison of two algorithms showed that the proposed method is more robust to the additive white Gaussian noise (AWGN) at different values of signal-to-noise (SNR) ratio. Additionally, the experiments with nonlinear brightness distortions (vignetting) of one of the pair of images demonstrated that the proposed method has lower false positive rates than the classical one.
The system for determining the human head position and orientation for vehicle simulators is considered in the work. It is necessary to take into account the current position and orientation of the driver's head w...
详细信息
The system for determining the human head position and orientation for vehicle simulators is considered in the work. It is necessary to take into account the current position and orientation of the driver's head when simulating a virtual space. The model for determining the human head position and orientation is developed and studied. In the course of the study, the developed position and orientation algorithm was compared with analogs belonging. This model is characterized by the joint use of three-dimensional reconstruction, stereo vision, the spectral theory of graphs, spectral embedding of graphs into a vector subspace, and allowing the head to rotate to 50° with an accuracy of 3°, which exceeds known approaches. A system for determining the human head position and orientation based on stereo images is constructed, which implements the developed algorithms. The developed system can be represented in the form of the following structure: stereo module, initialization module, tracking module, module for calculating the angles of the head orientation, head detection module, information transfer module.
Heterogeneous face recognition (HFR) has a prominent importance in sophisticated face recognition systems. Thermal to visible scenario, where the gallery and the probe images are respectively captured in visible and l...
详细信息
ISBN:
(纸本)9781538616291
Heterogeneous face recognition (HFR) has a prominent importance in sophisticated face recognition systems. Thermal to visible scenario, where the gallery and the probe images are respectively captured in visible and long wavelength infrared (LWIR) band, is one of the most challenging and interesting HFR scenarios. Since the formation of thermal images does not require an external illumination source, the deployment of thermal probe images is practical even in totally darkness conditions such as night security surveillance systems. In this paper, we propose an ensemble classifier which uses the random subspace idea for defining different representations of each image in distinct base learners, and exploits the sparse representation algorithm for the classification of thermal probe images. According to the experimental results, our proposed algorithm leads significant performance improvements in the area of thermal to visible face recognition and achieves the average Rank-1 accuracy of 89.33 percent.
A map of a location made of a human mindset is made briefly by recording a situation already seen in memory. Human behavior is easy to do without a lot of thinking. The process is observed more deeply that when the hu...
详细信息
The primary failure mechanism in brittle materials such as ceramics, granite and some metal alloys is through the presence of defects which result in crack formation and propagation under the application of load. We a...
详细信息
ISBN:
(纸本)9781538638002
The primary failure mechanism in brittle materials such as ceramics, granite and some metal alloys is through the presence of defects which result in crack formation and propagation under the application of load. We are interested in studying this process of crack propagation, interaction and coalescence, which degrades the strength of the specimen. Traditionally, engineering applications that study these materials employ finite element mesh-based methods that require hundreds of hours of processing time on multi-core high performance clusters. We have developed a graph-based reduced order model that captures key geometric and topological features of the dynamic fracture propagation network. We report here the early stages of our study in which deep neural networks will be applied to dynamic directed weighted graphs capturing various metrics of crack-pair interaction strength with the aim of predicting crack lengths, dynamic crack growth/coalescence properties, distributions of these properties over the entire material through time, failure paths and time to failure. Our graph-based representations allow us to consider detailed topology in conjunction with metric geometry to gain insights into the dominant mechanisms that drive the physics in these systems.
Laser-assisted hair removal devices aim to remove body hair permanently. In most cases, these devices irradiate the whole area of the skin with a homogenous power density. Thus, a significant portion of the skin, wher...
详细信息
ISBN:
(纸本)9781509028092
Laser-assisted hair removal devices aim to remove body hair permanently. In most cases, these devices irradiate the whole area of the skin with a homogenous power density. Thus, a significant portion of the skin, where hair is not present, is burnt unnecessarily causing health risks. Therefore, methods that can distinguish hair regions automatically would be very helpful avoiding these unnecessary applications of laser. This study proposes a new system of algorithms to detect hair regions with the help of a digital camera. Unlike previous limited number of studies, our methods are very fast allowing for real-time application. Proposed methods are based on certain features derived from histograms of hair and skin regions. We compare our algorithm with competing methods in terms of localization performance and computation time and show that a much faster real-time accurate localization of hair regions is possible with the proposed method. Our results show that the algorithm we have developed is extremely fast (around 45 milliseconds) allowing for real-time application with high accuracy hair localization (96.48 %).
Light field imaging is limited in its computational processing demands of high sampling for both spatial and angular dimensions. Single-shot light field cameras sacrifice spatial resolution to sample angular viewpoint...
详细信息
ISBN:
(纸本)9781538607336
Light field imaging is limited in its computational processing demands of high sampling for both spatial and angular dimensions. Single-shot light field cameras sacrifice spatial resolution to sample angular viewpoints, typically by multiplexing incoming rays onto a 2D sensor array. While this resolution can be recovered using compressive sensing, these iterative solutions are slow in processing a light field. We present a deep learning approach using a new, two branch network architecture, consisting jointly of an autoencoder and a 4D CNN, to recover a high resolution 4D light field from a single coded 2D image. This network decreases reconstruction time significantly while achieving average PSNR values of 26-32 dB on a variety of light fields. In particular, reconstruction time is decreased from 35 minutes to 6.7 minutes as compared to the dictionary method for equivalent visual quality. These reconstructions are performed at small sampling/compression ratios as low as 8%, allowing for cheaper coded light field cameras. We test our network reconstructions on synthetic light fields, simulated coded measurements of real light fields captured from a Lytro Illum camera, and real coded images from a custom CMOS diffractive light field camera. The combination of compressive light field capture with deep learning allows the potential for real-time light field video acquisition systems in the future.
The proceedings contain 40 papers. The special focus in this conference is on Information Hiding, Secret Sharing, Speech Signal processing, Communication Protocols, Techniques, Encryption and Authentication Methods. T...
ISBN:
(纸本)9783319502083
The proceedings contain 40 papers. The special focus in this conference is on Information Hiding, Secret Sharing, Speech Signal processing, Communication Protocols, Techniques, Encryption and Authentication Methods. The topics include: A revisit to LSB substitution based data hiding for embedding more information;behavior steganography in social network;robust steganography using texture synthesis;a quantization-based image watermarking scheme using vector dot product;high-capacity robust watermarking approach for protecting ownership right;a data hiding method based on multi-predictor and pixel value ordering;a large payload webpage data embedding method using CSS attributes modification;the study of steganographic algorithms based on pixel value difference;digital audio watermarking robust against locality sensitive hashing;copyright protection method based on the main feature of digital images;a study on tailor-made speech synthesis based on deep neural networks;an improved 5-2 channel downmix algorithm for 3D audio reproduction;investigation on the head-related modulation transfer function for monaural DOA;temporal characteristics of perceived reality of multimodal contents;research on frequency automatically switching technology for china highway traffic radio;an automatic decoding method for Morse signal based on clustering algorithm;a novel digital rights management mechanism on peer-to-peer streaming system;a framework for supporting application level interoperability between IPv4 and IPv6;a new image encryption instant communication method based on matrix transformation and a three-party password authenticated key exchange protocol resistant to stolen smart card attacks.
Additive noise is one among the prominent types of noises which degrades the quality of images. A very large number of algorithms, in spatial, frequency and wavelet domain have been proposed to enhance images corrupte...
详细信息
ISBN:
(纸本)9789811054273;9789811054266
Additive noise is one among the prominent types of noises which degrades the quality of images. A very large number of algorithms, in spatial, frequency and wavelet domain have been proposed to enhance images corrupted with additive noise. All the methods suggested have their own advantages as well as disadvantages. With the availability of parallel processing capability, in low end workstations and systems, fusion of two or more de-noising methods has become a topic of interest. In this paper, we have implemented one of the recent contributions to mean filter - a fuzzy filter. Also, as a complementary filter, the basic Non Local Means filter is implemented. Experiments were carried out by fusing the results obtained through the two filters. The results obtained establish the merit of the fusion approach.
Deep Convolutional Neural Networks (CNN) are the state-of-the-art performers for the object detection task. It is well known that object detection requires more computation and memory than image classification. In thi...
详细信息
ISBN:
(纸本)9781538607336
Deep Convolutional Neural Networks (CNN) are the state-of-the-art performers for the object detection task. It is well known that object detection requires more computation and memory than image classification. In this work, we propose LCDet, a fully-convolutional neural network for generic object detection that aims to work in embedded systems. We design and develop an end-to-end TensorFlow(TF)-based model. The detection works by a single forward pass through the network. Additionally, we employ 8-bit quantization on the learned weights. As a use case, we choose face detection and train the proposed model on images containing a varying number of faces of different sizes. We evaluate the face detection performance on publicly available dataset FDDB and Widerface. Our experimental results show that the proposed method achieves comparative accuracy comparing with state-of-the-art CNN-based face detection methods while reducing the model size by 3x and memory-BW by 3 - 4x comparing with one of the best real-time CNN-based object detector YOLO [23]. Our 8-bit fixed-point TF-model provides additional 4x memory reduction while keeping the accuracy nearly as good as the floating point model and achieves 20x performance gain compared to the floating point model. Thus the proposed model is amenable for embedded implementations and is generic to be extended to any number of categories of objects.
暂无评论