This paper proposes a video editor based on OpenShot with several state-of-the-art facial video editing algorithms as added functionalities. Our editor provides an easy-to-use interface to apply modern lip-syncing alg...
详细信息
The proceedings contain 80 papers. The topics discussed include: PredGAN - a deep multi-scale video prediction framework for detecting anomalies in videos;multiple kernel fisher discriminant metric learning for person...
ISBN:
(纸本)9781450366151
The proceedings contain 80 papers. The topics discussed include: PredGAN - a deep multi-scale video prediction framework for detecting anomalies in videos;multiple kernel fisher discriminant metric learning for person re-identification;vision-based steering angle prediction by the fusion of depth and intensity deep features;zero-shot learning using graph regularized latent discriminative cross-domain triplets;perfectly secure Shamir’s secret sharing scheme for privacy preserving imageprocessing over cloud;moving average recurrent neural network model for video-based person re-identification;activity recognition in egocentric videos using bag of key action units;and learning end-to-end autonomous driving using guided auxiliary supervision.
Down syndrome is a genetic disorder that affects 1 in every 1000 babies born worldwide. The cases of Down syndrome have increased in the past decade. It has been observed that humans with Down syndrome generally tend ...
详细信息
ISBN:
(纸本)9789813290884;9789813290877
Down syndrome is a genetic disorder that affects 1 in every 1000 babies born worldwide. The cases of Down syndrome have increased in the past decade. It has been observed that humans with Down syndrome generally tend to have distinct facial features. This paper proposes a model to identify people suffering from Down syndrome based on their facial features. Deep representation from different parts of the face is extracted and combined with the aid of Deep Convolutional Neural Networks. The combined representations are then classified using a Random Forest-based pipeline. The model was tested on a dataset of over 800 individuals suffering from Down syndrome and was able to achieve a recognition rate of 98.47%.
Reliable detection of defective parts is an essential step for ensuring high-quality assurance standards. This requirement is of primary importance for online vision-based automated pellet stacking system. During nucl...
详细信息
ISBN:
(纸本)9789813290884;9789813290877
Reliable detection of defective parts is an essential step for ensuring high-quality assurance standards. This requirement is of primary importance for online vision-based automated pellet stacking system. During nuclear fuel pin manufacturing, the image of a single row (consecutively placed components with no gap) is processed and analyzed to extract meaningful edges. Generally, these edges follow a regular pattern;however, the presence of surface cracks and chips can alter this pattern. In this paper, we formalize the detection of defective parts as a pattern matching problem. Three different patterns are proposed and evaluated for sensitivity, specificity, and accuracy. An experiment performed with the proposed pattern matching techniques show that multi-pattern matching is the most effective method for identifying defective parts.
High dynamic range (HDR) videos provide a more visually realistic experience than the standard low dynamic range (LDR) videos. Despite having significant progress in HDR imaging, it is still a challenging task to capt...
详细信息
Event-based cameras can capture changes in brightness in the form of asynchronous events, unlike traditional cameras, which has sparked tremendous interest due to their wide range of applications. In this work, we add...
详细信息
image captioning and object detection are some of the most growing and popular research areas in the field of computervision. Almost every upcoming technology uses vision in some way, and with various people research...
详细信息
ISBN:
(纸本)9789813292918;9789813292901
image captioning and object detection are some of the most growing and popular research areas in the field of computervision. Almost every upcoming technology uses vision in some way, and with various people researching in the field of object detection, many vision problems which seemed intractable seem close to solved now. But there has been less research in identifying regions associating actions with objects. Dense image Captioning [8] is one such application, which localizes all the important regions in an image along with their description. Something very similar to normal image captioning, but repeated for every salient region in the image. In this paper, we address the aforementioned problem of detecting regions explaining the query caption. We use edge boxes for efficient object proposals, which we further filter down using a scoremeasure. The object proposals are then captioned using a pretrained Inception [19] model. The captions of each of these regions are checked for similarity with the query caption using the skip-thought vectors [9]. This proposed framework produces interesting and efficient results. We provide a quantitative measure of our experiment by taking the intersection over union (IoU) with the ground truth on the visual genome [10] dataset. By combining the above techniques in an orderly manner, we have been able to achieve encouraging results.
Predicting where humans look in a given scene is a well-known problem with multiple applications in consumer cameras, human-computer interaction, robotics, and gaming. With large-scale image datasets available for hum...
详细信息
ISBN:
(纸本)9789813292918;9789813292901
Predicting where humans look in a given scene is a well-known problem with multiple applications in consumer cameras, human-computer interaction, robotics, and gaming. With large-scale image datasets available for human fixation, it is now possible to train deep neural networks for generating a fixationmap. Human fixations are a function of both local visual features and global context. We incorporate this in a deep neural network by using global and local features of an image to predict human fixations. We sample multi-scale features of the deep residual network and introduce a new method for incorporating these multi-scale features for the end-to-end training of our network. Our model DeepAttent obtains competitive results on SALICON and iSUN datasets and outperforms state-of-the-art methods on various metrics.
computervision is considered as the science and technology of the machines that see. When paired with deep learning, it has limitless applications in various fields. Among various applications, face recognition is on...
详细信息
ISBN:
(纸本)9789813292918;9789813292901
computervision is considered as the science and technology of the machines that see. When paired with deep learning, it has limitless applications in various fields. Among various applications, face recognition is one of the most useful real-life problem-solving applications. We propose a technique that uses image enhancement and facial recognition technique to develop an innovative and timesaving class attendance system. The idea is to train a Convolutional Neural Network (CNN) using the enhanced images of the students in a certain course and then using that learned model, to recognize multiple students present in a lecture. We propose the use of deep learning model that is provided by OpenFace to train and recognize the images. This proposed solution can be easily installed in any organization, if the images of all persons to be marked this way are available with the administration. The proposed system marks attendance of students 100% accurately when captured images have faces in right pose and are not occluded.
This paper proposes a novel method of designing a correlation filter for frequency domain pattern recognition. The proposed correlation filter is designed with linear regression technique and termed as linear regressi...
详细信息
ISBN:
(纸本)9789813290884;9789813290877
This paper proposes a novel method of designing a correlation filter for frequency domain pattern recognition. The proposed correlation filter is designed with linear regression technique and termed as linear regression correlation filter. The design methodology of linear regression correlation filter is completely different from standard correlation filter design techniques. The proposed linear regression correlation filter is estimated or predicted from a linear subspace of weak classifiers. The proposed filter is evaluated on standard benchmark database and promising results are reported.
暂无评论