Almost half of the car accidents are caused by fatigue and the sleepiness of the driver. The purpose of our work is to propose a non-intrusive and real-time system for drowsiness detection based on facial expression a...
详细信息
ISBN:
(数字)9781728110806
ISBN:
(纸本)9781728110813
Almost half of the car accidents are caused by fatigue and the sleepiness of the driver. The purpose of our work is to propose a non-intrusive and real-time system for drowsiness detection based on facial expression analysis. In this paper, we will find a summary of some research work with a comparative study of performance to determine the most relevant method for detecting drowsiness. We will be particularly interested in analyzing prediction methods by supervising the eyes state of the driver using an imageprocessing technique. The software simulation is performed via a C/C++ code for an I5 processor-based platform, a webcam, and a Linux operating system.
3D object detection serves as a crucial basis of visual perception, motion prediction, and planning for automated driving. To apply an algorithm for this purpose, the detection of all types of road users in real-time ...
详细信息
ISBN:
(数字)9781728141497
ISBN:
(纸本)9781728141503
3D object detection serves as a crucial basis of visual perception, motion prediction, and planning for automated driving. To apply an algorithm for this purpose, the detection of all types of road users in real-time is an essential condition. In this paper, we propose an approach that projects the 3D points of image-based bounding box proposals into so-called grid map patches. These patches are used to estimate the exact dimensions of the 3D box with the help of a lightweight CNN. The complete proposed processing chain is parallelized and implemented on a GPU. This makes our approach the fastest stereo-based 3D object detector on the KITTI benchmark while still achieving results that are within the range of the best image-based algorithms.
The CPU-FPGA heterogeneous architectures became an attractive option for developing hardware accelerators to process computer vision algorithms. In this paper, we improve the support for streaming processing on the In...
详细信息
ISBN:
(数字)9781728170442
ISBN:
(纸本)9781728170459
The CPU-FPGA heterogeneous architectures became an attractive option for developing hardware accelerators to process computer vision algorithms. In this paper, we improve the support for streaming processing on the Intel HARPv2 platform by proposing strategies such as data ordering, double buffer, and management of multiple memory addresses. We demonstrate the feasibility of this new strategy by a case study with a hardware implementation of the Semi-Global Matching (SGM) algorithm for stereo vision. With these strategies, we can process depth images with a resolution of 1920×1080 pixels achieving a processing rate of about 48 FPS. The processing performance overcomes the state-of-art CPU-FPGA heterogeneous architectures results for processing of the promissing SGM technique.
Deep learning platforms have become hugely popular due to their successes in natural language processing and imageprocessing. Our objective is to show how deep learning platforms can be used for control problems. We ...
详细信息
ISBN:
(数字)9781538682661
ISBN:
(纸本)9781538682678
Deep learning platforms have become hugely popular due to their successes in natural language processing and imageprocessing. Our objective is to show how deep learning platforms can be used for control problems. We do not make judgments about their performance as compared to traditional control approaches. We show that the main challenge when using deep learning platforms for learning control policies for nonlinear systems is ensuring the stability of the learning algorithm that depends on the stability of the closed loop system during the learning process. We discuss two approaches for overcoming the potential instability of the optimization algorithm, and showcase them in the context of learning a stabilizing controller for an inverted pendulum.
The evolution of the glass production process requires high accuracy in defects detection and faster production lines. Both requirements result in a reduction in the processing time of defect detection in case of real...
详细信息
One of the most common methods in breast cancer radiotherapy planning is Magnetic Resonance Imaging (MRI). It is also used for patient evaluation during treatment because of its sensitivity and lack of ionizing radiat...
详细信息
ISBN:
(纸本)9783319912110
One of the most common methods in breast cancer radiotherapy planning is Magnetic Resonance Imaging (MRI). It is also used for patient evaluation during treatment because of its sensitivity and lack of ionizing radiation. During each imaging session a patient position can be different and inaccuracies can occur. In this case it is very difficult to compare two image sets originating from different patient examination. The main goals of this work were to implement an algorithm, based on affine transformation with Mutual Information as the quality factor of images match and the method based on the Navier-Lame equation for elastic image co-registration. The rigid transformation is used for the preliminary processing, and the non-rigid transformation allows for successful co-registration of both image sets. Our results were evaluated visually, and the MI indices were calculated. These algorithms allowed for image co-registration in different imaging sessions during the course of treatment.
In the last two decades deep learning has attracted a lot of attention internationally, solving problems in different application domains and achieving results beyond expectations. For example it has been applied in b...
详细信息
Saliency prediction can be treated as the activity of the human visual system (HVS). The most effective method should highly approximate the response of HVS to the perceived information. Motivated by that orientation ...
详细信息
ISBN:
(数字)9781728180687
ISBN:
(纸本)9781728180694
Saliency prediction can be treated as the activity of the human visual system (HVS). The most effective method should highly approximate the response of HVS to the perceived information. Motivated by that orientation selectivity (OS) mechanism occuring in primary visual cortex (PVC) tells us how the HVS extracts visual information for scene understanding, we propose a novel saliency model by combining an orientation selectivity based local feature called "excitement" map and a visual acuity based global feature called "acuity" map. Further, a saliency augmented operator based on visual error sensitivity is designed to enhance the saliency map. Experimental results on three benchmark databases demonstrate the superior performance of the proposed method compared to ten classical/ state-of-the-art algorithms.
Pedestrian detection is a popular research topic from the last decade. Most of the pedestrian identification models are based on face recognition algorithms. It is a difficult task to detect and track individual pedes...
详细信息
ISBN:
(纸本)9781450376754
Pedestrian detection is a popular research topic from the last decade. Most of the pedestrian identification models are based on face recognition algorithms. It is a difficult task to detect and track individual pedestrians based on these algorithms because there is a compulsion that their faces should be towards the camera. This limitation makes face recognition algorithms inefficient to detect pedestrians from the backsides. In this paper, we proposed a method for pedestrian detection using marker recognition. Multiple pedestrians are detected and then tracked based on markers attached to their backsides. Attaching markers on back-side of pedestrians helps to recognize them even when they are looking the other way. After the marker is recognized, the unique character related to that marker is displayed as a 3D object. This marker-based pedestrian detection is carried out using a mobile phone system and can be applied to embedded systems. The proposed method makes it possible to recognize up to three pedestrians located at different positions from the camera.
Computer vision as an important branch of computer science and artificial intelligence has made rapid progress in the past thirty years. Binocular stereo vision is one of the most important parts in computer vision. B...
详细信息
ISBN:
(数字)9781728183046
ISBN:
(纸本)9781728194608
Computer vision as an important branch of computer science and artificial intelligence has made rapid progress in the past thirty years. Binocular stereo vision is one of the most important parts in computer vision. Binocular stereo vision can well simulate human eyes stereoscopic perception of three-dimensional objects, and has been widely used in various fields of industrial automation production and practical life. Binocular stereo vision technology is a comprehensive technology, whose knowledge includes optics, physics, imageprocessing, computer, artificial intelligence and electronic technology, and other fields of content. Camera calibration and feature point matching are difficult and key points in binocular stereo vision technology. In this paper, the 3D reconstruction of binocular stereo vision based on feature point matching method is discussed. The main research includes the following contents: binocular stereo vision system principle, feature matching algorithm research and 3D reconstruction system implementation. This project puts forward a set of feasible algorithms and finally can get a good three-dimensional reconstruction effect based on the analysis and research of a series of algorithms.
暂无评论