image dehazing is an important research topic in the field of imageprocessing and computervision. image dehazing aims to remove haze in images and make image scenes clearer. image dehazing based on dark channel prio...
详细信息
With the development of image/video based 3D pose estimation techniques, service robots, human-computer interaction, and 3D somatosensory games have been developed rapidly. However, 3D pose estimation is still one of ...
详细信息
ISBN:
(纸本)9789811379864;9789811379857
With the development of image/video based 3D pose estimation techniques, service robots, human-computer interaction, and 3D somatosensory games have been developed rapidly. However, 3D pose estimation is still one of the most challenging tasks in computervision. On the one hand, diversity of poses, occlusion and self-occlusion, change in illumination, and complex background increase the complexity of human pose estimation. On the other hand, many application scenarios require high real-time performance for 3D pose estimation. Therefore, we present a 3D pose estimation method based on binocular vision in this paper. For each frame of the binocular videos, the human body is detected firstly;Then Stacked-Hourglass network is used to detect the human joints, and the pixel coordinates of the key joints of all the human bodies in the binocular images are obtained. Finally, with the calibrated camera internal parameters and external parameters, the 3D coordinates of the major joints in the world coordinate system are estimated. This method does not rely on 3D data sets for training. It only requires binocular cameras to perform 3D pose estimation. The experimental results show that the method can locate key joints precisely and the real-time performance is achieved in complex background.
The accurate understanding of occlusion region is critical for trustworthy estimation of optical flow to prevent the negative influence of occluded pixels on disocclusion regions. However, occlusion is the result of m...
详细信息
image segmentation plays a significant role in computervision and imageprocessing. In this paper, we proposed a novel Local Chan–Vese (LCV) image segmentation model. The new model combined classical LCV model with ...
详细信息
The presence of atmospheric turbulence over horizontal imaging paths introduces time-varying perturbations and blur in the scene that severely degrade the performance of moving object detection and tracking systems of...
详细信息
With many emerging function modules, such as image acquisition, front-end local processing, wireless transmission and so on, the smartphone becomes a major front-end hardware in the mobile-cloud computervision system...
详细信息
ISBN:
(纸本)9781538622902
With many emerging function modules, such as image acquisition, front-end local processing, wireless transmission and so on, the smartphone becomes a major front-end hardware in the mobile-cloud computervision system. However, due to the limitations of local resources and camera performance, there are many problems in image acquisition with smartphones. For example, the images are not as clear as those captured by professional camera equipment. And the performance of image acquisition is much more sensitive to background procedures and environment. These shortcomings have brought great challenges in terms of accuracy and delay in computervision. In this paper, the Resolution Adaptive Algorithm (RAA) is proposed to select the optimal resolution for image acquisition in different situations. Furthermore, in order to improve the efficiency of local resources and reduce the processing delay, a low-quality image filtered method is presented to delete the invalid images. In our experiment, the average delay between image acquisition and display is about 100ms, which meets the requirement of detection in real time.
Motivated by the problems of vision-based mobile robot map building and localization, we present a comparative study of statistical methods for matching image features in a wide base line between learning and recognit...
详细信息
ISBN:
(纸本)9781538695760
Motivated by the problems of vision-based mobile robot map building and localization, we present a comparative study of statistical methods for matching image features in a wide base line between learning and recognition phases. A general methodology called feature-class method for the problem of fast matching image features in a wide base line is described in the context of mobile robots. The objective of this work is to discuss and to show the performance of such methods in an example of visual SLAM, with experiments done with real data.
In video surveillance, person re-identification (re-id) is a popular technique to automatically finding whether a person has been already seen in a group of cameras. In the recent years, availability of large-scale da...
详细信息
This paper presents a vision-based surface roughness evaluation system for end-milled metals, addressing digital reconstruction and calibration of inspected surfaces, and quantitative and qualitative evaluation of sur...
详细信息
This paper presents a vision-based surface roughness evaluation system for end-milled metals, addressing digital reconstruction and calibration of inspected surfaces, and quantitative and qualitative evaluation of surface texture. Specimens with different levels of surface roughness are machined, and a comparison between stylus-based and vision-based measurements is performed while using standard roughness parameters. The vison-based results vary among 9% and 11% compared to the stylus-based ones, which is a minor error to trade-off for faster measurements. Furthermore, surface texture evaluation is performed by detecting the generated cusp lines and tool marks on the machined surface. The tool marks' distribution is analysed in order to determine whether the machining is performed under optimal cutting conditions. Results show that under optimal cutting conditions, the detected tool marks are normally distributed along the feed direction and the distance between two consecutive tool marks does not vary significantly. Based on the proposed methods software is implemented that enables the three-dimensional reconstruction, calibration and evaluation of the inspected surface.
A significant increase in research of human activity recognition can be seen in recent years due to availability of low-cost RGB-D sensors and advancement of deep learning algorithms. In this paper, we augmented our p...
详细信息
暂无评论