The principal goal of this paper is to propose a technique that will allow extraction of human body parameters from uncalibrated image, having some known reference height. Analysis focuses on the algorithms for detect...
详细信息
The Neuroscience domain stands out from the field of sciences for its dependence on the study and characterization of complex, intertwining structures. Understanding the complexity of the brain has led to widespread a...
详细信息
ISBN:
(数字)9781728162515
ISBN:
(纸本)9781728162522
The Neuroscience domain stands out from the field of sciences for its dependence on the study and characterization of complex, intertwining structures. Understanding the complexity of the brain has led to widespread advances in the structure of large-scale computing resources and the design of artificially intelligent analysis systems. However, the scale of problems and data generated continues to grow and outpace the standards and practices of neuroscience. In this paper, we present an automated neuroscience reconstruction framework, called NeuroKube, for large-scale processing and labeling of neuroimage volumes. Automated labels are generated through a machine-learning (ML) workflow, with data-intensive steps feeding through multiple GPU stages and distributed data locations leveraging autoscalable cloud-native deployments on a multi-institution Kubernetes system. Leading-edge hardware and storage empower multiple stages of machine-learning, GPU-accelerated solutions. This demonstrates an abstract approach to allocating the resources and algorithms needed to elucidate the highly complex structures of the brain. We summarize an integrated gateway architecture, and a scalable workflow-driven segmentation and reconstruction environment that brings together image big data with state-of-the-art, extensible machine learning methods.
The increased use of digital video and imageprocessing technology has paved the way for extending the traffic enforcement applications to a wider range of violations as well as making the enforcement process more eff...
详细信息
ISBN:
(纸本)9789897583742
The increased use of digital video and imageprocessing technology has paved the way for extending the traffic enforcement applications to a wider range of violations as well as making the enforcement process more efficient. Automated traffic enforcement has mainly been applied towards speed and red light violations detection. In recent years, there has been an extension to other violation detection tasks such as seat-belt usage, tailgating and toll payment violations. In the recent years, automated driver cell phone usage violation detection methods have aroused considerable interest since it results in higher mortality rates than the intoxicated driving. In this study, we propose a novel automated technique towards driver's phone usage violation detection using deep learning algorithms. Using an existing license plate recognition camera system placed on an overhead gantry, installed on a highway, real world images are captured during day and night time. We performed experiments using more than 5900 real world images and achieved an overall accuracy of 90.8 % in the driver cell phone usage violation detection task.
In healthcare domain, there is persistent pressure to improve clinical outcomes while lowering costs. In this respect, healthcare organizations can leverage cloud computing resources to avoid building an expensive in-...
详细信息
Many volumetric rendering algorithms use spatial 3D grids as the underlying data structure. Efficient representation, construction, and traversal of these grids are essential to achieve real-time performance, particul...
详细信息
ISBN:
(数字)9781728192741
ISBN:
(纸本)9781728192758
Many volumetric rendering algorithms use spatial 3D grids as the underlying data structure. Efficient representation, construction, and traversal of these grids are essential to achieve real-time performance, particularly for time-varying data such as in fluid simulations. In this paper, we present improvements on algorithms for building and traversing Bounding Volume Hierarchies (BVH) designed for sparse volumes. Our main insight was to simplify data layout representation by grouping voxels in buckets, preserving their spatiality using Morton codes, instead of using bricks, as current solutions use. Our solution does not use pointers nor stacks, allowing for its usage directly on computing shaders and provides, on average, 9.3x improvement in construction speed, compared with state-of-the-art approaches for Linear Bounding Volume Hierarchies (LBVH).
Traditionally, the video panorama expansion algorithms were basically based on the assumption of the alignment of hole center and camera center. In real applications, due to hardware limitations, the borehole video im...
详细信息
ISBN:
(纸本)9781643680194;9781643680187
Traditionally, the video panorama expansion algorithms were basically based on the assumption of the alignment of hole center and camera center. In real applications, due to hardware limitations, the borehole video image in the actual resource survey and acquisition project was obtained under the condition that the center of the borehole and the camera were not aligned. To solve the difficult task of obtaining panoramic images from borehole imagers and eliminate distortion in the resultant image for centers unaligned situation, this paper proposes a new borehole video imageprocessing algorithm. Firstly, the image was expanded by using the ideal model (the aligned situation data), finding the corresponding circular region, and proper interpolating manipulation. The expanded image contains the cylinder hole wall information within certain depth scope but has some kind of distortion. Then by using least square fitting method and adopting the initial phase optimization estimation method, we obtained the high-order polynomial fitting depth direction correction curve. And the expanded images are corrected frame by frame. Lastly, the displacement of the horizontal direction is searched by the template region matching method, and the panoramic image of the borehole wall is obtained by pixel-level data fusion. The experimental results are evaluated. The peak signal-to-noise ratio of the algorithm is 21.0907 and the structural similarity is 0.7976 in terms of objective metric. While subjectively the average of the proposed algorithm is 3.6. Both objective and subjective measure are better than the existing algorithm. The resultant images are more intuitive than the original video.
The problem of unauthorized access to various content is an important task for the present. It becomes even more acute when using the existing toolkit for the processing and transmission of encrypted images with the f...
详细信息
The problem of unauthorized access to various content is an important task for the present. It becomes even more acute when using the existing toolkit for the processing and transmission of encrypted images with the fluctuation intensity function. The authors developed a modification of the RSA algorithm to use it in relation to the above -mentioned images. It is proposed a conceptual view for the joint use of quaternary fractional -linear transformations with elements of the basic RSA algorithm. The application of this mathematical apparatus in the basic RSA algorithm avoids contours of image objects in an encrypted sample. In addition, this combination provides additional stability to the basic RSA algorithm for unauthorized decryption. The simulation of the method was carried out in two described by the authors algorithms: using one line and four lines of the image matrix, for grayscale and color images. The high efficiency of the developed method for avoiding contours of objects on encrypted images has been confirmed. In both cases, contours after applying encryption procedures do not appear. The reverse procedure allows to get an image without visible distortion. 2019 The Authors. Published by Elsevier B.V.
Smart city applications covering a wide area such as traffic monitoring and pothole detection are gradually adopting more image machine learning algorithms utilizing ubiquitous camera sensors. To support such applicat...
详细信息
Recently, surveillance cameras are ubiquitous for both real-time monitoring and recording important moments. Temporarily seamless surveillance using multiple cameras requires increasing amount of human efforts and eno...
ISBN:
(数字)9781728151861
ISBN:
(纸本)9781728151878
Recently, surveillance cameras are ubiquitous for both real-time monitoring and recording important moments. Temporarily seamless surveillance using multiple cameras requires increasing amount of human efforts and enormous size of storage. The use of dynamic cameras further requires advanced computer vision algorithms, and is another challenge for intelligent visual surveillance. To solve those problems, we present an enhanced metadata extraction method for robust object search and a person re-identification. More specifically, the proposed method accurately extracts an object region using a modified DeepLab version 3, and then extracts metadata including representative color, size, aspect ratio, and moving trajectory of the object. The proposed metadata extraction method can be applied to a wide range of surveillance systems such as search for missing children in a large public space and crowd monitoring system.
LiDARs are usually more accurate than cameras in distance measuring. Hence, there is strong interest to apply LiDARs in autonomous driving. Different existing approaches process the rich 3D point clouds for object det...
详细信息
ISBN:
(数字)9781728141497
ISBN:
(纸本)9781728141503
LiDARs are usually more accurate than cameras in distance measuring. Hence, there is strong interest to apply LiDARs in autonomous driving. Different existing approaches process the rich 3D point clouds for object detection, tracking and recognition. These methods generally require two initial steps: (1) filter points on the ground plane and (2) cluster non-ground points into objects. This paper proposes a field-tested fast 3D point cloud segmentation method for these two steps. Our specially designed algorithms allow instantly process raw LiDAR data packets, which significantly reduce the processing delay. In our tests on Velodyne UltraPuck, a 32 layers spinning LiDAR, the processing delay of clustering all the 360° LiDAR measures is less than 1ms. Meanwhile, a coarse-to-fine scheme is applied to ensure the clustering quality. Our field experiments in public roads have shown that the proposed method significantly improves the speed of 3D point cloud clustering whilst maintains good accuracy.
暂无评论