A practical way for obtaining depth in computervision is the use of structured light systems. For panoramic depth reconstruction several images are needed which most likely implies the construction of a sensor with m...
详细信息
Speaker recognition systems perform better when clean speech signals are used for the task. In the presence of high levels of background noise, speech recorded from a close speaking microphone will be degraded and hen...
详细信息
Speaker recognition systems perform better when clean speech signals are used for the task. In the presence of high levels of background noise, speech recorded from a close speaking microphone will be degraded and hence the performance of the speaker recognition system. Use of a transducer held at the throat results in a signal that is clean even in a noisy environment. This paper discusses the prospect of using such signals for speaker recognition. A study of a text-independent speaker recognition system based on features extracted from speech simultaneously recorded using a throat microphone and a close-speaking microphone in clean and simulated noisy conditions is conducted. Autoassociative neural networks are used to model the speaker characteristics based on the vocal tract system and excitation source features represented by weighted linear prediction cepstral coefficients and linear prediction residual, respectively. The results of experimental studies show that the speech collected from the throat microphone can be used for tasks like speaker recognition, especially in noisy conditions.
Current acoustic localization systems for Autonomous Underwater Vehicles (AUV) rely on acoustic beacons at known positions. We present a method that combines rangeonly acoustic measures and odometry data into a factor...
Current acoustic localization systems for Autonomous Underwater Vehicles (AUV) rely on acoustic beacons at known positions. We present a method that combines rangeonly acoustic measures and odometry data into a factor graph structure, able to compute the AUV localization while estimating the position of the acoustic beacons without having previous information about their position. We designed several missions and carried them out in a real environment to gather data to test our algorithm offline. The output of this work is a pose-graph SLAM algorithm that combines the AUV internal measurements (i.e. DVL, IMU, and depth) with acoustic ranges from static beacons to estimate both the AUV state and the beacon’s position.
The use of omnidirectional cameras underwater is enabling many new and exciting applications in multiple fields. Among these, the creation of virtual tours from omnidirectional image surveys is expected to have a larg...
详细信息
ISBN:
(纸本)9781479987375
The use of omnidirectional cameras underwater is enabling many new and exciting applications in multiple fields. Among these, the creation of virtual tours from omnidirectional image surveys is expected to have a large impact in terms of science and conservation outreach. These surveys can be performed by Remotely Operated Vehicles (ROVs) and Autonomous Underwater Vehicles (AUVs) that can cover large marine areas with precise navigation. Virtual tours are relevant in zones of special interest such as shipwrecks or underwater nature reserves for both scientists and the general public. This paper presents the first results of surveys carried out by an AUV equipped with an omnidirectional underwater camera, and explores the process of automatically creating virtual tours from the most relevant images of the datasets.
This research presents a unique approach for monitoring the large-scale grid-connected photovoltaic modules in solar power plants using state-of-art object detection YOLOv5 algorithm and classical image processing tec...
详细信息
Building accurate bathymetries of the seabed has been a focus of study in the last decade. For this purpose seabed point cloud registration has been a focus for some researchers. Some of this registration methods are ...
详细信息
Building accurate bathymetries of the seabed has been a focus of study in the last decade. For this purpose seabed point cloud registration has been a focus for some researchers. Some of this registration methods are based on gathering the points of the cloud that contain more information for the registration (i.e. the ones that flat or smooth, normally being the seabed) and using them as part of ICP-derived methods. For this point picking purpose, we present a segmentation technique that distinguish between objects (interesting for registration) and ground (smooth and not interesting for registration). The method proposed here uses difference of normals for object's border detection and a variation of the Density-Based Spatial Clustering of Application with Noise for object clustering. Once the objects boundaries are detected and the points are clustered the rest of the points are classified as object or ground. This classification is done by taking all the points that lie within the object's border and checking it's depth compare to its closes border point. The method is evaluated using a multi-beam dataset gathered on the La Lune shipwreck, a site of archaeological interest.
Registering 2 or more range scans is a fundamental problem, with application to 3D modeling. While this problem is well addressed by existing techniques such as ICP when the views overlap significantly at a good initi...
详细信息
Registering 2 or more range scans is a fundamental problem, with application to 3D modeling. While this problem is well addressed by existing techniques such as ICP when the views overlap significantly at a good initialization, no satisfactory solution exists for wide baseline registration. We propose here a novel approach which leverages contour coherence and allows us to align two wide baseline range scans with limited overlap from a poor initialization. Inspired by ICP, we maximize the contour coherence by building robust corresponding pairs on apparent contours and minimizing their distances in an iterative fashion. We use the contour coherence under a multi-view rigid registration framework, and this enables the reconstruction of accurate and complete 3D models from as few as 4 frames. We further extend it to handle articulations, and this allows us to model articulated objects such as human body. Experimental results on both synthetic and real data demonstrate the effectiveness and robustness of our contour coherence based registration approach to wide baseline range scans, and to 3D modeling.
The purpose of this work is to analyse what happens to the surface information when the image resolution is modified. We deduce how the same surface appears if seen from different distances. Using 4-source Colour Phot...
详细信息
In this work, we describe an automated quality assurance system for pipes in warehouses and yards using simple handheld and mobile equipment like smartphone cameras. Currently, quality inspection for bent and crooked ...
详细信息
Catadioptric sensors are combinations of mirrors and lenses made in order to obtain a wide field of view. In this paper we propose a new sensor that has omnidirectional viewing ability and it also provides depth infor...
详细信息
Catadioptric sensors are combinations of mirrors and lenses made in order to obtain a wide field of view. In this paper we propose a new sensor that has omnidirectional viewing ability and it also provides depth information about the nearby surrounding. The sensor is based on a conventional camera coupled with a laser emitter and two hyperbolic mirrors. Mathematical formulation and precise specifications of the intrinsic and extrinsic parameters of the sensor are discussed. Our approach overcomes limitations of the existing omni-directional sensors and eventually leads to reduced costs of production.
暂无评论