Did you already imagine how would it be to watch a sport match without sounds? You would miss all this specific sport related sounds but also mostly miss a big part of the atmosphere present in the stadium, that is pa...
详细信息
ISBN:
(纸本)9798400705243
Did you already imagine how would it be to watch a sport match without sounds? You would miss all this specific sport related sounds but also mostly miss a big part of the atmosphere present in the stadium, that is particular to live events. This is what happens to most Deaf and Hard of Hearing persons. Towards Tokyo 2025 Deaflympics, we developed an AI-based system able to recognize sounds and players motion to render in realtime sound related Onomatopoeia over the match video as one could see in Comics or Manga.
Sediment plumes are generated from both natural and human activities in benthic environments, increasing the turbidity of the water and reducing the amount of sunlight reaching the benthic vegetation. Seagrasses, whic...
详细信息
ISBN:
(数字)9781510661714
ISBN:
(纸本)9781510661707;9781510661714
Sediment plumes are generated from both natural and human activities in benthic environments, increasing the turbidity of the water and reducing the amount of sunlight reaching the benthic vegetation. Seagrasses, which are photosynthetic bioindicators of their environment, are threatened by chronic reductions in sunlight, impacting entire aquatic food chains. This research uses UAV aerial video and imagery to investigate the characteristics of sediment plumes generated by a model of anthropogenic disturbance. The extent, speed and motion of the plumes were assessed as these parameters may pertain to the potential impacts of plume turbidity on seagrass communities. In a case study using UAV video, the turbidity plume was observed to spread over 250 feet over 20 minutes of the UAV campaign. The directional speed of the plume was estimated to be between 10.4 and 10.6 ft/min. This was corroborated by observation of greatest plume turbidity and sediment load near the location of disturbance and diminishing with distance. Further temporal studies are necessary to determine long-term, if any, impacts of human activity-generated sediment plumes on seagrass beds.
This paper presents the design and implementation of a camera surveillance picture quality inspection system. The system assesses the video stream from surveillance cameras and provides immediate feedback on image qua...
详细信息
video surveillance requires simultaneous monitoring of multiple areas. Consequently, real-time automatic change detection of the monitored areas becomes very important. In the context of wide field-of-view conditions,...
详细信息
In this paper, the 3D space imaging model of machine vision is constructed. Starting from the traditional machine vision imageprocessing algorithm flow, the image denoising process and target tracking process are opt...
详细信息
real-timevideo and imageprocessing are used in various industrial, medical, consumer electronics and embedded device applications. These applications typically demonstrate an increasing demand for computing power an...
详细信息
ISBN:
(纸本)9783031585012;9783031585029
real-timevideo and imageprocessing are used in various industrial, medical, consumer electronics and embedded device applications. These applications typically demonstrate an increasing demand for computing power and system complexity. Hence, edge detection is the most common and widely used technique in image or videoprocessing applications. Several traditional canny edge detection methods use fixed thresholding techniques to compare the pixel values. This sacrifices the edge detection performance and increases the computational complexity. Hence, the Canny Edge detection algorithm is preferred to enhance the image quality with reduced complexity. They adjust the quality of the image by manipulating the Sigma and Threshold parameters and detect the edges accurately by eliminating the noise. The reconfigurable canny edge detection algorithm presents a procedure for detecting edges without multipliers. The new algorithm uses a low-complex, non-uniform histogram gradient to compute thresholds and variable sigma values that replace the add and shift operator instead of multipliers to reduce the area and sigma. The simulation is done in the ModelSim platform using VHDL code which results in the output of bit sequences. By comparing the results of the reconfigurable canny edge detection and traditional algorithm, the new algorithm's performance can be observed with improvements of around 21% and 80% for consumed power and delay parameters respectively.
Monocular depth estimation algorithms aim to explore the possible links between 2D and 3D data, but challenges remain for existing methods to predict consistent depth from a casual video. Relying on camera poses and t...
详细信息
ISBN:
(纸本)9798400701788
Monocular depth estimation algorithms aim to explore the possible links between 2D and 3D data, but challenges remain for existing methods to predict consistent depth from a casual video. Relying on camera poses and the optical flow in the time-consuming testtime training phases makes these methods fail in many scenarios and cannot be used for practical applications. In this work, we present a data-driven post-processing method to overcome these challenges and achieve online processing. Based on a deep recurrent network, our method takes the adjacent original and optimized depth map as inputs to learn temporal consistency from the dataset and achieves higher depth accuracy. Our approach can be applied to multiple single-frame depth estimation models and used for various real-world scenes in real-time. In addition, to tackle the lack of a temporally consistent video depth training dataset of dynamic scenes, we propose an approach to generate the training video sequences dataset from a single image based on inferring motion field. To the best of our knowledge, this is the first datadriven plug-and-play method to improve the temporal consistency of depth estimation for casual videos. Extensive experiments on three datasets and three depth estimation models show that our method outperforms the state-of-the-art methods.
The article proposes an algorithm for processing parallel analysis of visual data obtained by a machine vision system, recorded information in the human visible spectrum, and information received by a range camera. An...
详细信息
ISBN:
(数字)9781510661714
ISBN:
(纸本)9781510661707;9781510661714
The article proposes an algorithm for processing parallel analysis of visual data obtained by a machine vision system, recorded information in the human visible spectrum, and information received by a range camera. An algorithm for the formation of stable features as elements of the human body, head and pupils of a person and parallel tracking of their increment is proposed. To highlight trend lines in element displacement and eliminate the high frequency component based on a combined criterion. The image is preliminarily processed to reduce the effect of the noise component based on a multi-criteria objective function. As test data used to evaluate the effectiveness, a video stream with a resolution of 1024x768 (8-bit, color image, visible range), 3D data, and expert evaluation data are used.
This research addresses urban parking challenges by allowing users to reserve parking spaces via a mobile app. The system integrates automated barriers and AI-powered cameras for accurate license plate recognition, en...
详细信息
Aiming at technical advantages of quickly discover and real-time tracking focused on targets with UAV video, we propose a multi-object tracking method based on spatial constraints. Utilizing the pre-training model of ...
详细信息
暂无评论