ISBN (print): 9798350362077
Efficient video transmission enables a wide range of applications in underwater environments, such as seabed survey, subsea equipment maintenance, oil pipe/bridge inspection, and marine life sample collection. At present, real-time underwater video transmission over underwater acoustic communication is commonly believed to be challenging because of complex underwater environments and the limitations of underwater acoustic communication. In this paper, we propose an adaptive real-time underwater video transmission system using underwater communication. The system consists of three modules: a video pre-processing module, a video transmission module, and a video post-processing module. In the first two modules, the sender adaptively adjusts the compression bitrate and transmission rate according to the video quality and channel conditions. In the third module, a deep learning-based video reconstruction algorithm is used to recover underwater image information. The efficacy of the system is verified on real underwater videos collected in several sea areas. The results show that the proposed system can transmit video successfully and efficiently in the underwater environment.
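The sender-side adaptation described in this abstract can be illustrated with a small sketch. The abstract does not give the actual control rule, so everything below is an assumption made for illustration: the `ChannelState` fields, the `quality_score` input, the safety margin, and the way the bit budget is shared are all hypothetical and are not the authors' method.

```python
from dataclasses import dataclass
from typing import Tuple


@dataclass
class ChannelState:
    """Hypothetical snapshot of the acoustic link as estimated by the sender."""
    capacity_bps: float  # estimated usable throughput, bits per second
    packet_loss: float   # observed packet loss ratio in [0, 1]


def choose_rates(channel: ChannelState,
                 quality_score: float,
                 min_bitrate_bps: float = 2_000.0,
                 safety_margin: float = 0.8) -> Tuple[float, float]:
    """Return (compression_bitrate_bps, transmission_rate_bps).

    quality_score is a hypothetical value in [0, 1] describing how good the
    recently encoded video looks; a low score requests more bits.
    """
    if not 0.0 <= channel.packet_loss <= 1.0:
        raise ValueError("packet_loss must be in [0, 1]")
    if not 0.0 <= quality_score <= 1.0:
        raise ValueError("quality_score must be in [0, 1]")

    # Transmission rate: stay below the estimated capacity and back off
    # in proportion to the observed packet loss.
    tx_rate = channel.capacity_bps * safety_margin * (1.0 - channel.packet_loss)

    # Compression bitrate: use half of the budget when quality is already
    # good and up to the full budget when quality is poor.
    share = 0.5 + 0.5 * (1.0 - quality_score)
    bitrate = max(min_bitrate_bps, tx_rate * share)

    # The encoder is never asked for more than the link can carry, so the
    # floor above only applies when the link can sustain it.
    bitrate = min(bitrate, tx_rate)
    return bitrate, tx_rate


if __name__ == "__main__":
    link = ChannelState(capacity_bps=10_000.0, packet_loss=0.1)
    print(choose_rates(link, quality_score=0.4))
```

The point of the sketch is only the coupling: both rates are derived from the same channel estimate, and the compression bitrate is additionally steered by video quality.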
ISBN (print): 9781728198354
Remote control vehicles require the transmission of large amounts of data, and video is one of the most important sources of information for the driver. To ensure reliable video transmission, the encoded video stream is transmitted simultaneously over multiple channels. However, this solution incurs a high transmission cost. To address this issue, more efficient video encoding methods are needed that make the video stream robust to noise. Moreover, such methods should have low complexity to meet real-time requirements. In this paper, we propose a low-complexity, low-latency 2-channel Multiple Description Coding (MDC) solution with an adaptive Instantaneous Decoder Refresh (IDR) frame period and adaptive redundancy adjustment, which is compatible with the HEVC standard. This method shows better resistance to high packet loss rates with lower complexity.
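To make the idea of two descriptions concrete, here is a minimal sketch of one common MDC variant, temporal splitting, in which even-indexed and odd-indexed frames travel on separate channels. This is an assumed illustration: the abstract does not say how the descriptions are formed, and the sketch models neither the adaptive IDR period nor the redundancy adjustment. The `Frame` type and both function names are hypothetical.

```python
from typing import List, Optional, Sequence, Tuple

Frame = str  # stand-in for an encoded frame payload


def split_descriptions(frames: Sequence[Frame]) -> Tuple[List[Frame], List[Frame]]:
    """Split a frame sequence into two descriptions (even and odd indices)."""
    return list(frames[0::2]), list(frames[1::2])


def merge_descriptions(desc0: Sequence[Optional[Frame]],
                       desc1: Sequence[Optional[Frame]]) -> List[Optional[Frame]]:
    """Interleave two received descriptions back into display order.

    None marks a frame lost in transit. A lost frame is concealed by
    repeating the most recent frame that was received (frame repetition).
    Assumes desc0 and desc1 have the lengths produced by split_descriptions.
    """
    merged: List[Optional[Frame]] = []
    last: Optional[Frame] = None
    for i in range(len(desc0) + len(desc1)):
        source = desc0 if i % 2 == 0 else desc1
        frame = source[i // 2]
        if frame is None:
            frame = last  # conceal the loss with the previous frame
        merged.append(frame)
        last = frame
    return merged


if __name__ == "__main__":
    d0, d1 = split_descriptions(["f0", "f1", "f2", "f3", "f4"])
    # Channel 1 drops everything; playback continues at half frame rate.
    print(merge_descriptions(d0, [None] * len(d1)))
```

With both channels intact the original sequence is recovered; when one channel fails, the other description still yields a watchable stream at reduced temporal resolution.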
Fire smoke needs early detection and accurate identification to protect people's lives and property, but manual monitoring has problems such as high time consumption and subjective misjudgment, so an ef...
ISBN (print): 9781510673199; 9781510673182
Background: The evolution of AI applications in dental imaging, covering caries detection, anatomical structure segmentation, and pathology identification, highlights the importance of high-quality datasets for effective detection models. This paper focuses on optimizing dataset quality for real-time AI-based dental bitewing radiograph detection. Methods: We systematically analyze preprocessing methods suitable for dental bitewing radiographs, covering image enhancement, noise reduction, and contrast adjustment. These techniques are strategically chosen to address common challenges in dental radiograph images, including variations in lighting, contrast disparities, and noise fluctuations. We employ optimized algorithms to meet real-time constraints, ensuring efficient model training and inference. Results: Our study assesses the impact of each preprocessing step on dataset quality and its influence on AI model performance. Practical recommendations are provided to empower researchers and practitioners in creating datasets optimized for dental bitewing radiograph detection tasks, aiming to improve AI model accuracy while adhering to real-time requirements. In addition, a comparative analysis is conducted, evaluating datasets enhanced using conventional methods against the ResNet18 model for the segmentation of bitewing dental images. Conclusion: This paper serves as a valuable guide for the dental imaging community, offering insights into preprocessing steps that elevate dataset quality for AI-driven dental bitewing radiograph detection. By emphasizing the relevance of real-time performance and providing a comparison with conventional enhancements on the ResNet18 model, we contribute to advancing early diagnosis and enhancing oral healthcare outcomes.
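As an illustration of the contrast adjustment step mentioned in this abstract, the following is a minimal sketch of linear contrast stretching on an 8-bit grayscale image. The choice of this particular technique, the list-of-lists image representation, and the function name are assumptions for illustration; the abstract does not specify which algorithms the authors use.

```python
from typing import List

Image = List[List[int]]  # 8-bit grayscale image, one inner list per row


def stretch_contrast(image: Image) -> Image:
    """Linearly rescale pixels so the darkest maps to 0 and the brightest to 255."""
    flat = [p for row in image for p in row]
    if not flat:
        return [row[:] for row in image]  # no pixels: return an unchanged copy
    lo, hi = min(flat), max(flat)
    if lo == hi:
        return [row[:] for row in image]  # uniform image: nothing to stretch
    span = hi - lo
    return [[round((p - lo) * 255 / span) for p in row] for row in image]


if __name__ == "__main__":
    low_contrast = [[50, 100], [150, 200]]
    print(stretch_contrast(low_contrast))
```

A plain min/max stretch is sensitive to single outlier pixels; a percentile-based variant is a common refinement, but it is not shown here.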
ISBN (print): 9781510673199; 9781510673182
Augmented reality is a visualization technology that displays information by adding virtual images to the real world. Effective implementation of augmented reality requires recognition of the current scene. Identifying objects in real-time video on computationally limited hardware requires significant effort. One way to solve this problem is to create a hybrid system that, based on machine learning and computer vision technology, processes and analyzes visual data to identify and classify real-world objects. The proposed architecture is based on a combination of the Vuforia augmented system, which provides good performance by balancing prediction accuracy and efficiency. First, the Vuforia neural network architecture allows convenient interaction with AR in Unity and provides initial conditions for detecting 3D objects. The augmented reality construction algorithm is based on the ARCore framework and the OpenGL interface for embedded systems. The system integrates recognition data with an AR platform to display corresponding 3D models, allowing users to interact with them through the functionality of the AR application. This method also involves the development of an enhanced user interface for AR, making the augmented environment more accessible for navigation and control. Experimental research has shown that the proposed method significantly improves the accuracy of object recognition and the ease of working with 3D models in AR.
ISBN (print): 9798350349405; 9798350349399
In recent years, there has been growing interest among researchers in the analysis of sports activities, driven by advances in machine learning and the increased availability of public data. However, there remains a scarcity of comprehensive sports video datasets that possess the necessary attributes to address various research tasks effectively. We present the "Badminton Benchmark" (BMT-BENCH) to facilitate reproducible machine learning research in the sports domain. This dataset comprises high-quality, high-speed video clips collected from official badminton tournaments involving two team players. The dataset includes both labeled and unlabeled data, catering to different research problems such as video generation and real-time object detection. We feature a baseline system mainly for video generation tasks and provide a thorough evaluation of the challenges posed by the dataset's unique nature. The dataset is publicly accessible at https://***/drive/folders/1moYDb8tp5K-VDxPJU3sTorfYE7NnwVpf?usp=sharing and the baseline system is available at https://***/ziangshi/BMT_BENCH_baseline_repo.
ISBN (print): 9798400709784
Emergency incidents on land or at sea are characterized by uncertainty in time and place. The aeronautical satellite image and video emergency transmission system offers high mobility and a wide geographic service coverage area. It therefore has a great advantage over other communication methods for real-time emergency image and video transmission, incident rescue, and other remote command tasks. To ensure the performance of the system under strong artificial jamming and fading, we propose a new design of the aeronautical satellite image and video emergency transmission system based on interference mitigation for strong artificial jamming and multiple channel fading. The mitigation method, which is based on an adaptive antenna array, is expected to be very effective in reducing the influence of strong artificial jamming and fading on the performance of the system.
ISBN (print): 9781510673199; 9781510673182
We present a real-time system for vehicle detection and classification at road intersections, incorporating image processing techniques. This system estimates the traffic flow at a specific point, as it is capable of recognizing the trajectories of different vehicles at an intersection and inferring whether they leave or enter the city. It is designed to be integrated into a high-fidelity digital twin, aiding in estimating environmental traffic pollutants. Since Computational Fluid Dynamics (CFD) models use estimators such as average or aggregate measurements, we use more accurate methods to estimate pollution. The implications of our study are significant for urban planning and traffic management. It allows for immediate decisions and informs long-term infrastructure planning by providing a deep understanding of intersection dynamics. Our research offers a comprehensive perspective on traffic analysis, introducing data-driven traffic management strategies for efficient urban mobility. The code developed for this purpose can be found at https://***/capo-urjc/TrackingSORT
ISBN (print): 9798331529543; 9798331529550
Versatile Video Coding (VVC) provides new coding tools for more efficient intra prediction but with a substantial increase in computational complexity. This paper introduces vectorized kernels for 8-bit angular intra prediction and position-dependent intra prediction combination (PDPC), which are carefully optimized for all block sizes and prediction modes of VVC. The proposed kernels streamline the filtering process and utilize optimized memory access patterns. Our standalone tests show that the proposed vectorization achieves speedups of 6.68x for luma and 4.40x for chroma predictions over scalar implementations. Integrating these kernels into the practical uvg266 VVC encoder provides speedups of 1.07x in the slowest configuration and 1.68x in the fastest configuration. The reported speedups are obtained without any coding overhead, so the proposed vectorization plays an integral role in pursuing real-time VVC coding with high coding efficiency.
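The gap between the kernel speedups and the whole-encoder speedups reported above is what Amdahl's law would predict when the accelerated kernels account for only part of the runtime. The sketch below solves Amdahl's law for that share. It is a back-of-the-envelope reading and not a result from the paper: it assumes a single uniform kernel speedup (the luma figure of 6.68x, ignoring the 4.40x chroma figure), and the function name is hypothetical.

```python
def accelerated_fraction(overall_speedup: float, kernel_speedup: float) -> float:
    """Solve Amdahl's law for p, the runtime share spent in the accelerated part.

    Amdahl's law: overall = 1 / ((1 - p) + p / s), with s = kernel_speedup.
    Rearranged:   p = (1 - 1 / overall) / (1 - 1 / s).
    """
    if kernel_speedup <= 1.0:
        raise ValueError("kernel_speedup must be greater than 1")
    if not 1.0 <= overall_speedup <= kernel_speedup:
        raise ValueError("overall_speedup must lie between 1 and kernel_speedup")
    return (1.0 - 1.0 / overall_speedup) / (1.0 - 1.0 / kernel_speedup)


if __name__ == "__main__":
    # Figures taken from the abstract; the uniform 6.68x kernel speedup
    # is a simplifying assumption.
    for overall in (1.07, 1.68):
        share = accelerated_fraction(overall, kernel_speedup=6.68)
        print(f"overall {overall:.2f}x -> accelerated share of about {share:.1%}")
```

Under these assumptions, intra prediction would account for roughly 8% of the scalar runtime in the slowest configuration and roughly 48% in the fastest, which is consistent with faster presets spending relatively less time on other encoder stages.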
The quality of images and videos plays a vital role in real-time systems. Images captured without sufficient illumination have a low dynamic range and a high propensity for noise. The...