Infrared imaging technology is widely used in military and civilian fields, but in practical applications, accurate and effective detection and tracking of infrared small targets is a bottleneck problem that needs to ...
详细信息
The non-contact heart rate detection system avoids direct contact between the sensor and the skin, improving portability, comfort and real-time heart rate monitoring. This paper presents an embedded-based non-contact ...
详细信息
In geological exploration, the texture and context information generated by different rocks on the inner wall of the drilling hole is of great significance to understand the geological condition. Drilling camera techn...
详细信息
Holography creates three-dimensional (3D) images with depth and relief, and can be applied to medicine, engineering, and entertainment, enabling holographic communication, live streaming, and virtual gatherings. Digit...
详细信息
ISBN:
(纸本)9798350354744;9798350354737
Holography creates three-dimensional (3D) images with depth and relief, and can be applied to medicine, engineering, and entertainment, enabling holographic communication, live streaming, and virtual gatherings. Digital holography involves using digital displays, cameras, and beams. This paper discusses various holographic systems, including computational holography, which uses computers to produce 3D images, Deep Holography (DH), which utilizes deep neural networks (DNN), and tensor holography, which combines DNN and machine learning (ML) to produce 3D images in mid-air. This paper also addresses network challenges, such as the high data transmission, bandwidth limitations that can hinder the quality of the holographic image, and the low latency crucial for maintaining real-time interaction in holographic communication. While some scholars introduce the capacities of 6G networks, other authors propose a compression and differentiated prioritization technique. Holography can create striking images using convolutional neural networks (CNN) and an anti-aliasing double-phase (AA-DPM) method. Based on the communication performance test that has been carried out, it can be concluded that ensuring access to powerful and optimized hardware and software is crucial for both text and holographic image generation. As the complexity and length of the text increase, processingtime augments as well. There are significant differences between using powerful servers and weaker hardware. Finally, it can be concluded that it is imperative to prioritize hardware and software for image generation to facilitate a smooth conversation.
In the realm of videoprocessing and analysis, accurate prediction of future frames is crucial in applications like video compression, anomaly detection and augmented reality. This paper introduces a novel approach th...
详细信息
Robotic surgery requires endoscope 3D tracking to navigate the endoscope in the body. This paper proposes an accurate multiscale selective fusion framework to register 2D endoscopic videoimages to 3D pre-operative CT...
详细信息
ISBN:
(纸本)9781665405409
Robotic surgery requires endoscope 3D tracking to navigate the endoscope in the body. This paper proposes an accurate multiscale selective fusion framework to register 2D endoscopic videoimages to 3D pre-operative CT data for endoscope 3D tracking. Current video-based 3D tracking depends on the performance of the 2D-3D fusion procedure that suffers from inaccurate similarity and image uncertainties. To boost video-based 3D tracking, we develop multiscale selective similarity characterization to enhance the 2D-3D fusion procedure. Such fusion not only uses image pyramids in multiple scales to represent endoscopic images but also selects specific structure information from these multiscale images to compute the similarity. We validated our method on clinical data. Our method can reduce the current tracking error from 8.9 to 5.4 mm without using any external trackers, while it provides surgeons with robust real-time surgical 3D tracking.
Aiming at the problem that pointer instrument detection algorithm has slow locating speed and low realtime performance in edge equipment, this paper proposes a pointer instrument video detection method based on impro...
详细信息
The proceedings contain 106 papers. The topics discussed include: macro-AUC-driven active learning strategy for multi-label classification enhancement;mitigating privacy threats without degrading visual quality of VR ...
ISBN:
(纸本)9798350351422
The proceedings contain 106 papers. The topics discussed include: macro-AUC-driven active learning strategy for multi-label classification enhancement;mitigating privacy threats without degrading visual quality of VR applications: using re-identification attack as a case study;attenuation-aware weighted optical flow with medium transmission map for learning-based visual odometry in underwater terrain;GeoVQA: a comprehensive multimodal geometry dataset for secondary education;pulse of the crowd: quantifying crowd energy through audio and video analysis;automated recognition of optic disc and blood vessels in diabetic fundoscopy images using real-timeimage analysis;GeoSecure-B: a method for secure bearing calculation;and exploiting correlation between facial action units for detecting deepfake videos.
Conventional vehicle counting using various techniques such as manual counts are no longer efficient in the era of industrial revolution 4.0. The algorithm within the intelligence system using a realtimevideo and im...
详细信息
ISBN:
(纸本)9781510691704
Conventional vehicle counting using various techniques such as manual counts are no longer efficient in the era of industrial revolution 4.0. The algorithm within the intelligence system using a realtimevideo and imageprocessing technique is proposed due to its reliability, efficiency, cost effectiveness and safety for gathering data. Surveillance cameras commonly installed in large cities could be used to obtain traffic data recording, allowing for an automated system to be easily adopted at minimal cost. This study provides an alternative and economical means to estimate traffic density via video-imageprocessing which adopts OpenCV in the Python code. This method only requires a fixed video camera be positioned at an elevated position such as on a pedestrian bridge or a light pole. The images are processed automatically through OpenCV code bindings in Python. The system requires frames from the video to be captured so background subtraction can be performed to detect and count the vehicles using Gaussian Mixture Model. The classification of vehicles by size is done by comparing the contour areas to the assumed values. The proposed algorithm can be adapted to meet the requirements of the user and the camera’s position. The algorithm allows traffic data to be obtained, which may assist local authorities make decisions regarding urban planning and the design of transportation systems. Sample videos of traffic scenes were used to compare the detection and classification of vehicles. Results from the proposed algorithm were compared with manual count results from the field. Analysis of the classification and volume count of vehicles using the proposed algorithm is shown to have an error rate of 1.3% compared to an error rate of 6.4% using the manual tally counter method. The results confirmed that the proposed automatic counting system performed better when compared to the manual tally counter method with the additional benefits of increase cost efficiency and impr
The proceedings contain 114 papers. The topics discussed include: application of artificial neural networks for processing some biomedical data;distinguishing between AI images and realimages with hybrid image classi...
ISBN:
(纸本)9798350387568
The proceedings contain 114 papers. The topics discussed include: application of artificial neural networks for processing some biomedical data;distinguishing between AI images and realimages with hybrid image classification methods;securing Durres Port's digital transformation: cybersecurity strategy for maritime industry;linguistic encryption for underwater communication;a toolset for blood pressure visualization and measurement in time, frequency and time-frequency domains;using a shape from polarization to determine the 3D surface of objects with thermal radiation;on the influence of cell libraries and other parameters to SCA resistance of crypto IP cores;integration of PXROS-HR with micro-ROS in robotic systems;and traffic-aware video streaming topology reconfiguration for smart city applications.
暂无评论