In this study, we propose a parallel processing method for analyzing video-image radiation-response signals and suppressing radiation noise. We studied the linear-representation law of various image-information compon...
详细信息
In this study, we propose a parallel processing method for analyzing video-image radiation-response signals and suppressing radiation noise. We studied the linear-representation law of various image-information components on the radiation dose rate. Subsequently, the simu-lation images were used to examine the response-signal extract and radiation-noise suppression. The results indicate that the majority of response signals in the global image comprise forward superposition. The peak signal-to-noise ratio of the red channel was significantly improved when the noise signal-substitution algorithm and median filter were applied successively. real-time radiation dose-rate measurements and clear images under irradiation can be obtained simultane-ously.
This paper introduces a novel approach for assessing piano performance through video analysis using Dynamic time Warping (DTW). Traditional methods of evaluating piano playing often rely on auditory cues or sheet musi...
详细信息
With the ageing of the social population, in order to avoid elderly people living alone at home because of accidental falls and not timely treatment, this article put forward a computer vision based on the elderly fal...
详细信息
The proceedings contain 31 papers. The topics discussed include: region based image contrast enhancement depending on local dominant color component;a new cosine hyperbolic window function-based FIR filter design for ...
ISBN:
(纸本)9798350367157
The proceedings contain 31 papers. The topics discussed include: region based image contrast enhancement depending on local dominant color component;a new cosine hyperbolic window function-based FIR filter design for audio to spectrogram conversion;primary target detection method based on connected domain filtering in millimeter-wave radar;preprocessing of pure tone audiometry data and design of machine learning models for hearing loss classification;U-Net with dense connections in encoder layers for processing defocused fingerprint images;intelligent fault diagnosis and health monitoring system for on-orbit FPGA critical components;faster than real-time detection of shot boundaries, sampling structure and dynamic keyframes in video;and research on measuring respiratory rate and volume using thermal cameras.
In this paper, a method of detecting flame from video stream is proposed exploiting the characteristics of the disordered movement, rapid deformation and intense colour of the flame. Firstly, the frame difference betw...
详细信息
In this paper, a method of detecting flame from video stream is proposed exploiting the characteristics of the disordered movement, rapid deformation and intense colour of the flame. Firstly, the frame difference between video frame and background frame is calculated to obtain the main part of the moving object, and the difference between frames is calculated frame by frame in time series to obtain the deformation part of the moving object, and then the sum of cumulative difference between frames and the background difference between frames are added to generate a binary image containing the moving object and the deformed part. Secondly, the binary image is morphologically opened, and rectangular segmentation is carried out to obtain multiple suspicious flame regions. Finally, in the light of the intense colour of the flame, the corresponding area is extracted from the original picture by using the segmentation rectangle, and the colour statistics of the area are carried out to further judge whether there is a burning flame in the area. The experimental results show that the algorithm can accurately detect the burning area of flame in the real scene and eliminate the light interference and the movement interference.
In recent years, real-timevideoprocessing has advanced thanks to dramatic improvements in object detection algorithms. This has increased the demand for video object detection by various edge devices. An example is ...
详细信息
The authors have developed a cable-stayed bridge (CSB) cable inspection robot to improve the efficiency of cable inspection of CSB. Six cameras were mounted on the robot, which takes a video of the whole circumference...
详细信息
The authors have developed a cable-stayed bridge (CSB) cable inspection robot to improve the efficiency of cable inspection of CSB. Six cameras were mounted on the robot, which takes a video of the whole circumference of the cable surface as itmoves up and down the cable. The method is characterized by making an image development diagram from the taken video. In this paper, the red LEDs on the green base plate are installed in the CSB cable inspection robot in order to synchronize the cameras. Then, a time synchronization method using imageprocessing is proposed to automatically detect the red LEDs from the taken video. Finally, this paper discusses the results of automatically detecting the detection accuracy of red LEDs lighting from the actual videos of the inclined cables, which demonstrate the robot's ability to improve work efficiency and reduce workplace hazards.
This paper aims to research and implement a real-timevideo target tracking algorithm based on Convolutional Neural Networks (CNN), enhancing the accuracy and robustness of target tracking in complex scenarios. Addres...
详细信息
We introduce MultiDiff, a novel approach for consistent novel view synthesis of scenes from a single RGB image. The task of synthesizing novel views from a single reference image is highly ill-posed by nature, as ther...
详细信息
ISBN:
(纸本)9798350353006
We introduce MultiDiff, a novel approach for consistent novel view synthesis of scenes from a single RGB image. The task of synthesizing novel views from a single reference image is highly ill-posed by nature, as there exist multiple, plausible explanations for unobserved areas. To address this issue, we incorporate strong priors in form of monocular depth predictors and video-diffusion models. Monocular depth enables us to condition our model on warped reference images for the target views, increasing geometric stability. The video-diffusion prior provides a strong proxy for 3D scenes, allowing the model to learn continuous and pixel-accurate correspondences across generated images. In contrast to approaches relying on autoregressive image generation that are prone to drifts and error accumulation, MultiDiff jointly synthesizes a sequence of frames yielding high-quality and multi-view consistent results - even for long-term scene generation with large camera movements, while reducing inference time by an order of magnitude. For additional consistency and image quality improvements, we introduce a novel, structured noise distribution. Our experimental results demonstrate that MultiDiff outperforms state-of-the-art methods on the challenging, real-world datasets realEstate10K and ScanNet. Finally, our model naturally supports multi-view consistent editing without the need for further tuning.
Mixed reality technology (MR) combines the advantages of virtual reality (VR) and augmented reality (MR), by introducing the real scene information in the virtual environment. The real space is first scanned and recog...
详细信息
暂无评论