Deploying style transfer methods on resource-constrained devices is challenging, which limits their real-world applicability. To tackle this issue, we propose using pruning techniques to accelerate various visual styl...
详细信息
ISBN:
(纸本)9798350344868;9798350344851
Deploying style transfer methods on resource-constrained devices is challenging, which limits their real-world applicability. To tackle this issue, we propose using pruning techniques to accelerate various visual style transfer methods. We argue that typical pruning methods may not be well-suited for style transfer methods and present an iterative correlation-based channel pruning (ICCP) strategy for encoder-transform-decoder-based image/video style transfer models. The correlation-based channel regularization preserves the feature distributions for content and style references, and the iterative pruning strategy prevents layer collapse when pruning on the encoder-decoder structure. Experiments demonstrate that the proposed ICCP can generate visual competitive results compared to SOTA style transfer methods and significantly reduces the number of parameters (at least 70K) and inference time. Model is available at https://***/wukx-wukx/ICCP.
We present two deep unfolding neural networks for the simultaneous tasks of background subtraction and foreground detection in video. Unlike conventional neural networks based on deep feature extraction, we incorporat...
详细信息
We present two deep unfolding neural networks for the simultaneous tasks of background subtraction and foreground detection in video. Unlike conventional neural networks based on deep feature extraction, we incorporate domain knowledge models by considering a masked variation of the robust principal component analysis problem (RPCA). With this approach, we separate video clips into low-rank and sparse components, respectively corresponding to the backgrounds and foreground masks indicating the presence of moving objects. Our models, coined ROMAN-S and ROMAN-R, map the iterations of two alternating direction of multipliers methods (ADMM) to trainable convolutional layers, and the proximal operators are mapped to non-linear activation functions with trainable thresholds. This approach leads to lightweight networks with enhanced interpretability that can be trained on limited data. In ROMAN-S, the correlation in time of successive binary masks is controlled with side-information based on l(1)-l(1) minimization. ROMAN-R enhances the foreground detection by learning a dictionary of atoms to represent the moving foreground in a high-dimensional feature space and by using reweighted-l(1)-l(1) minimization. Experiments are conducted on both synthetic and realvideo datasets, for which we also include an analysis of the generalization to unseen clips. Comparisons are made with existing deep unfolding RPCA neural networks, which do not use a mask formulation for the foreground, and with a 3D U-Net baseline. Results show that our proposed models outperform other deep unfolding networks, as well as the untrained optimization algorithms. ROMAN-R, in particular, is competitive with the U-Net baseline for foreground detection, with the additional advantage of providing video backgrounds and requiring substantially fewer training parameters and smaller training sets.
作者:
Wang, LuLiu, YuxiangMeng, FanxuZhang, ZaichenYu, XutaoSoutheast Univ
Sch Informat Sci & Engn 2 Southeast Univ Rd Nanjing 211189 Jiangsu Peoples R China Southeast Univ
State Key Lab Millimeter Waves 2 Southeast Univ Rd Nanjing 211189 Jiangsu Peoples R China Southeast Univ
Frontiers Sci Ctr Mobile Informat Commun & Secur 2 Southeast Univ Rd Nanjing 211189 Jiangsu Peoples R China Nanjing Tech Univ
Coll Artificial Intelligence 30 Puzhu Nan Rd Nanjing 211800 Jiangsu Peoples R China Southeast Univ
Natl Mobile Commun Res Lab 2 Southeast Univ Rd Nanjing 211189 Jiangsu Peoples R China Purple Mt Labs
9 Mozhou Dong Rd Nanjing 211111 Jiangsu Peoples R China
Classical algorithms for moving target segmentation have made significant progress, but the real-time problem has become a significant obstacle for them as the data volume grows. Quantum computing has been proven to b...
详细信息
Classical algorithms for moving target segmentation have made significant progress, but the real-time problem has become a significant obstacle for them as the data volume grows. Quantum computing has been proven to be beneficial for image segmentation, but is still scarce for video. In this paper, a quantum moving target segmentation algorithm based on mean background modeling is proposed, which can utilize the quantum mechanism to do segmentation operations on all pixels in a video at the same time. In addition, a quantum divider with lower quantum cost is designed calculate pixel mean, and then, a number of quantum modules are designed according to the algorithmic steps to build the complete quantum algorithmic circuit. For a video containing 2m\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$2<^>m$$\end{document} frames (every frame is a 2nx2n\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$2<^>n \times 2<^>n$$\end{document} image with q grayscale levels), the proposed algorithm is superior compared to both existing quantum and classical algorithms. Finally, the experiment on IBM Q shows the feasibility of the algorithm in the NISQ era.
In order to improve the visual quality of low-resolution video frames, this study introduces a new superresolution method for real-timevideoprocessing that is powered by artificial intelligence. With little computat...
详细信息
The proceedings contain 39 papers. The topics discussed include: optimization method of loop detection based on shadow compensation;realtime lane detection model based on lightweight;research on image detection algor...
ISBN:
(纸本)9781450389075
The proceedings contain 39 papers. The topics discussed include: optimization method of loop detection based on shadow compensation;realtime lane detection model based on lightweight;research on image detection algorithm based on improved retinanet;a study of student learning status classification based on the detection of key objects within the visual field;an outlier detection method based on symmetry and curvature threshold;research on adaptive object detection method of kernel correlation filtering;attention enhanced multi-patch deformable network for image deblurring;recaptured image forensics based on image illumination and texture features;and using temporal convolutional networks to enable action recognition for construction equipment.
Noise interference during the acquisition of digital images can severely degrade image quality, particularly for images captured under low-light conditions;however, the removal of image noise requires sophisticated di...
详细信息
Noise interference during the acquisition of digital images can severely degrade image quality, particularly for images captured under low-light conditions;however, the removal of image noise requires sophisticated digital imageprocessing systems. This study presents a hardware-based solution to real-timeimage denoising using an existing algorithm designed for the removal of mixed impulse noise (salt-and-pepper and random-valued impulse noise), while preserving image edge details and image borders, without the need for additional computation time or memory capacity. Note that mixed impulse noise is typical of most real-world situations, such as the video noise associated with dashboard cameras. The proposed design was implemented using 180 nm complementary metal-oxide-semiconductor (CMOS) technology, consuming only 21.7 mW when operated at 200 MHz. This operating frequency allows the proposed chip to process noisy video streams with resolution of 1920x1080 at 60 frames per second in realtime. In terms of image restoration, the proposed algorithm achieved image quality on par with that achieved using software simulation. We also demonstrated the efficacy of the proposed scheme in denoising noisy videoimages from a dashboard camera.
With the popularization of visual machine learning, intelligent video analytics can automatically analyze and extract information from video streams, yet it brings heavy computing burdens. Edge computing can improve t...
详细信息
ISBN:
(纸本)9798350310900
With the popularization of visual machine learning, intelligent video analytics can automatically analyze and extract information from video streams, yet it brings heavy computing burdens. Edge computing can improve the processing experience by bringing computing resources near users. On top of this, various processing methods and settings have different resource requirements and output different user experiences. How to dynamically select the videoprocessing configuration according to system states becomes a critical problem that remains to be addressed. In this paper, we propose an edge-assisted video analytic framework based on adaptive sampling and detection-based tracking. We design four functional modules to realize a cooperative computing processing flow. We consider two performance metrics, recognition accuracy and processingtime to estimate the experience of real-timevideo analytics. Further, we design an online configuration method based on Double Deep Q-Network, which can adaptively select analytic configurations under the condition of system dynamics. Experimental results based on a real dataset demonstrate the superior performance of the proposed framework on reward, mean Intersection over Union (IoU), and processingtime.
In the field of sports science, visualization techniques are being used more and more widely, especially in the analysis of athletes' movements and body functions. Traditional motion analysis methods often rely on...
详细信息
In the field of sports science, visualization techniques are being used more and more widely, especially in the analysis of athletes' movements and body functions. Traditional motion analysis methods often rely on video recording and sensor data, but these methods are in the capture of subtle physiological changes, this study developed a thermal radiation image analysis method based on image tracking for visual simulation of aerobics movements. In this paper, a high-resolution thermal camera is used to capture the thermal radiation images of aerobics athletes when they perform specific actions in realtime, and imageprocessing software is used to analyze the acquired thermal radiation images and extract the key temperature data and thermal radiation patterns. Combined with the video data recorded by the high-speed camera, the athletes' movements are accurately tracked to ensure that the thermal radiation data corresponds to the specific movements. Statistical methods were used to analyze the thermal radiation image data to explore the influence of different movements on the body's thermal radiation distribution. According to the analysis results, a visual model is constructed to simulate the thermal radiation change process of athletes performing *** results show that there are significant differences in the distribution of heat radiation in different parts of the body when aerobics athletes perform different movements. When performing high-intensity jumping and rotating movements, increased muscle activity leads to increased local temperature;In static stretching, the muscles relax and the local temperature is relatively low. These changes were clearly observed through visual simulation of thermal radiation images, and it was found that the distribution of thermal radiation was closely related to the intensity of muscle activity and blood circulation.
video Analytics is an imageprocessing method that takes video as input and extracts information. It is the latest technology that analyzes and processes a digital video signal for realtime monitoring. Some of the co...
详细信息
Monocular 3D object detection is a crucial topic in autonomous driving and Intelligent transportation systems (ITS). Most existing methods are evaluated on clean datasets but exhibit arresting performance degradation ...
详细信息
暂无评论