检索结果-内蒙古大学图书馆

IEEE/CVF conference on Computer Vision and Pattern Recognition (CVPR)

作者： Xu, Gangwei Wang, Yujin Gu, Jinwei Xue, Tianfan Yang, Xin Huazhong Univ Sci & Technol Sch EIC Wuhan Peoples R China Shanghai AI Lab Shanghai Peoples R China Chinese Univ Hong Kong Hong Kong Peoples R China

ISBN: (纸本)9798350353006

Reconstructing High Dynamic Range (HDR) video from image sequences captured with alternating exposures is challenging, especially in the presence of large camera or object motion. Existing methods typically align low dynamic range sequences using optical flow or attention mechanism for deghosting. However, they often struggle to handle large complex motions and are computationally expensive. To address these challenges, we propose a robust and efficient flow estimator tailored for real-time HDR video reconstruction, named HDRFlow. HDRFlow has three novel designs: an HDR-domain alignment loss (HALoss), an efficient flow network with a multi-size large kernel (MLK), and a new HDR flow training scheme. The HALoss supervises our flow network to learn an HDR-oriented flow for accurate alignment in saturated and dark regions. The MLK can effectively model large motions at a negligible cost. In addition, we incorporate synthetic data, Sintel, into our training dataset, utilizing both its provided forward flow and backward flow generated by us to supervise our flow network, enhancing our performance in large motion regions. Extensive experiments demonstrate that our HDRFlow outperforms previous methods on standard benchmarks. To the best of our knowledge, HDRFlow is the first real-time HDR video reconstruction method for video sequences captured with alternating exposures, capable of processing 720p resolution inputs at 25ms. Project website: https://***/HDRFlow/.

关键词： HDR video Optical Flow

来源：评论

学校读者我要写书评

暂无评论

6th IEEE International conference on image processing, Applications and Systems, IPAS 2025 - Proceedings

6th IEEE International Conference on Image Processing, Appli...

引用

6th IEEE International conference on image processing, Applications and Systems, IPAS 2025

ISBN: (纸本)9798331506520

The proceedings contain 86 papers. The topics discussed include: robust real-time monitoring of complex human activities using multi modal video analytics;a robust approach for classifying laparoscopic video distortions using ResNet-50;enhancing x-ray image classification through neural architecture;revolutionary MRI imaging for Alzheimer’s: cutting-edge GANs and vision transformer solutions;advanced deep learning strategies for breast cancer image analysis;identifying surgical instruments in pedagogical cataract surgery videos through an optimized aggregation network;enhancing auxiliary cancer classification task for multi-task breast ultrasound diagnosis network;and bioinspired computer vision for effective extended reality applications.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Small UAV Urban Overhead Transmission Line Autonomous Correction Inspection System Based on Radar and RGB Camera

引用

IEEE SENSORS JOURNAL 2024年第5期24卷 5593-5608页

作者： Li, Ziran Wu, Hao Wang, Qi Wang, Wei Suzuki, Satoshi Namiki, Akio Chiba Univ Dept Mech Engn Chiba 2638522 Japan Nanjing Univ Informat Sci & Technol Jiangsu Collaborat Innovat Ctr Atmospher Environm Nanjing 210044 Peoples R China

As the scale of the power grid continues to expand, drone inspection operations are becoming increasingly popular. However, most of the existing inspection drones are for transmission line inspection in the open field environment, with the characteristics of large size and high quality, which is difficult to be directly applied to transmission line inspection around the city area. To address the above issues, in this article, a small unmanned aerial vehicle (UAV) inspection system is designed with the aim of achieving autonomous inspection of overhead ground wires in urban peripheral areas, combined with image processing technology, with a total weight of less than 400 g. Specifically, during the inspection of small UAV, Raspberry Pi uses traditional image processing methods, such as Hough transform, to obtain the position information and pixel error of ground wire from real-time video stream;the flight control system uses the results of image processing combined with data from millimeter-wave radar to achieve conversion from pixel error to actual distance error;finally, the ground wire is made to be in the center of the video as much as possible through the correction strategy, thus realizing the autonomous inspection task of the small UAV along the line. The experimental results show that the small UAV can stably identify the target transmission lines and achieve autonomous flight along the lines with horizontal deviation within plus or minus 0.3 m and height deviation within plus or minus 0.1 m, which is of great reference value for the application of small UAV in urban transmission line inspection.

关键词： Autonomous correction system control system error conversion image processing small unmannedaerial vehicle (UAV)

来源：评论

学校读者我要写书评

暂无评论

SketchAnimator: Animate Sketch via Motion Customization of Text-to-video Diffusion Models

SketchAnimator: Animate Sketch via Motion Customization of T...

引用

2024 conference on Visual Communications and image processing

作者： Yang, Ruolin Li, Da Zhang, Honggang Song, Yi-Zhe Beijing Univ Posts & Telecommun Sch Artificial Intelligence PRIS Beijing Peoples R China Univ Surrey SketchX CVSSP Guildford Surrey England

ISBN: (纸本)9798331529543;9798331529550

Sketching is a uniquely human tool for expressing ideas and creativity. The animation of sketches infuses life into these static drawings, opening a new dimension for designers. Animating sketches is a time-consuming process that demands professional skills and extensive experience, often proving daunting for amateurs. In this paper, we propose a novel sketch animation model SketchAnimator, which enables adding creative motion to a given sketch, like "a jumping car". Namely, given an input sketch and a reference video, we divide the sketch animation into three stages: Appearance Learning, Motion Learning and video Prior Distillation. In stages 1 and 2, we utilize LoRA to integrate sketch appearance information and motion dynamics from the reference video into the pre-trained T2V model. In the third stage, we utilize Score Distillation Sampling (SDS) to update the parameters of the Bezier curves in each sketch frame according to the acquired motion information. Consequently, our model produces a sketch video that not only retains the original appearance of the sketch but also mirrors the dynamic movements of the reference video. We compare our method with alternative approaches and demonstrate that it generates the desired sketch video under the challenge of one-shot motion customization.

关键词： Sketch animation diffusion process generative model video generation motion extraction

来源：评论

学校读者我要写书评

暂无评论

Client-Server Application for real-time video Streaming 47

Client-Server Application for Real-Time Video Streaming

引用

47th International conference on Telecommunications and Signal processing, TSP 2024

作者： Dobrea, Silviu Nicolae Petrisor, Daniel 'Gheorghe Asachi' Technical University of Iasi Faculty of Electrical Engineering Iasi Romania

ISBN: (纸本)9798350365597

This paper presents the design and implementation of a client-server application for real-time video data transfer between two devices. A large number of real-time video data transfer techniques already exist. Movie and media platforms, video conferencing and webinars, video surveillance and even video calling use real-time video data transfer techniques. How-ever, most applications in the fields mentioned above use proprietary solutions, offering limited or no access to the techniques. This paper details the work to design, implement and evaluate a client-server system that can be used to further develop advanced features and applications that need to transfer video data in real-time. Our performance tests indicate that the created system is able to transfer video data with at least 24 frames per second on two target platforms. Performance evaluations shows dependencies on compute, network bandwidth and storage as influenced by the target hardware devices. Therefore, this paper describes a system that can be used as framework in the development of real-time video data transfer applications11Source code available at: https://***/silviu-nicolae-dobrea/Client-Server-Application-for-real-time-video-Streaming. © 2024 IEEE.

关键词： Data transfer

来源：评论

学校读者我要写书评

暂无评论

Improving image encoding quality with a low-complexity DCT approximation using 14 additions

引用

JOURNAL OF real-time image processing 2023年第3期20卷 58页

作者： Mefoued, Abdelkader Kouadria, Nasreddine Harize, Saliha Doghmane, Noureddine Badji Mokhtar Annaba Univ Fac Technol Elect Dept Lab Automat & Signals Annaba LASA Annaba 23000 Angola

The quality of images is crucial in image and video compression, especially for resource-constrained systems that prioritize simplicity. To achieve fast and low-energy compression, such systems aim to strike a balance between image quality and computational complexity. While various Discrete Cosine Transform (DCT) approximations have been proposed, only two approximations with 14 additions are currently available. This paper presents a novel 8-point DCT approximation that improves image quality compared to the previous 14-addition transformations. Additionally, a pruned version is derived and shown to be efficient. The proposed approximation achieves an average quality gain of up to 1 dB while maintaining a similar computational structure to the previous transformations, resulting in comparable energy consumption. Therefore, this solution provides a compelling option for resource-constrained systems seeking efficient image compression while preserving high image quality.

关键词： DCT approximation Low complexity algorithm Low power consumption image compression

来源：评论

学校读者我要写书评

暂无评论

Bidirectional Temporal Fusion video Denoising Based on W-Net 7

Bidirectional Temporal Fusion Video Denoising Based on W-Net

引用

7th International conference on video and image processing, ICVIP 2023

作者： Li, Derui Zhang, Haikun Hu, Yueli Shanghai University School of Mechatronic Engineering and Automation Department of Electrical Engineering Shanghai China

ISBN: (纸本)9798400709388

The paper provided a brief analysis of video denoising characteristics, discussed and analyzed various existing video denoising methods, and proposed a new video denoising algorithm based on bidirectional time fusion and the W-Net architecture, designed to meet the requirements of real-time video denoising. This algorithm effectively combines past and future information, increases the temporal receptive field, and reduces memory usage. Additionally, by selecting a deeper W-Net backbone network, the algorithm achieves high-fidelity real-time video denoising. Comparative analysis with other video denoising models demonstrated that this approach outperforms others in terms of fidelity. © 2023 Copyright held by the owner/author(s).

关键词： video signal processing

来源：评论

学校读者我要写书评

暂无评论

The Biomechanical Analysis on the Tennis Batting Angle Selection Under Deep Learning

引用

IEEE ACCESS 2023年 11卷 97758-97768页

作者： Li, Jian Zhang, Xiaolong Yang, Guobing Xian Int Studies Univ Sports Dept Xian 710000 Shaanxi Peoples R China Xizang Univ Natl Inst Phys Educ Xianyang 712082 Shaanxi Peoples R China

The objective is to study the impacts of batting strength and angle of tennis players on batting results based on deep learning (DL) image processing technology. A real-time evaluation algorithm of human motion is constructed based on the camera video image and convolution neural network (CNN), and the selection of joint angles in volley training of tennis players is analyzed from the perspective of biomechanics. Gaussian Mixture Model (GMM), Visual Background Extractor (VIBE), and Optical Flow (OF) are introduced for simulation and comparison. Then, the proposed algorithm is applied to the volley experiments in areas A, B, and C of 6 tennis players (denoted by P1, P2, P3, P4, P5, and P6). The results show that the processing frame rate and batting and follow-up similarity score of the proposed algorithm based on the camera video image and CNN are significantly higher than those of GMM, VIBE, and OF. The return success rates of P1 in different areas are the highest, which are 75.46%, 75.62%, and 68.94%, respectively;while those of P6 are the lowest (19.55%, 17.46%, and 21.65%, respectively). The left ankle angle of P6 is much greater than that of P1, the angle of P1 is significantly lower than that of P3, P4, P5, and P6. The batting speed of P1 is significantly slower than that of P3, P4, P5, and P6, which is not much different from that of the left knee joint. The angles of the subjects' right forearm ring, left lower leg ring, and left thigh ring is obvious. Additionally, the displacement of the left foot of P1 and P6 in area A is 0.916m and 0.548m, respectively. Therefore, in the volley preparation stage, the left ankle angle (103-108 & DEG;) is greater than that of the right ankle (98-103 & DEG;);the tennis batting speed should be basically the same as that of the left knee joint to lower the gravity center of player. Thus, the proposed algorithm outperforms other algorithms in the volley experiment of tennis players.

关键词： Sports Training real-time systems Cameras Biomechanics Convolutional neural networks Games video recording Human activity recognition monocular camera video image real-time evaluation algorithm of human motion volley experiment of tennis biomechanics

来源：评论

学校读者我要写书评

暂无评论

real-time video Denoising Acceleration Using Pixel Shuffle and FP16 16th

Real-Time Video Denoising Acceleration Using Pixel Shuffle a...

引用

16th International conference on Genetic and Evolutionary Computing, ICGEC 2024

作者： Masuko, Riku Sugiura, Yosuke Shimamura, Tetsuya Graduate School of Science and Engineering Saitama University Shimo-Okubo 255 Sakura-ku Saitama-Shi Saitama338-8570 Japan

ISBN: (纸本)9789819615346

When capturing videos with cameras, noise can occur due to variations in lighting conditions, movements of subjects or cameras, and the quality of camera sensors. The presence of noise complicates object detection and tracking. To mitigate these issues, video denoising techniques have been developed. Numerous video denoising techniques have been proposed, with convolutional neural networks (CNNs) being predominant in recent years. While CNN-based methods achieve high accuracy in video denoising, they often lack causality because they utilize temporal information from the next frame as well as the current frame, which poses challenges for real-time processing without degrading image quality. Therefore, in this paper, we propose a new architecture that enhances processing speed by improving the Efficient Multi-stage video Denoising (EMVD), which is of the state-of-the-art video denoising methods. Through experiments, it was demonstrated that the proposed method reduced the computation time by approximately 75% while limiting the accuracy degradation to 0.7% compared to conventional methods. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

Animation Style Background Production Based on GAN 6

Animation Style Background Production Based on GAN

引用

6th International conference on image, video processing, and Artificial Intelligence, IVPAI 2024

作者： Wang, Zijian Zhang, Shanjun Department of computer science Kanagawa University 3-27-1 Rokkakubashi Kanagawa-ku Kanagawa Yokohama-shi Japan

ISBN: (纸本)9781510681781

processing images using object detection, image restoration, and generative adversarial networks to directly convert real-world images into high-quality anime-style background images is one of today's research hotspots in computer vision. Input real-world images, object detection using the cutting-edge target detection algorithm DETR and generation of masks for the detected objects. The image restoration algorithm LaMa is then used to erase areas of the image with masked portions, generating a real-world background image. Finally, AnimeGAN generative adversarial network is used to convert the real world background image into anime style background image. Aiming at the current popular AnimeGAN's problems such as color distortion in image migration, a new AnimeGAN-SE is proposed by introducing SE-Residual Block (Squeeze Excitation Residual Block) to solve the problem of low color of the migrated image of AnimeGAN. The experimental results show that the network works well for animated pictures. © 2024 SPIE.

关键词： image reconstruction

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：