检索结果-内蒙古大学图书馆

2024 International conference on image processing

作者： Hirose, Shota Kotoyori, Kazuki Arunruangsirilert, Kasidis Lin, Fangzheng Sun, Heming Katto, Jiro Waseda Univ Sch Fundamental Sci & Engn Tokyo Japan Tokyo Inst Technol Sch Engn Tokyo Japan Yokohama Natl Univ Yokohama Kanagawa Japan

ISBN: (纸本)9798350349405;9798350349399

Transmission latency significantly affects users' quality of experience in real-time interaction and actuation. As latency is principally inevitable, video prediction can be utilized to mitigate the latency and ultimately enable zero-latency transmission. However, most of the existing video prediction methods are computationally expensive and impractical for real-time applications. In this work, we therefore propose real-time video prediction towards the zero-latency interaction over networks, called IFRVP (Intermediate Feature Refinement video Prediction). Firstly, we propose three training methods for video prediction that extend frame interpolation models, where we utilize a simple convolution-only frame interpolation network based on IFRNet. Secondly, we introduce ELAN-based residual blocks into the prediction models to improve both inference speed and accuracy. Our evaluations show that our proposed models perform efficiently and achieve the best trade-off between prediction accuracy and computational speed among the existing video prediction methods. A demonstration movie is also provided at http://***/IFRVPDemo.

关键词： video Prediction Frame Interpolation Remote Operation Lightweight Model Efficient Layer Aggregation Network (ELAN)

来源：评论

学校读者我要写书评

暂无评论

A video Streaming Encryption Method and Experimental System Based on Reconfigurable Quaternary Logic Operators

引用

IEEE ACCESS 2024年 12卷 25034-25051页

作者： Zhou, Xinyu Wang, Hongjian Li, Kuiyan Tang, Lifeng Mo, Ningchun Jin, Yi Donghua Univ Sch Comp Sci & Technol Shanghai 201620 Peoples R China Shanghai Astronaut Elect Co Ltd Shanghai 201821 Peoples R China Shanghai Meiduo Commun Equipment Co Ltd Shanghai 200333 Peoples R China Shanghai Univ Sch Comp Engn & Sci Shanghai 200444 Peoples R China

Multiple-valued logics (MVL) have abundant operation functions which can be used for encryption. A reconfigurable MVL operator can perform all MVL functions with a universal circuit structure at fast operation speed, based on which a one-time-pad cryptosystem is expected to be built. However, we find that when the existing MVL encryption method is applied to video data encryption, the color edges in the plaintext image will remain in the ciphertext image, resulting in partial leakage of information. To solve this problem, we propose byte reorganization and random mask strategies, forming an improved MVL encryption method for video streaming. For verifying the effectiveness of the method, we implement an FPGA-based experimental system to encrypt and decrypt real-time video streaming data. In this system, 16-quit reconfigurable quaternary logic operators are implemented to encrypt, decrypt and derive keys. The process of either encryption or decryption only takes 34 clock cycles. The encryption and decryption modules are capable of processing streaming data at a speed of 6.21 Gbit/s, showing that the system has real-time processing capability. For proving that our method is secure, we compare our improved MVL encryption method with existing image encryption methods in terms of common security evaluation metrics. Experimental results show that our method solves the problem of remained color edge and the ciphertext exhibits good statistical properties.

关键词： Encryption Cryptography Streaming media real-time systems Symbols image edge detection Field programmable gate arrays videos image processing Reconfigurable architectures video and image encryption multiple-valued logic reconfigurable quaternary logic operator field programmable gate array

来源：评论

学校读者我要写书评

暂无评论

Novel parametric based time efficient portable real-time dehazing system

引用

JOURNAL OF real-time image processing 2023年第2期20卷 23页

作者： Ghosh, Avra Ali, Asfak Roy, Sangita Chaudhuri, Sheli Sinha Jadavpur Univ Dept Elect & Telecommun Engn 188 Raja SC Mallick Rd Kolkata 700032 India Narula Inst Technol Elect & Commun Engn North 24 Paraganas Kolkata 700109 India

Research and development on dehazing algorithms have come a long way and the current algorithms work very efficiently in generating clear dehazed images, restoring the images whose contrast gets impaired due to presence of aerosols in the atmosphere. However these algorithms do not work well when applied to dehaze video sequences of hazy scenes because of the time taken to do so, making them unsuitable in real time applications. In this paper, a real-time video dehazing technique has been proposed with a novel haze parameter 'SATVAL' which is the ratio of maximum saturation to maximum value of a RGB image applied on image scattering model using a few video frames processing in a second. A frame with a 'SATVAL' ratio below threshold value is considered to be dehazed or else passed without dehazing. This makes a dehazed video sequence perform accurately in real-time comparable to other contemporary methods. A portable "Raspberry pi model 4B" is used for validation video-on-board or a remote server displaying on a LCD screen. Extensive experimental studies have been carried out to test the effectiveness of the method both at hardware and software levels in comparisons with four existing methods qualitatively and quantitatively. MSE, SSIM, Correlation, PSNR, FPS are the evaluating parameters showing promising output with high quality video in real-time. Finally ten video datasets have been developed for successful implementation of this method.

关键词： real-time video processing Dehazing Haze parameter Raspberry Pi SATVAL dataset

来源：评论

学校读者我要写书评

暂无评论

A method for real-time translation of online video subtitles in sports events

引用

SIGNAL image AND video processing 2025年第1期19卷 1-13页

作者： Zhiliang, Zeng Lei, Wang Qiang, Liu Lanzhou Univ Dept Phys Educ Teaching & Res Lanzhou 730000 Gansu Peoples R China Tangshan Normal Univ Dept Phys Educ Tangshan 063000 Hebei Peoples R China Yancheng Kindergarten Teachers Coll Yancheng 224005 Jiangsu Peoples R China

This study offers a fresh technique for translating subtitles in sports events, addressing the issues of real-time translation with improved accuracy and efficiency. Different from standard methods, which often result in delayed or inaccurate subtitles, the proposed method integrates advanced annotation techniques and machine learning algorithms to increase subtitle recognition and extraction. Annotation techniques in this study include systematically labeling spoken elements like commentary and dialogue, enabling accurate subtitle recognition and real-time adjustments in live sports broadcasts to ensure both accuracy and contextual relevance. These novel ideas allow for seamless adjustments to multiple language types, including the voices of commentators, off-site hosts, and athletes, while maintaining critical information within strict word count limits. Key improvements include faster processing times and increased translation precision, which are crucial for the dynamic environment of live sports broadcasts. The study builds on past studies in audiovisual translation, specifically tailoring its strategy to the unique demands of sports media. By emphasizing the importance of clear and contextually appropriate real-time subtitles, this research presents significant advancements over existing methods, providing valuable insights for future translation projects in sports and similar contexts. The results contribute to a more effective subtitle translation framework, enhancing the accessibility and viewing experience for audiences during live sports events.

关键词： Sports events Online video Subtitle translation real-time

来源：评论

学校读者我要写书评

暂无评论

Semantic-Driven Ghosting-Free image Fusion Using Dual Gain video Stream With FPGA

引用

INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS 2025年第1期21卷

作者： Xu, Yang Xie, Longhua Huang, Hongchuan Yu, Feihong Zhao, Tingyu Zhejiang Sci Tech Univ Optoelect Informat Engn Hangzhou Peoples R China Zhejiang Univ Opt Engn Hangzhou Peoples R China Lib Zhejiang Scitech Univ Hangzhou 310018 Zhejiang Peoples R China

Conventional methods that merge multiple images with different exposure levels often suffer from blur and ghosting due to object movement. Existing ghosting removal algorithms are usually complex and slow, making them unsuitable for real-time video applications. To address this challenge, on an FPGA. IMX662 image sensor is employed, which simultaneously captures both HCG and LCG images with the same exposure time, enabling efficient HDR image synthesis. The proposed method directly addresses the source of the problem, eliminating the need for post-processing steps, thereby preserving algorithmic simplicity. Experimental results reveal that the proposed method not only removes ghosting by 100% but also processes data on an FPGA 98.79% faster than traditional software-based HDR fusion techniques, enabling real-time video stream processing. This dual gain, ghosting-free fusion algorithm demonstrates promising potential for use in high-speed photography and surveillance.

关键词： image Fusion Ghosting-Free High Conversion Gain Low Conversion Gain Field Programmable Gate Array

来源：评论

学校读者我要写书评

暂无评论

Fast Machine Learning Aided Intra Mode Decision for real-time VVC Intra Coding

Fast Machine Learning Aided Intra Mode Decision for Real-Tim...

引用

2024 conference on Visual Communications and image processing

作者： Sainio, Joose Ataman, Baran Marie, Alban Mercat, Alexandre Vanne, Jarno Tampere Univ Ultra Video Grp Tampere Finland

ISBN: (纸本)9798331529543;9798331529550

Reducing the huge computational complexity of intra mode decision is the key to real-time video Coding (VVC). This paper proposes a fast intra mode decision scheme that takes advantage of lightweight machine learning (ML) models to classify intra modes into fifteen clusters. The cluster is further refined using one of the three proposed strategies to select the most optimal mode. Our experimental results with the fastest configuration of the practical uvg266 encoder show that the proposed methods yield a competitive rate-distortion-complexity trade-off over a conventional rough mode decision (RMD). To the best of our knowledge, this is the first work to successfully reduce the complexity of RMD in a practical VVC encoder with the use of ML techniques.

关键词： real-time encoding Versatile video Coding (VVC) intra mode decision machine learning

来源：评论

学校读者我要写书评

暂无评论

real-time statistical image and video processing for remote sensing and surveillance applications

引用

JOURNAL OF real-time image processing 2021年第5期18卷 1435-1439页

作者： Khosravi, Mohammad R. Tavallali, Pooya Persian Gulf Univ Dept Comp Engn Bushehr Iran Univ Calif Merced Dept Elect Engn & Comp Sci EECS Merced CA 95343 USA

[...]in recent years, there has been much research focus on reducing the complexity of Deep Learning models and essentially improving their speed while preserving their accuracy. [...]a very fundamental and hot topic is the application of such models in image and video processing tasks such as remote sensing. [...]by taking inertial measurement error and the motion model’s error with respect to the coordinate, the coordinate variation is corrected. [...]the method is parallelized to achieve further reduction of processing time.

关键词： Share this articleAnyone you share the following link with will be able to read this content:Get shareable linkSorry a shareable link is not currently available for this article.Copy to clipboard Provided by the Springer Nature SharedIt content-sharing initiative

来源：评论

学校读者我要写书评

暂无评论

VSIP 2021 - Proceedings of 2021 3rd International conference on video, Signal and image processing

VSIP 2021 - Proceedings of 2021 3rd International Conference...

引用

3rd International conference on video, Signal and image processing, VSIP 2021

ISBN: (纸本)9781450385886

The proceedings contain 22 papers. The topics discussed include: real time hand gesture recognition in industry;epidemic prevention system based on voice recognition combined with intelligent recognition of mask and helmet;activity recognition in industrial environment using two layers learning;a new method of specific emitter feature extraction based on IQ imbalance;mixup augmentation for deep hashing;multi-resolution Gabor descriptor for corrosion detection in pipeline video sequences;image deep steganography detection based on knowledge distillation in teacher-student network;a multi-scale framework for visual grounding;a comparison of three swarm-based optimization algorithms in wind turbine radar clutter micro-motion parameters estimation;the influence of accounting information system quality and human resource competency on information quality;measurement and analysis of electrophysiological propagation on the cardiac slice-based biosensor;and improve the field-of-view of cameras: consideration on the micro lens array.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Speed-Up DDPM for real-time Underwater image Enhancement

引用

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR video TECHNOLOGY 2024年第5期34卷 3576-3588页

作者： Lu, Siqi Guan, Fengxu Zhang, Hanyu Lai, Haitao Harbin Engn Univ Coll Intelligent Syst Sci & Engn Harbin 150001 Heilongjiang Peoples R China

Underwater images often suffer from serious color bias and blurred features because of the effect of the water bodies on the light. To enhance underwater images, we present SU-DDPM, a method of real-time underwater image enhancement (UIE) based on a denoising diffusion probabilistic model (DDPM). SU-DDPM outperforms other baseline and generative adversarial network models in underwater image enhancement, thus establishing a new state-of-the-art baseline. SU-DDPM processes images more rapidly than the diffusion model, which makes it competitive with other deep learning-based methods. We demonstrate that if conditional DDPM is used directly for the UIE task, the processing speed is slow, and the enhanced images are of poor quality and show color bias. The quality of the enhanced image is improved by combining the degraded image with the reference image in the diffusion stage to create a fusion-DDPM model. The specificity of the UIE task allows us to accelerate the inference process by changing the initial sampling distribution and reducing the number of iterations in the denoising stage of the model. We evaluate SU-DDPM on the UIE task using challenging real underwater image datasets and a synthetic image dataset and compare it to state-of-the-art models. SU-DDPM ensures increased enhancement quality, and enhancement processing speed is comparable to the speed of real-time enhancement models.

关键词： Underwater image enhancement denoising diffusion probabilistic model (DDPM) underwater image restoration deep learning

来源：评论

学校读者我要写书评

暂无评论

Optimization and sensitivity analysis for developing a real-time non-contact physiological parameters measurement and monitoring system using IPPG signal for biomedical applications

引用

SIGNAL image AND video processing 2025年第3期19卷 1-10页

作者： Bhadouria, Vikesh Singh Park, You-rim Eom, Joo Beom Dankook Univ Dept Biomed Sci Sch Med Cheonan si 31116 South Korea

Healthcare monitoring depends on the accuracy of the measured physiological parameters in real-time, given the ongoing increase in the number of patients as compared to the limited medical physicians. Imaging photoplethysmography (IPPG) is one of the emerging non-invasive techniques for the measurement of vital signs, including oxygen saturation (SpO2), heart rate (HR), and respiratory rate (RR). This work explores a comprehensive sensitivity analysis to evaluate the impact of the critical acquisition parameters such as (1) image resize, from 100 to 2%, (2) the region of interest (ROI) within the images, and (3) acquisition duration, from 5 s to 30 s, using image sequences obtained at 30 frames per second. To evaluate and validate the performance of the system, the study consists of several mouse examinations to enhance both precision and consistency in real-time monitoring. The analysis reveals that how image resize influences signal integrity, image resolution, and processing efficiency, which is crucial for resource-limited applications. The ROI selection analysis discovers the key regions to optimize the accuracy of measured vital signs, while the evaluation of acquisition duration provides insights in terms of ensuring the reliable minimum duration for vital signs. These comprehensive analysis advances the current state of the art and addresses the previously overlooked but important factors that offers a robust framework for effective real-time monitoring for research and medical applications.

关键词： Physiological parameters Imaging photoplethysmography SpO2 monitoring Heart rate Respiratory rate Automation image processing Signal processing image acquisition

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：