检索结果-内蒙古大学图书馆

2024 International conference on image processing

作者： Chen, Bowen Shang, Zaixi Bovik, Alan C. Chung, Jae Won Lerner, David Univ Texas Austin Dept Elect & Comp Engn Austin TX 78712 USA Viasat Carlsbad CA USA

ISBN: (纸本)9798350349405;9798350349399

In the rapidly growing streaming service market, including satellite options, Internet Service Providers (ISPs) face the challenge of continually optimizing network performance to deliver superior video streaming quality, which is vital to optimize customer satisfaction. This pressing need has sparked a drive towards developing advanced Quality of Experience (QoE) prediction models, which are essential in enhancing streaming protocols and guaranteeing smooth viewing experiences for users. However, the efficacy of these models hinges on the availability of extensive, diverse datasets. To fill this critical data void, our study introduces the publicly available LIVE-Viasat real-World Satellite QoE Database, with 179 videos from real-world streaming, encompassing a range of distortions. Enhanced by a study with 54 participants providing detailed QoE feedback, our work not only provides a rich analysis of the determinants of subjective QoE but also delves into how various streaming impairments influence user behavior, thereby offering a more holistic understanding of user satisfaction.

关键词： Quality of Experience subjective video quality assessment objective QoE model

来源：评论

学校读者我要写书评

暂无评论

realtime Scene Enhancement of Low Light image/video 5

Realtime Scene Enhancement of Low Light Image/Video

引用

5th International conference on Intelligent Computing and Human-Computer Interaction, ICHCI 2024

作者： Zeng, Huixia Xiao, Ping School of Electronic and Information Engineering Guangzhou City University of Technology Guangdong Guangzhou510800 China

ISBN: (纸本)9798350368284

The quality of image and videos plays a vital role in case of real-time systems. images are captured without sufficient illumination, lead to low dynamic range and high propensity for generating high noise levels. The processing ideas and networks of several mainstream low-light image enhancement neural network algorithms such as DRBN, DSLR, ZeroDCE, EnlightenGAN, as well as the key technologies used are compared and analyzed. In this paper, the EnglightenGAN neural network is used as the core image enhancement algorithm, and NVIDIA AGX Xavier equipment and video stream processing technology are used to achieve the entire process of video acquisition, video frame encoding, image enhancement processing, frame decoding, and video stream display. It can adapt to most low-light and dark scenes which has a good enhancement effect on video streams collected in low-light scenes. The real-time performance and image enhancement effect of the system achieved good results, with FPS reaching 10, SSIM reaching 0.51753, and PSNR reaching 27.89973. © 2024 IEEE.

关键词： video streaming

来源：评论

学校读者我要写书评

暂无评论

Implementation of the image super-resolution DWT based algorithm on Raspberry Pi platform for real-time applications

Implementation of the image super-resolution DWT based algor...

引用

conference on real-time processing of image, Depth, and video Information

作者： Osorno-Ortiz, Raul J. Ponomaryov, Volodymyr, I Reyes-Reyes, Rogelio Cruz-Ramos, Clara Garcia-Salgado, Beatriz P. Sadovnychiy, Sergiy Inst Politecn Nacl ESIME Culhuacan Ciudad De Mexico Mexico Inst Mexicano Petr Ciudad De Mexico Mexico

ISBN: (纸本)9781510673199;9781510673182

Super resolution (SR) is a technique designed for increasing the spatial resolution in an image from a low resolution (LR) to high resolution (HR) size. SR technology has had a considerable demand in a wide variety of applications to recover HR images, such as medicine, engineering, computer vision, pattern recognition and video production, etc. In contrast to interpolation-based algorithms that often introduce distortions or irregular borders, this study proposes an implementation that can preserve the edges and fine details of an original image through the computation of the wavelet decomposition. Different Discrete Wavelet Transform (DWT) families such as: Daubechies, Symlet, and Coiflet were evaluated. The proposed system was implemented on a Raspberry Pi 4 model B, an embedded device, to get around the PC's mobility limitations, making it possible to create an in-expensive and energy- efficient SR system, reducing their complexity in realtime applications. To investigate the visual performance, SR images have been analysed in subjective matter via human perception view, guaranteeing good perception for the images of different nature from three different datasets such as FullHD (DIV2K), medical (Raabin WBC), and remote sensing (Sentinel- 1). The experimental results of designed implementations appear to demonstrate good performance in commonly used objective criteria: execution time, SSIM, and PSNR (0.742 sec., 0.9164, and 38.72 dB), respectively for images with a super resolution size of 1356 x 2040 pixels.

关键词： Super resolution Discrete Wavelet Transform (DWT) Parallel programing Raspberry Pi real-time processing

来源：评论

学校读者我要写书评

暂无评论

Modified bernoulli map-based scramble and s-box supported colour image encryption

引用

SIGNAL image AND video processing 2025年第1期19卷 1-10页

作者： Etem, Taha Kaya, Turgay Cankiri Karatekin Univ Dept Comp Engn TR-23250 Merkez Turkiye Firat Univ Dept Elect & Elect Engn TR-23250 Merkez Turkiye

The exponential growth of digital image sharing has amplified concerns regarding data privacy and security, especially for colour images of varying sizes and resolutions. Traditional encryption algorithms often fall short in balancing speed, scalability, and robust security for such diverse image datasets. Addressing this gap, we introduce a novel colour image encryption scheme that synergizes modified Bernoulli map-based random number generation for pixel scrambling with an S-Box-supported diffusion process. Our approach first employs a chaotic random number generator to effectively reorder pixel positions, enhancing confusion. This is followed by a diffusion phase utilizing a robust Khan S-Box to introduce nonlinearity and further obfuscate pixel values. To evaluate the security and efficiency of our method, we conducted extensive tests including differential cryptanalysis using NPCR (Number of Pixel Change Rate) and UACI (Unified Average Changing Intensity) metrics. The results demonstrate that our encryption system exhibits high resistance to differential attacks and achieves superior performance compared to existing methods. By combining fast random number generation with strong S-Box diffusion, our scheme offers a scalable and secure solution for real-time colour image encryption, contributing significant advancements to the field of cryptographic image processing.

关键词： image Encryption Scrambling S-Box Bernoulli Map Random Number Generator

来源：评论

学校读者我要写书评

暂无评论

A real-time algorithm for human action recognition in RGB and thermal video

A real-time algorithm for human action recognition in RGB an...

引用

conference on real-time image processing and Deep Learning

作者： Fassold, Hannes Gutjahr, Karlheinz Weber, Anna Perko, Roland JOANNEUM RES DIGITAL Steyrergasse 17 A-8010 Graz Austria

ISBN: (数字)9781510661714

ISBN: (纸本)9781510661707;9781510661714

Monitoring the movement and actions of humans in video in real-time is an important task. We present a deep learning based algorithm for human action recognition for both RGB and thermal cameras. It is able to detect and track humans and recognize four basic actions (standing, walking, running, lying) in real-time on a notebook with a NVIDIA GPU. For this, it combines state of the art components for object detection (Scaled-YoloV4), optical flow (RAFT) and pose estimation (EvoSkeleton). Qualitative experiments on a set of tunnel videos show that the proposed algorithm works robustly for both RGB and thermal video.

关键词： Human action recognition object detection pose estimation thermal video

来源：评论

学校读者我要写书评

暂无评论

Design and Analysis of Imaging Chip using High-Speed AXI-Interface for MPSOC Applications on FPGA Platform

引用

WIRELESS PERSONAL COMMUNICATIONS 2024年第1期135卷 163-182页

作者： Archana, H. R. Reddy, C. R. Byra BMSCE Dept ECE Bangalore Karnataka India BIT Dept ECE Bangalore Karnataka India

The recent innovations in real-time video and image enhancements are allowing much advancement in a wide range of diverse applications. These innovations and advancements provide a new hardware architecture that aims to improve image visualization, processing speed, and complexity reduction in hardware. The imaging chip concept is introduced in this article to support the Multiprocessing system-on-chip (MPSoC) applications in real-time scenarios on a single chip. The imaging chip model is designed using high-speed interface protocol, which includes different image enhancement algorithms that act as a master model, Advanced Extensible Interface (AXI)-4 as an interface model, and dual-port Memory as a slave model. The image enhancement algorithm includes Brightness control, contrast stretching, Adaptive Median Filtering (AMF), Edge-detection techniques, image Thresholding, and image Histogram. The AXI-4 provides a high-speed interface for communicating master and slave modules. The proposed model works based on the modes of operation to process the enhanced image output in MPSoC. The design supports multiple masters and multiple slave modules with a reconfigurable nature. The imaging chip is a module on the Xilinx ISE environment and implemented on Artix-7 Field-Programmable Gate Array (FPGA), along with the performance metrics like chip Area, time, power, and memory utilization are analyzed with improvements. The model offers low latency and high throughput architecture for real-time Multimedia applications.

关键词： image Enhancement AXI Protocol System on Chip (SoC) MPSoC FPGA Adaptive median filter Memory Interface protocol

来源：评论

学校读者我要写书评

暂无评论

Local feature-based video captioning with multiple classifier and CARU-attention

引用

IET image processing 2024年第9期18卷 2304-2317页

作者： Im, Sio-Kei Chan, Ka-Hou Macao Polytech Univ Fac Appl Sci Macau Peoples R China Macao Polytech Univ Engn Res Ctr Appl Technol Machine Translat & Artif Macau Peoples R China

video captioning aims to identify multiple objects and their behaviours in a video event and generate captions for the current scene. This task aims to generate a detailed description of the current video in real-time using natural language, which requires deep learning to analyze and determine the relationships between interesting objects in the frame sequence. In practice, existing methods typically involve detecting objects in the frame sequence and then generating captions based on features extracted through object coverage locations. Therefore, the results of caption generation are highly dependent on the performance of object detection and identification. This work proposes an advanced video captioning approach that works in adaptively and effectively addresses the interdependence between event proposals and captions. Additionally, an attention-based multimodel framework is introduced to capture the main context from the frame and sound in the video scene. Also, an intermediate model is presented to collect the hidden states captured from the input sequence, which performs to extract the main features and implicitly produce multiple event proposals. For caption prediction, the proposed method employs the CARU layer with attention consideration as the primary RNN layer for decoding. Experimental results showed that the proposed work achieves improvements compared to the baseline method and also better performance compared to other state-of-the-art models on the ActivityNet dataset, presenting competitive results in the tasks of video captioning. An advanced video captioning approach is proposed that works in adaptively and effectively addresses the interdependence between event proposals and captions. Additionally, an attention-based multimodel framework is introduced to capture the main context from the frame and sound in the video scene. image

关键词： convolutional neural nets feature extraction pattern classification recurrent neural nets video signal processing

来源：评论

学校读者我要写书评

暂无评论

Advanced video encryption using the opposition lotus effect-elliptic curve cryptography in signal processing applications

引用

SIGNAL image AND video processing 2025年第5期19卷 1-12页

作者： Alsowail, Rakan A. King Saud Univ Comp Skills Selfdev Skills Dept Deanship Common Year 1 Riyadh 11362 Saudi Arabia

The rapid advancement of network technologies and multimedia applications across various sectors, including military and industry, underscores the importance of safeguarding digital data, especially videos and images. Encryption of video is essential in making sensitive content into a completely unrecognizable form. However, traditional methods of encryption face severe difficulties in terms of resolution loss, high computational complexity, and low optimization, which makes the technique less practical and authentic. Addressing such issues, this paper describes the Opposition Lotus Effect-Elliptic Curve Cryptography (OLE-ECC) algorithm. The integration of elliptic curve cryptography with the Opposition Lotus Effect Algorithm enhances video encryption by generating secure key pairs resistant to cryptographic attacks. The video data is encrypted using generated keys by dividing the video stream into segments and applying encryption algorithms to each. This process involves four key phases such as channel segmentation, channel scrambling, generation of key streams from the logistic graph, and channel propagation. The use of multi-equation multi-key cryptography increases the safety of video data significantly while encrypted because it involves applying a number of mathematical equations along with multiple keys. This technique effectively manages the encryption of dynamically generated video files and allows the possibility of encrypting different segments of the video with different keys, making it efficient in performance for real-time streaming applications. For video decryption, the encrypted video is sent over the communication channel, the decryption process includes bit-wise exclusion, rearrangement of channel blocks, and pixel reordering, which increases the reliability of the recovered video frames. Furthermore, the program for revealing, combined with XOR operations, allows the recovery of hidden pixels without loss of quality. Message Authorization Codes a

关键词： Elliptic curve cryptography Lotus effective algorithm Multi-equation multi-key cryptography Opposition-based learning strategy video encryption

来源：评论

学校读者我要写书评

暂无评论

Estimation of video sequence homography based on a first-order estimation method

引用

PATTERN RECOGNITION 2025年 158卷

作者： Zheng, Qixuan Li, Muyu Yan, Hong City Univ Hong Kong Dept Elect Engn Hong Kong 999077 Peoples R China City Univ Hong Kong Ctr Intelligent Multidimens Data Anal Hong Kong 999077 Peoples R China Dalian Univ Technol Sch Control Sci & Engn Dalian 116081 Peoples R China

As it is a pre-processing task, estimation of video-sequence-based homography requires low computational costs and fast evaluation. However, current algorithms for video sequence tasks are commonly based on image-pair homography and do not consider the inner properties of the video sequences. Therefore they take unnecessary computational resources. In this work, we propose a novel algorithm with a first-order estimation method to fill the gap between estimation of image-pairs and video sequence homography. By considering the continuous movement of the camera, the proposed algorithm adopts a first-order estimation to accelerate the estimation process while maintaining its robustness. Instead of extracting many image features from every frame, we demonstrate that estimating a homography matrix with pixel-based texture patterns is effective and sufficient for video sequences. Experiments show that homography estimation with simple one-dimensional texture vectors, as used in our algorithm, can surpass state-of-the-art feature-based algorithms and deep-learning-based methods. This first-order estimation method was more than 40 times faster and real-time estimation used only the CPU.

关键词： Homography estimation First-order estimation method video analysis Camera motion compensation video stabilization

来源：评论

学校读者我要写书评

暂无评论

FPGA-Based Multi-Channel image Acquisition and processing System 2

FPGA-Based Multi-Channel Image Acquisition and Processing Sy...

引用

2nd International conference on Signal processing and Intelligent Computing, SPIC 2024

作者： Chu, Xiaohui Jiang, Qiang Zhang, Zhichao Chen, Tong Wang, Xin Yuan, Yi Shenyang Ligong University School of Automation and Electrical Engineering Shenyang110159 China Shenyang Ligong University Shenyang110159 China

ISBN: (纸本)9798350368888

The deep integration of new-generation information technology and manufacturing is triggering far-reaching industrial changes. Machine vision inspection is widely used in large-scale repetitive industrial production processes. In order to solve the problem of the limited field of view of the machine vision inspection system and the obstruction of the camera view during the inspection process, this article takes the glue dispenser vision inspection system as the project background, and designs a high-performance and low-cost FPGA-based multi-channel image acquisition and transmission system scheme. The system can realize parallel acquisition, decoding, storage, display, and transmission of data from 4 image sensors. The OV5640 is used as the video sensor for video capture, and video data is cached using DDR3 SDRAM and transmitted to the host computer through Gigabit Ethernet communication technology. This thesis utilizes the advantages of FPGA in the field of image processing to perform real-time pipeline operations on video data, solving the limitation of video capture by a single camera and improving the system's real-time performance. Implementing multi-channel image grayscale conversion, filtering, edge detection, and stitching fusion in FPGA. © 2024 IEEE.

关键词： Inspection

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：