Search Results - Inner Mongolia University Library

Energy Optimization of Distributed Video Processing System using Genetic Algorithm with Bayesian Attractor Model

9th IEEE International Conference on Network Softwarization (IEEE NetSoft) - Boosting Future Networks through Advanced Softwarization

Authors: Shimonishi, Hideyuki; Murata, Masayuki; Hasegawa, Go; Techasarntikul, Nattaon (Osaka Univ, Osaka, Japan; Tohoku Univ, Sendai, Miyagi, Japan)

ISBN: (print) 9798350399806

For the future cyber-physical system (CPS) society, it is necessary to construct digital twins (DTs) of the real world in real time using a large number of cameras and sensors. Hence, the energy efficiency of both networks and computers for large-scale distributed video analysis is a major challenge for the full-scale spread of CPSs and DTs. Toward this goal, we first propose a model to arbitrarily split and distribute the video analysis task to terminals, edge servers, and cloud servers and dynamically assign appropriate CNN models to them. System-wide optimization of such distributed processing can reduce overall system power consumption by reducing network bandwidth and efficiently utilizing distributed CPU/GPU resources. To realize this optimization in a real system, we also propose a model to estimate the GPU load, processing time, and power consumption of these devices based on massive experimental measurements. Since such a large-scale optimization is difficult because of the dynamic and multi-objective nature of the problem, we propose a new optimization algorithm that combines a Genetic Algorithm with the Bayesian Attractor Model. Finally, simulation evaluations are performed to demonstrate that the proposed method can minimize system power consumption and satisfy the latency and recognition accuracy requirements of each video analysis, even under changing environmental conditions.

Keywords: Digital twin; video analysis; system optimization; energy efficiency; genetic algorithm; Bayesian Attractor Model

FC3DNET: A FULLY CONNECTED ENCODER-DECODER FOR EFFICIENT DEMOIREING

2024 International Conference on Image Processing

Authors: Du, Zhibo; Peng, Long; Wang, Yang; Cao, Yang; Zha, Zheng-Jun (Univ Sci & Technol China, Dept Automat, Hefei, Peoples R China)

ISBN: (print) 9798350349405; 9798350349399

Moire patterns are commonly seen when taking photos of screens. Camera devices usually have limited hardware performance but take high-resolution photos. However, users are sensitive to photo processing time, which poses a rarely considered efficiency challenge for demoireing methods. To balance network speed and result quality, we propose a Fully Connected enCoder-deCoder based Demoireing Network (FC3DNet). FC3DNet utilizes features at multiple scales in each stage of the decoder to gather comprehensive information, covering both long-range patterns and various local moire styles, both of which are crucial in demoireing. Besides, to make full use of the multiple features, we design a Multi-Feature Multi-Attention Fusion (MFMAF) module to weigh the importance of each feature and compress them for efficiency. These designs enable our network to achieve performance comparable to state-of-the-art (SOTA) methods on real-world datasets while using only a fraction of the parameters, FLOPs, and runtime.

Keywords: Screenshot demoireing; image restoration; image processing; multi-scale architecture

STNAM: A Spatio-Temporal Non-autoregressive Model for Video Prediction

International Joint Conference on Neural Networks (IJCNN)

Authors: Yuan, Yu; Meng, Zhaohui (Hohai Univ, Coll Comp & Informat, Nanjing, Peoples R China)

ISBN: (print) 9798350359329; 9798350359312

Video prediction, a crucial task in various domains, requires efficient models capable of forecasting future frames. However, many current methods rely on an autoregressive mechanism and suffer from low computing efficiency, error propagation, and difficulty in processing data in parallel. With an emphasis on efficiency, we propose the Spatio-Temporal Non-autoregressive Model (STNAM) for video prediction tasks. This model aims to achieve superior computational efficiency and reduced error accumulation compared to conventional methods. STNAM is grounded in an encoder-prediction-decoder framework with spatio-temporal attention and positional encoding. Experimental evaluations on benchmark video datasets showcase the efficacy of the proposed model. It demonstrates competitive performance in predicting video sequences, establishing its potential for real-time video forecasting applications.

Keywords: encoder-prediction-decoder; attention; positional encoding; video prediction
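
Note: the abstract above mentions a positional encoding but does not say which variant STNAM uses. As a minimal sketch, assuming the widely used fixed sinusoidal scheme (an assumption, not a detail confirmed by this record), a positional-encoding table can be built as follows. The function name and sizes are hypothetical.

```python
import math
from typing import List


def sinusoidal_positional_encoding(num_positions: int, dim: int) -> List[List[float]]:
    """Return a num_positions x dim table of sinusoidal position codes.

    Assumption: the classic fixed sin/cos scheme; the record above does not
    state which positional encoding the model actually uses.
    """
    if dim % 2 != 0:
        raise ValueError("dim must be even so sin/cos values come in pairs")
    table = []
    for pos in range(num_positions):
        row = []
        for i in range(0, dim, 2):
            # Each sin/cos pair uses a wavelength that grows with the index i.
            angle = pos / (10000 ** (i / dim))
            row.extend([math.sin(angle), math.cos(angle)])
        table.append(row)
    return table


if __name__ == "__main__":
    pe = sinusoidal_positional_encoding(num_positions=4, dim=8)
    print(len(pe), len(pe[0]))
```

Because every frame index gets a distinct code that does not depend on earlier outputs, all future frames can be addressed at once, which fits a non-autoregressive setting.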

Smart Parking System Based on Image Processing

7th IET Smart Cities Symposium, SCS 2023

Authors: Yusuf, Fatema H.; Mangoud, Mohab A. (Department of Electrical Engineering, University of Bahrain, Bahrain)

ISBN: (print) 9781839539831

This paper proposes a smart parking system, based on image processing, that helps drivers find available parking slots. As the growing number of vehicles leads to parking congestion, finding an empty parking space becomes a time-consuming task for many drivers, especially during rush hours. Hence the importance of a novel camera-based system that greatly eases the parking problem by detecting available parking spaces in real time. The proposed system counts the number of empty slots and detects vehicles using a single webcam, without the need to change the parking infrastructure. The webcam is positioned high enough to see all the parking slots in the parking area. An image of the parking area is taken as a reference, and then all parking slots are selected. The webcam records a video of the parking area while vehicles enter and exit. Both the image and the video are used to determine whether or not each parking space is available. The paper shows the implementation of the proposed system from scratch and then presents the results. The challenges of developing such a solution are also discussed, along with possible enhancements for future work. © The Institution of Engineering & Technology 2023.

Keywords: image processing
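
Note: the abstract above says a reference image and video frames are used together to decide whether a slot is available, but it does not give the comparison rule. One plausible reading, sketched here under that assumption, is to compare each selected slot region of a frame against the same region of the reference image and call the slot free when the mean absolute difference is small. The slot rectangles, the threshold, and all names are hypothetical.

```python
from typing import Dict, List, Tuple

Image = List[List[int]]            # grayscale pixels in the range 0-255
Slot = Tuple[int, int, int, int]   # (top, left, bottom, right), end-exclusive


def slot_difference(reference: Image, frame: Image, slot: Slot) -> float:
    """Mean absolute pixel difference between frame and reference in one slot."""
    top, left, bottom, right = slot
    total = 0
    for r in range(top, bottom):
        for c in range(left, right):
            total += abs(frame[r][c] - reference[r][c])
    return total / ((bottom - top) * (right - left))


def free_slots(reference: Image, frame: Image, slots: Dict[str, Slot],
               threshold: float = 30.0) -> List[str]:
    """Names of slots that still look like the empty reference image.

    The threshold is an illustrative assumption, not a value from the paper.
    """
    return [name for name, slot in slots.items()
            if slot_difference(reference, frame, slot) < threshold]


if __name__ == "__main__":
    reference = [[100] * 8 for _ in range(4)]        # empty parking area
    frame = [row[:] for row in reference]
    for row in frame:                                 # a car covers the right half
        row[4:8] = [200] * 4
    slots = {"A": (0, 0, 4, 4), "B": (0, 4, 4, 8)}
    print(free_slots(reference, frame, slots))
```

A real deployment would also need to cope with lighting changes and shadows, which a fixed threshold on raw differences does not handle.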

Lightweight Monocular 3D Object Detection Based on Attention Mechanism

5th Asia-Pacific Conference on Image Processing, Electronics and Computers, IPEC 2024

Authors: Zhan, Feng; Zhang, Zhi (Wuhan University of Science and Technology, College of Computer Science and Technology, Hubei, Wuhan 430065, China; Hubei Province Key Laboratory of Intelligent Information Processing and Real-time Industrial System, Hubei, Wuhan 430065, China)

ISBN: (print) 9798350374407

In the field of autonomous driving, 3D target detection is an important technology. To address the shortcomings of existing monocular 3D detection algorithms in accuracy and real-time performance, we propose a lightweight 3D target detection algorithm based on the attention mechanism. To ensure the real-time performance of the model, we use depthwise separable convolutions in the backbone network. The feature maps of each scale are then fed to an EMA module, which fuses spatial attention and cross-dimensional coordinate attention to enhance the feature extraction capability of the network, and BiFPN is subsequently used for multi-scale feature fusion. Finally, experiments on the KITTI dataset show that our algorithm is superior to other methods in both accuracy and speed. © 2024 IEEE.

Keywords: Object recognition
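
Note: the abstract above relies on depthwise separable convolutions to keep the backbone lightweight. The saving can be illustrated with a short parameter count. This is a sketch with hypothetical layer sizes (the record does not give the network's actual channel widths), and biases are omitted.

```python
def standard_conv_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a standard k x k convolution (bias omitted)."""
    return k * k * c_in * c_out


def depthwise_separable_params(c_in: int, c_out: int, k: int) -> int:
    """Weights in a k x k depthwise conv followed by a 1 x 1 pointwise conv."""
    depthwise = k * k * c_in   # one k x k filter per input channel
    pointwise = c_in * c_out   # 1 x 1 convolution that mixes channels
    return depthwise + pointwise


if __name__ == "__main__":
    c_in, c_out, k = 64, 128, 3   # hypothetical layer sizes
    std = standard_conv_params(c_in, c_out, k)
    sep = depthwise_separable_params(c_in, c_out, k)
    print(std, sep, round(sep / std, 3))
```

For these sizes the separable layer needs 8,768 weights instead of 73,728, roughly 12% of the standard layer, which is where the speed advantage comes from.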

VLAB – ENHANCED VIDEO AND DATA PRE-PROCESSING

58th Annual International Telemetering Conference, ITC 2023

Authors: Hardt, Simon; Faber, Marc (Safran Data Systems GmbH, Friedrich-Ebert-Str 75, Bergisch Gladbach 51429, Germany)

In today's Flight Test Instrumentation (FTI) video telemetry applications, parallel video channels of the same video signal are acquired with the on-board data recorder. One is typically a high-quality video channel that is recorded directly in the recorder and the other is typically a reduced bit rate channel for real-time telemetry downlink to a ground receiving station. Thus, a certain amount of bandwidth is still required for the telemetry, even though not all image content may be urgently needed. In our approach, we offer the instrumentation engineer an embedded video toolbox: a way to perform pre-processing, custom composition, and extraction of video streams in the onboard recorder. The results are generated using Artificial Intelligence (AI) and create/hide video streams that, among other things, contain only information or regions of interest of video streams in an extremely bandwidth-efficient manner. In addition, the AI can be used for other data interpretation methods. © 2023 International Foundation for Telemetering. All rights reserved.

Keywords: Telemetering

Generative AI for HTTP Adaptive Streaming

15th ACM Multimedia Systems Conference (ACM MMSys)

Authors: Artioli, Emanuele (Alpen Adria Univ Klagenfurt, Christian Doppler Lab ATHENA, Klagenfurt, Austria)

ISBN: (print) 9798400704123

Video streaming stands as the cornerstone of telecommunication networks, constituting over 60% of mobile data traffic as of June 2023. The paramount challenge faced by video streaming service providers is ensuring high Quality of Experience (QoE) for users. In HTTP Adaptive Streaming (HAS), including DASH and HLS, video content is encoded at multiple quality versions, with an Adaptive Bitrate (ABR) algorithm dynamically selecting versions based on network conditions. Concurrently, Artificial Intelligence (AI) is revolutionizing the industry, particularly in content recommendation and personalization. Leveraging user data and advanced algorithms, AI enhances user engagement, satisfaction, and video quality through super-resolution and denoising techniques. However, challenges persist, such as real-time processing on resource-constrained devices, the need for diverse training datasets, privacy concerns, and model interpretability. Despite these hurdles, the promise of Generative Artificial Intelligence emerges as a transformative force. Generative AI, capable of synthesizing new data based on learned patterns, holds vast potential in the video streaming landscape. In the context of video streaming, it can create realistic and immersive content, adapt in real time to individual preferences, and optimize video compression for seamless streaming in low-bandwidth conditions. This research proposal outlines a comprehensive exploration at the intersection of advanced AI algorithms and digital entertainment, focusing on the potential of generative AI to elevate video quality, user interactivity, and the overall streaming experience. The objective is to integrate generative models into video streaming pipelines, unraveling novel avenues that promise a future of dynamic, personalized, and visually captivating streaming experiences for viewers.

Keywords: Video streaming; Generative AI
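
Note: the abstract above describes an ABR algorithm that picks one of several encoded versions according to network conditions. A minimal sketch of the simplest such rule, a throughput-based selector, is given below. It is an illustration only, not the method proposed in the record; the bitrate ladder and the safety factor are hypothetical.

```python
from typing import Sequence


def select_bitrate(ladder_kbps: Sequence[int], throughput_kbps: float,
                   safety: float = 0.8) -> int:
    """Pick the highest bitrate that fits within a fraction of the throughput.

    Falls back to the lowest rung when even that exceeds the budget, so
    playback can continue at the cheapest quality.
    """
    budget = throughput_kbps * safety
    feasible = [b for b in sorted(ladder_kbps) if b <= budget]
    return feasible[-1] if feasible else min(ladder_kbps)


if __name__ == "__main__":
    ladder = [400, 1200, 2500, 5000]   # hypothetical encoding ladder in kbit/s
    for measured in (300, 4000, 10000):
        print(measured, "->", select_bitrate(ladder, measured))
```

Production players typically combine throughput estimates with buffer occupancy; this sketch uses throughput alone to keep the idea visible.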

A new hardware architecture of lightweight and efficient real-time video chaos-based encryption algorithm

JOURNAL OF REAL-TIME IMAGE PROCESSING, 2022, vol. 19, no. 6, pp. 1049-1062

Authors: Hadjadj, Mahieddine Anouar; Sadoudi, Said; Azzaz, Mohamed Salah; Bendecheche, Hichem; Kaibou, Redouane (Ecole Mil Polytech, Lab Telecommun, BP 17, Algiers 16111, Algeria; Ecole Mil Polytech, Lab Syst Elect & Numer, BP 17, Bordj El Bahri, 16111 Algiers, Algeria)

In this paper, we propose a novel chaos-based encryption scheme for securing real-time video data. The proposed encryption algorithm is based on the One-time Pad (OTP) scheme and the unified Lorenz chaotic generator. The peculiarity of the latter is that it can change the chaotic system and its behaviour as well as its parameters. This provides the system with an important dynamic reconfiguration dimension, especially for real-time applications, in case the key is under attack. As a result, the attacker is obliged to perform these calculations again and again. The 3D unified chaotic generator can switch between three chaotic systems according to a control parameter. As a result, the cryptosystem offers several advantages, namely a very large secret key space, low resource and energy consumption, and low latency. Extensive security and differential analyses have been performed, demonstrating the high resistance of the proposed scheme to different attacks. The proposed encryption algorithm is validated for real-time video through an experimental implementation on an FPGA interfaced with a camera. Experimental results indicate that the proposed hardware architecture is very promising since it provides good performance and can be useful in many embedded applications.

Keywords: Video; FPGA; VHDL; Lorenz; Unified; Switching; Chaotic encryption; Embedded systems
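
Note: the abstract above pairs an OTP-style cipher with a chaotic generator. The general idea can be sketched as XOR-ing data with a keystream derived from a chaotic trajectory. The sketch below uses the classical Lorenz system with Euler integration as a stand-in; it is not the unified switching generator or the FPGA design of the paper, the byte extraction rule is a hypothetical choice, and floating-point chaos like this is not cryptographically secure.

```python
def lorenz_keystream(n: int, x: float = 0.0, y: float = 1.0, z: float = 1.05,
                     sigma: float = 10.0, rho: float = 28.0,
                     beta: float = 8.0 / 3.0, dt: float = 0.01,
                     warmup: int = 1000) -> bytes:
    """Derive n keystream bytes from a Lorenz trajectory (illustration only).

    The initial state and parameters play the role of the secret key.
    """
    out = bytearray()
    for step in range(warmup + n):
        # One explicit Euler step of the classical Lorenz equations.
        dx = sigma * (y - x)
        dy = x * (rho - z) - y
        dz = x * y - beta * z
        x, y, z = x + dt * dx, y + dt * dy, z + dt * dz
        if step >= warmup:
            # Hypothetical extraction rule: low-order digits of |x|.
            out.append(int(abs(x) * 1e6) % 256)
    return bytes(out)


def xor_bytes(data: bytes, key: bytes) -> bytes:
    """XOR data with an equally long (or longer) keystream; self-inverse."""
    if len(key) < len(data):
        raise ValueError("keystream shorter than data")
    return bytes(d ^ k for d, k in zip(data, key))


if __name__ == "__main__":
    frame = bytes(range(16))                 # stand-in for raw video bytes
    key = lorenz_keystream(len(frame))
    cipher = xor_bytes(frame, key)
    print(xor_bytes(cipher, key) == frame)
```

Because XOR is its own inverse, the receiver recovers the frame by regenerating the same keystream from the same initial state and parameters.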

ASA-BiSeNet: improved real-time approach for road lane semantic segmentation of low-light autonomous driving road scenes

APPLIED OPTICS, 2023, vol. 62, no. 19, pp. 5224-5235

Authors: Liu, Yang; Yi, Fulong; Ma, Yuhua; Wang, Yongfu (Northeastern Univ, Sch Mech Engn & Automat, Shenyang 110819, Peoples R China; Huaneng Power Int Inc, Dandong Power Plant, Dandong 118000, Peoples R China)

Solving the problem of road environmental perception is one of the essential prerequisites for the autonomous driving of intelligent vehicles, and road lane detection plays a crucial role in road environmental perception. However, road lane detection in complex road scenes is challenging due to poor illumination conditions, occlusion by other objects, and the influence of unrelated road markings. This also hinders the commercial application of autonomous driving technology in various road scenes. To minimize the impact of illumination factors on road lane detection tasks, researchers use deep learning (DL) technology to enhance low-light images. In this study, road lane detection is regarded as an image segmentation problem and is studied with a DL approach to meet the challenge of rapid environmental changes during driving. First, the Zero-DCE++ approach is used to enhance the video frames of the road scene under low-light conditions. Then, based on the bilateral segmentation network (BiSeNet), an approach that associates self-attention with BiSeNet (ASA-BiSeNet) and integrates two attention mechanisms is designed to improve road lane detection ability. Finally, the ASA-BiSeNet approach is trained on a self-made road lane dataset for the road lane detection task, and the BiSeNet approach is compared with the ASA-BiSeNet approach. The experimental results show that the ASA-BiSeNet approach runs at about 152.5 frames per second (FPS) and achieves a mean intersection over union of 71.39%, which can meet the requirements of real-time autonomous driving. © 2023 Optica Publishing Group

Keywords: Feature extraction; image enhancement; image processing; image quality; image quality assessment; image recognition
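
Note: the abstract above reports a mean intersection over union (mIoU) of 71.39%. For readers unfamiliar with the metric, here is a minimal sketch of how mIoU is commonly computed from predicted and ground-truth class labels. The toy labels are hypothetical, and the averaging convention (skipping classes absent from both maps) is one common choice that the record does not specify.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int) -> float:
    """Mean intersection over union across classes, for flattened label maps."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    ious = []
    for c in range(num_classes):
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union:  # skip classes that appear in neither map
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


if __name__ == "__main__":
    # Toy example: 0 = background, 1 = lane (hypothetical labels).
    pred = [0, 0, 1, 1]
    target = [0, 1, 1, 1]
    print(round(mean_iou(pred, target, num_classes=2), 4))
```

In the toy example the background IoU is 1/2 and the lane IoU is 2/3, so the mean is 7/12.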

Water-to-Air Imaging: A Recovery Method for the Instantaneous Distorted Image Based on Structured Light and Local Approximate Registration

ENGINEERING REPORTS, 2025, vol. 7, no. 4

Authors: Jian, Bijian; Peng, Ting; Zhang, Xuebo; Lin, Changyong (Hezhou Univ, Coll Artificial Intelligence, Hezhou, Guangxi, Peoples R China; China West Normal Univ, Sch Elect Informat Engn, Nanchong, Sichuan, Peoples R China)

Imaging through a continuously fluctuating water-air interface (WAI) is challenging. The image obtained in this way will suffer from complex refraction distortions that hinder the observer's accurate identification of the object. Reversing these distortions is an ill-posed problem, and the current restoration methods using high-resolution video streams are difficult to adapt to real-time observation scenarios. This paper proposes a method for restoring instantaneous distorted images based on structured light and local approximate registration. The scheme first uses structured light measurement technology to obtain the fluctuation information of the water surface. Then, the displacement information of the feature points on the distorted structured light image and the standard structured light image is obtained through the feature extraction algorithm and is used to estimate the distortion vector field of the corresponding sampling points in the distorted scene image. On this basis, the local approximate algorithm is used to reconstruct the distortion-free scene image. Experimental results show that the proposed algorithm can not only reduce image distortion and improve image visualization, but also has significantly better computational efficiency than other methods, achieving an "end-to-end" processing effect.

Keywords: image reconstruction; refractive distortion; structured light; water-air imaging
