检索结果-内蒙古大学图书馆

5th Asia-Pacific conference on image processing, Electronics and Computers, IPEC 2024

作者： Zhan, Feng Zhang, Zhi Wuhan University of Science and Technology College of Computer Science and Technology Hubei Wuhan430065 China Information Processing and Real-time Industrial System Hubei Province Key Laboratory of Intelligent Hubei Wuhan430065 China Information Processing and Real-time Industrial System Hubei Province Key Laboratory of Intelligent Hubei Wuhan30065 China

ISBN: (纸本)9798350374407

In the field of autonomous driving, 3D target detection is an important technology. In view of the shortcomings of existing monocular 3D detection algorithms in terms of accuracy and real-time performance, we propose a lightweight 3D target detection algorithm based on the attention mechanism. In order to ensure the real-time performance of the model, we use depthwise separable convolutions in the backbone network. The feature maps of each scale are then connected to an EMA module, which can fuse spatial attention and cross-dimensional Coordinate attention is used to enhance the feature extraction capability of the network, and BiFPN is subsequently used for multi-scale feature fusion. Finally, experiments were conducted on the KITTI dataset, and the results showed that our algorithm was superior to other methods in both accuracy and speed. © 2024 IEEE.

关键词： Object recognition

来源：评论

学校读者我要写书评

暂无评论

Energy Optimization of Distributed video processing System using Genetic Algorithm with Bayesian Attractor Model 9

Energy Optimization of Distributed Video Processing System u...

引用

9th IEEE International conference on Network Softwarization (IEEE NetSoft) - Boosting Future Networks through Advanced Softwarization

作者： Shimonishi, Hideyuki Murata, Masayuki Hasegawa, Go Techasarntikul, Nattaon Osaka Univ Osaka Japan Tohoku Univ Sendai Miyagi Japan

ISBN: (纸本)9798350399806

For the future cyber-physical system (CPS) society, it is necessary to construct digital twins (DTs) of a real world in real time using a lot of cameras and sensors. Hence, the energy efficiency of both networks and computers for largescale distributed video analysis is a major challenge for the full-scale spread of CPSs and DTs. Toward this goal, we first propose a model to arbitrarily split and distribute the video analysis task to terminals, edge servers, and cloud servers and dynamically assign appropriate CNN models to them. System-wide optimization of such distributed processing can reduce overall system power consumption by reducing network bandwidth and efficiently utilizing distributed CPU/GPU resources. To realize this optimization in a real system, we also propose a model to estimate the GPU load, processing time, and power consumption of these devices based on massive experimental measurements. Since such a large-scale optimization is difficult because of the dynamic and multi-objective nature of the problem, we propose a new optimization algorithm composed of Genetic Algorithm and Bayesian Attractor Model. Finally, simulation evaluations are performed to demonstrate that the proposed method can minimize system power consumption and satisfy latency and recognition accuracy requirements of each video analysis, even under changing environmental conditions.

关键词： Digital twin video analysis system optimization energy efficiency genetic algorithm Bayesian Attractor Model

来源：评论

学校读者我要写书评

暂无评论

Smart Parking System Based on image processing 7

Smart Parking System Based on Image Processing

引用

7th IET Smart Cities Symposium, SCS 2023

作者： Yusuf, Fatema H. Mangoud, Mohab A. Department of Electrical Engineering University of Bahrain Bahrain

ISBN: (纸本)9781839539831

This paper proposed a smart parking system that helps drivers in seeking out available parking slots based on image processing. With the increased number of vehicles which leads to the parking congestion, finding an empty parking spaces become a time-consuming task for many drivers specially during rush hours. From this comes the importance of implementing a novel camera-based system that facilities the parking issue to a huge level by detecting the available parking spaces in real-time. The proposed system counts the number of empty slots and detects the vehicles using a single webcam without the need of changing the parking infrastructure. A webcam is positioned in a high place that it can see all the parking slots in the parking area. An image of the parking area is taken as a reference and then all parking slots are selected. A webcam is used to record a video of the parking area while vehicles enter and exit. Both the image and the video are used to determine whether or not the parking space is available. The paper shows the implementation of the proposed system from scratch and then presents the results. The challenges of developing such a solution are also mentioned in addition to possible enhancements for future work. © The Institution of Engineering & Technology 2023.

关键词： image processing

来源：评论

学校读者我要写书评

暂无评论

Generative AI for HTTP Adaptive Streaming 24

Generative AI for HTTP Adaptive Streaming

引用

15th ACM Multimedia Systems conference (ACM MMSys)

作者： Artioli, Emanuele Alpen Adria Univ Klagenfurt Christian Doppler Lab ATHENA Klagenfurt Austria

ISBN: (纸本)9798400704123

video streaming stands as the cornerstone of telecommunication networks, constituting over 60% of mobile data traffic as of June 2023. The paramount challenge faced by video streaming service providers is ensuring high Quality of Experience (QoE) for users. In HTTP Adaptive Streaming (HAS), including DASH and HLS, video content is encoded at multiple quality versions, with an Adaptive Bitrate (ABR) algorithm dynamically selecting versions based on network conditions. Concurrently, Artificial Intelligence (AI) is revolutionizing the industry, particularly in content recommendation and personalization. Leveraging user data and advanced algorithms, AI enhances user engagement, satisfaction, and video quality through super-resolution and denoising techniques. However, challenges persist, such as real-time processing on resource-constrained devices, the need for diverse training datasets, privacy concerns, and model interpretability. Despite these hurdles, the promise of Generative Artificial Intelligence emerges as a transformative force. Generative AI, capable of synthesizing new data based on learned patterns, holds vast potential in the video streaming landscape. In the context of video streaming, it can create realistic and immersive content, adapt in real time to individual preferences, and optimize video compression for seamless streaming in low-bandwidth conditions This research proposal outlines a comprehensive exploration at the intersection of advanced AI algorithms and digital entertainment, focusing on the potential of generative AI to elevate video quality, user interactivity, and the overall streaming experience. The objective is to integrate generative models into video streaming pipelines, unraveling novel avenues that promise a future of dynamic, personalized, and visually captivating streaming experiences for viewers.

关键词： video Streaming Generative AI

来源：评论

学校读者我要写书评

暂无评论

Water-to-Air Imaging: A Recovery Method for the Instantaneous Distorted image Based on Structured Light and Local Approximate Registration

ENGINEERING REPORTS

引用

ENGINEERING REPORTS 2025年第4期7卷

作者： Jian, Bijian Peng, Ting Zhang, Xuebo Lin, Changyong Hezhou Univ Coll Artificial Intelligence Hezhou Guangxi Peoples R China China West Normal Univ Sch Elect Informat Engn Nanchong Sichuan Peoples R China

Imaging through a continuously fluctuating water-air interface (WAI) is challenging. The image obtained in this way will suffer from complex refraction distortions that hinder the observer's accurate identification of the object. Reversing these distortions is an ill-posed problem, and the current restoration methods using high-resolution video streams are difficult to adapt to real-time observation scenarios. This paper proposes a method for restoring instantaneous distorted images based on structured light and local approximate registration. The scheme first uses structured light measurement technology to obtain the fluctuation information of the water surface. Then, the displacement information of the feature points on the distorted structured light image and the standard structured light image is obtained through the feature extraction algorithm and is used to estimate the distortion vector field of the corresponding sampling points in the distorted scene image. On this basis, the local approximate algorithm is used to reconstruct the distortion-free scene image. Experimental results show that the proposed algorithm can not only reduce image distortion and improve image visualization, but also has significantly better computational efficiency than other methods, achieving an "end-to-end" processing effect.

关键词： image reconstruction refractive distortion structured light water-air imaging

来源：评论

学校读者我要写书评

暂无评论

An automated hybrid decoupled convolutional network for laceration segmentation and grading of retinal diseases using optical coherence tomography (OCT) images

引用

SIGNAL image AND video processing 2024年第3期18卷 2903-2927页

作者： Mani, Pavithra Ramachandran, Neelaveni Paul, Sweety Jose Ramesh, Prasanna Venkatesh Kongu Engn Coll Dept ECE 85 Perumalkadu StChennimalai Rd Erode 638060 India PSG Coll Technol Dept EEE Coimbatore India Mahathma Eye Hosp Pvt Ltd Dept Glaucoma & Res Trichy Tamil Nadu India

Diabetic retinopathy (DR) is a complication of diabetes that damages the retina and can cause blindness if untreated due to high blood sugar levels. To accurately diagnose and grade DR, it is important to identify retinal lacerations or biomarkers. Optical coherence tomography (OCT) imaging is a commonly used tool by ophthalmologists due to its detailed visualisation of retinal lacerations, which aids in the precise treatment of retinal abnormalities. However, the number of scans obtained daily exceeds the ophthalmologist's capacity to meaningfully analyse them, given the wide range of severe OCT applications and the prevalence of visual disorders. In the past, several research studies have attempted to address this issue using OCT scans. However, none of them have tried to simultaneously perform retinal laceration segmentation and DR grading. To address this problem, we have proposed a new architecture-a cutting-edge decoupled convolutional network consisting of three distinct modules that work together to achieve accurate DR grading based on clinical standards aided by retinal laceration segmentation. Our proposed paper introduces a deep learning framework that leverages dual guidance to improve performance on two related tasks. It was extensively tested using 26,841 multi-vendor scans, four publicly available datasets, and a real-time dataset containing 307 OCT scans from various patients. The results confirmed the effectiveness of our design, with a mean Dice score of 0.88 (4.76% improvement) in retinal laceration segmentation and 98.93% accuracy in DR grading, with an actual positive rate of about 98.46% and a true negative rate of 99.37%.

关键词： Retinal image Optical coherence tomography (OCT) Diabetic retinopathy (DR) Deep learning Convolution neural network (CNN) Noise removal Segmentation

来源：评论

学校读者我要写书评

暂无评论

ASA-BiSeNet: improved real-time approach for road lane semantic segmentation of low-light autonomous driving road scenes

引用

APPLIED OPTICS 2023年第19期62卷 5224-5235页

作者： Liu, Yang Yi, Fulong Ma, Yuhua Wang, Yongfu Northeastern Univ Sch Mech Engn & Automat Shenyang 110819 Peoples R China Huaneng Power Int Inc Dandong Power Plant Dandong 118000 Peoples R China

The solution to the problem of road environmental perception is one of the essential prerequisites to realizing the autonomous driving of intelligent vehicles, and road lane detection plays a crucial role in road environmental per-ception. However, road lane detection in complex road scenes is challenging due to poor illumination conditions, the occlusion of other objects, and the influence of unrelated road markings. It also hinders the commercial appli-cation of autonomous driving technology in various road scenes. In order to minimize the impact of illumination factors on road lane detection tasks, researchers use deep learning (DL) technology to enhance low-light images. In this study, road lane detection is regarded as an image segmentation problem, and road lane detection is studied based on the DL approach to meet the challenge of rapid environmental changes during driving. First, the Zero-DCE++ approach is used to enhance the video frame of the road scene under low-light conditions. Then, based on the bilateral segmentation network (BiSeNet) approach, the approach of associate self-attention with BiSeNet (ASA-BiSeNet) integrating two attention mechanisms is designed to improve the road lane detection ability. Finally, the ASA-BiSeNet approach is trained based on the self-made road lane dataset for the road lane detection task. At the same time, the approach based on the BiSeNet approach is compared with the ASA-BiSeNet approach. The experimental results show that the frames per second (FPS) of the ASA-BiSeNet approach is about 152.5 FPS, and its mean intersection over union is 71.39%, which can meet the requirements of real-time autonomous driving. & COPY;2023 Optica Publishing Group

关键词： Feature extraction image enhancement image processing image quality image quality assessment image recognition

来源：评论

学校读者我要写书评

暂无评论

Binarydnet53: a lightweight binarized CNN for monkeypox virus image classification

引用

SIGNAL image AND video processing 2024年第10期18卷 7107-7118页

作者： Biswas, Debojyoti Tesic, Jelena Texas State Univ Dept Comp Sci 601 Univ Dr San Marcos TX 78666 USA

The recent widespread increase of the Mpox (formerly monkeypox) virus infections in South Asian and African countries has raised concerns among medical professionals regarding the potential emergence of another pandemic in those regions. According to the World Health Organization (WHO) "emergency meeting" on May 20, 2022, there were 82,809 confirmed cases reported in 110 countries. With the number of available test kits surpassing the count of positive/probable cases, there is a pressing need to develop a robust and lightweight classifier model that can alleviate the burden of physical testing kits and expedite the detection process. The existing state-of-the-art primarily focuses on achieving high accuracy in modeling Mpox without considering factors such as modeling suitability, real-time inferencing, and adaptability to resource-constrained CPU-only mobile devices. In this research, we propose a novel lightweight binarized DarkNet53 model, referred to as BinaryDNet53, which is approximately similar to 20x\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sim 20\times $$\end{document} more computationally efficient and similar to 2x\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sim 2\times $$\end{document} more power-efficient than the current state-of-the-art. This model demonstrates smooth detection capabilities when deployed on small hand-held or embedded devices. Firstly, we binarize the weights and biases of the DarkNet53 model to prevent high computational costs and memory usage. Next, our work introduces large-margin feature learning and weighted loss calculation to enhance results, particularly on c

关键词： Monkeypox classification Lightweight CNN Binarized CNN Power efficient DNN Deep learning approaches to detect Mpox

来源：评论

学校读者我要写书评

暂无评论

VLAB – ENHANCED video AND DATA PRE-processing 58

VLAB – ENHANCED VIDEO AND DATA PRE-PROCESSING

引用

58th Annual International Telemetering conference, ITC 2023

作者： Hardt, Simon Faber, Marc Safran Data Systems GmbH Friedrich-Ebert-Str 75 Bergisch Gladbach51429 Germany

In today's Flight Test Instrumentation (FTI) video telemetry applications, parallel video channels of the same video signal are acquired with the on-board data recorder. One is typically a high-quality video channel that is recorded directly in the recorder and the other is typically a reduced bit rate channel for real-time telemetry downlink to a ground receiving station. Thus, a certain amount of bandwidth is still required for the telemetry, even though not all image content may be urgently needed. In our approach, we offer an embedded video toolbox to the instrumentation engineer. A way to perform pre-processing, custom composition and extraction of video streams in the onboard recorder. The results are generated using Artificial Intelligence (AI) and create/hide video streams that, among other things, contain only information or regions of interest of video streams in an extremely bandwidth efficient manner. In addition, the AI can be used for other data interpretation methods. © 2023 International Foundation for Telemetering. All rights reserved.

关键词： Telemetering

来源：评论

学校读者我要写书评

暂无评论

real-time Computational Efficiency Vehicle Detection and Counting Utilizing the Background Subtraction Technique and Non-Maximum Suppression Techniques

Informatica (Slovenia)

引用

Informatica (Slovenia) 2025年第18期49卷 29-38页

作者： Mezaal, Jameelah Kadhim Dept. of Student Activities University of Basrah Basrah Iraq

By combining cloud computing, computer vision, and Internet of Things (IoT), it would be able to make the most of both sides. Because the IoT is mostly composed of connected, contained gadgets, it can store and process data gathered through the application of computer vision algorithms. It is able to achieve this by making use of the almost infinite resources provided by cloud organizations, including processing and storage services. The development and execution of a computer vision-based system are examined in this paper. that counts and identifies automobiles using machine learning (ML). The system consists of multiple stages, including initialization, background subtraction, object detection, bounding rectangles, vehicles counting and evaluation criteria. The proposed methodology first separates moving objects from the background and then employs a statistical technique called Mixture of Gaussians (MOG) for background subtraction to identify the automobiles in the image and Non-Maximum Suppression (NMS) to filter out overlapping bounding boxes to enhance the detection operation. The experiment's outcomes show how effectively cars can be found and counted. The result of the experiments using accuracy, precision, f1-score and recall are about 90% for the different types of video and from many corners. © 2025 Slovene Society Informatika. All rights reserved.

关键词： image enhancement

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：