检索结果-内蒙古大学图书馆

Two-dimensional Fourier transform for joint frequency-AOA estimation

SIGNAL image AND video processing 2024年第1期18卷 137-142页

作者： Cheng, Chi-Hao Pennington, Jason Miami Univ Dept Elect & Comp Engn Oxford OH 45056 USA Def Engn Corp Beavercreek OH 45434 USA

The determination of the signal's angle of arrival (AOA) is crucial for civilian and military applications. Popular AOA determination techniques such as MUSIC are often calculation intensive and require a priori knowledge about signal frequency and incoming signals need to have the same frequency. In this paper, an efficient joint frequency-AOA determination method based on a 2-dimensional fast Fourier transform (2D-FFT) is proposed and investigated. The major advantages of the proposed method lie in its simplicity and the availability of efficient hardware/software designed for FFT calculation. Simulation results demonstrate the validity of the proposed method and its advantage over MUSIC in terms of calculation efficiency and better noise immunity. The results show that the proposed method outperforms MUSIC in a low signal-to-noise ratio (SNR) environment (SNR < - 4 dB) and that it requires CPU time merely a fraction of what is required by MUSIC. Therefore, the proposed method provides a feasible approach for realizing AOA detection in a real-time processing device.

关键词： Angle of arrival (AOA) Fourier transform Space-time signal processing Uniform linear array

来源：评论

学校读者我要写书评

暂无评论

MCA-Deeplabv3+: a cupping spot image segmentation network based on improved Deeplabv3+

引用

SIGNAL image AND video processing 2025年第1期19卷 1-9页

作者： Ma, Lu-Yao Qin, Jian-Hua Liu, Ying-Bin Zeng, Gui-Fen Xu, Bao-Ling Huang, Ting-Ting Guilin Univ Technol Educ Dept Guangxi Zhuang Autonomous Reg Key Lab Adv Mfg & Automat Technol Guilin 541006 Peoples R China Guilin Univ Technol Coll Mech & Control Engn Guilin 541004 Peoples R China Guilin Med Univ Affiliated Hosp 2 Guilin 541000 Peoples R China Sun Yat Sen Univ Sch Syst Sci & Engn Guangzhou 510006 Peoples R China

To monitor the condition of cupping spots in real-time during the operation of the automatic cupping machine, reduce the influence of the surrounding environment on the image, and improve the segmentation accuracy of the cupping spots, this paper proposes a network called MCA-Deeplabv3+. Firstly, backbone network replaced by Mobilenetv2 to reduce the model size and improve feature extraction speed;Secondly, to further enhance the network's feature extraction capabilities, we added dilated convolution channels and integrated the CA attention mechanism into the ASPP module;Finally, data augmentation and brightness adjustment are performed on the dataset to improve the generalization of the model in different environments. The experimental results show that, in comparison with other segmentation models, MCA-Deeplabv3+performs the best in cupping spot segmentation, with mIoU and mPA reaching 93.90% and 96.73%, respectively. The practicality and effectiveness of the cupping spot segmentation model presented in this paper are thoroughly demonstrated.

关键词： Deeplabv3+ Cupping spot Semantic segmentation Deep learning

来源：评论

学校读者我要写书评

暂无评论

FPGA-ACCELERATED HEVC ENCODER FOR ENERGY-EFFICIENT MULTI-ACCESS EDGE COMPUTING 30

FPGA-ACCELERATED HEVC ENCODER FOR ENERGY-EFFICIENT MULTI-ACC...

引用

30th IEEE International conference on image processing (ICIP)

作者： Sjovall, Panu Mercat, Alexandre Vanne, Jarno Tampere Univ Ultra Video Grp Tampere Finland

ISBN: (纸本)9781728198354

High Efficiency video Coding (HEVC) and Multi-access Edge Computing (MEC) technologies can make real-time streaming media services available to users with reasonable bandwidth, but the computational complexity of HEVC tends to lead to increased energy consumption in these schemes. In this paper, we investigate the energy saving opportunities of utilizing a field-programmable gate array (FPGA) based HEVC encoder in edge media servers and devices. In practice, we analyze the energy impact of migrating our Kvazaar software HEVC intra encoder to Intel Arria 10 PCIe FPGA(s) on two platforms: 1) Nokia Airframe Cloud Server with 2.4 GHz dual 14-core Intel Xeon processors and 2) an embedded Jetson AGX Orin board with 2.2 GHz 12-core ARM processor. According to our experiments, FPGA encoding on these two platforms saved 76% and 86% of the energy taken up by software only encoding on Airframe, respectively. These results indicate the potential of FPGA-based video encoder acceleration in future green MEC architectures.

关键词： High Efficiency video Coding (HEVC/H.265) Kvazaar HEVC encoder field-programmable gate array (FPGA) energy efficiency Multi-access Edge Computing (MEC)

来源：评论

学校读者我要写书评

暂无评论

FLOW-GUIDED DEFORMABLE ATTENTION NETWORK FOR FAST ONLINE video SUPER-RESOLUTION 30

FLOW-GUIDED DEFORMABLE ATTENTION NETWORK FOR FAST ONLINE VID...

引用

30th IEEE International conference on image processing (ICIP)

作者： Yang, Xi Zhang, Xindong Zhang, Lei Hong Kong Polytech Univ Dept Comp Hong Kong Peoples R China

ISBN: (纸本)9781728198354

real-time online video super-resolution (VSR) on resource limited applications is a very challenging problem due to the constraints on complexity, latency and memory footprint, etc. Recently, a series of fast online VSR methods have been proposed to tackle this issue. In particular, attention based methods have achieved much progress by adaptively aligning or aggregating the information in preceding frames. However, these methods are still limited in network design to effectively and efficiently propagate the useful features in temporal domain. In this work, we propose a new fast online VSR algorithm with a flow-guided deformable attention propagation module, which leverages corresponding priors provided by a fast optical flow network in deformable attention computation and consequently helps propagating recurrent state information effectively and efficiently. The proposed algorithm achieves state-of-the-art results on widely-used benchmarking VSR datasets in terms of effectiveness and efficiency. Code can be found at https://***/IanYeung/FastOnlineVSR.

关键词： video super-resolution Flow-guided deformable attention Deep neural networks

来源：评论

学校读者我要写书评

暂无评论

High Resolution Panoramic video Synthesis and Objects Detection on Panoramic video 4

High Resolution Panoramic Video Synthesis and Objects Detect...

引用

4th International conference on Innovative Research in Applied Science, Engineering and Technology, IRASET 2024

作者： Romanova, Karina E. Tsybulko, Evgeniya A. Rudenko, Danila I. Khelvas, Aleksandr V. Gilya-Zetinov, Aleksandr A. Tykhonov, Illya V. Mipt Moscow Russia Intc Intelligent Electronics-Valdai Veliky Novgorod Russia Cos&ht Dolgoprudnii Russia Soccos Moscow Russia Ipmce Moscow Russia

ISBN: (纸本)9798350309508

We propose an approach to the creation of a panorama viewport and objects detection within it in real-time on the base of the set of videos from the assembly of cameras. The task of the panorama viewport generation is significantly different from the panoramic image generation. We perform image stitching only with a part of cameras views (region of interest - ROI) and display the resulting images flow on the user's screen. The complexity of this problem is in processing the images from the set of cameras at the moment, since it is impossible to stitch, process and save the whole panorama in the user device memory. The main practical application of the proposed technology is video surveillance for transportation, industry and public objects (airports, railway stations, sea and river ports, plants, stadiums, etc.). Nowadays, there are many scientific articles exploring panorama stitching methods. However, not many existing algorithms can be used for the panorama stitching in real-time as is. real-time panorama stitching is the value of the proposed approach. © 2024 IEEE.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

The Drone-vs-Bird Detection Grand Challenge at ICASSP 2023: A Review of Methods and Results

IEEE OPEN JOURNAL OF SIGNAL PROCESSING

引用

IEEE OPEN JOURNAL OF SIGNAL processing 2024年 5卷 766-779页

作者： Coluccia, Angelo Fascista, Alessio Sommer, Lars Schumann, Arne Dimou, Anastasios Zarpalas, Dimitrios Univ Salento Dept Innovat Engn I-73100 Lecce Italy Fraunhofer Ctr Machine Learning Fraunhofer IOSB D-76131 Karlsruhe Germany Ctr Res & Technol Hellas Informat Technol Inst Visual Comp Lab Thessaloniki 57001 Greece

This paper presents the 6th edition of the "Drone-vs-Bird" detection challenge, jointly organized with the WOSDETC workshop within the IEEE International conference on Acoustics, Speech and Signal processing (ICASSP) 2023. The main objective of the challenge is to advance the current state-of-the-art in detecting the presence of one or more Unmanned Aerial Vehicles (UAVs) in real video scenes, while facing challenging conditions such as moving cameras, disturbing environmental factors, and the presence of birds flying in the foreground. For this purpose, a video dataset was provided for training the proposed solutions, and a separate test dataset was released a few days before the challenge deadline to assess their performance. The dataset has continually expanded over consecutive installments of the Drone-vs-Bird challenge and remains openly available to the research community, for non-commercial purposes. The challenge attracted novel signal processing solutions, mainly based on deep learning algorithms. The paper illustrates the results achieved by the teams that successfully participated in the 2023 challenge, offering a concise overview of the state-of-the-art in the field of drone detection using video signal processing. Additionally, the paper provides valuable insights into potential directions for future research, building upon the main pros and limitations of the solutions presented by the participating teams.

关键词： Deep learning drone detection image and video signal processing unmanned aerial vehicles (UAV)

来源：评论

学校读者我要写书评

暂无评论

STDF-DSCS: A Lightweight Scheme for Compressed video Quality Enhancement 16

STDF-DSCS: A Lightweight Scheme for Compressed Video Quality...

引用

16th International conference on Wireless Communications and Signal processing, WCSP 2024

作者： Pei, Shufan Ren, Guangming Zhao, Tiesong Lin, Liqun Fuzhou University Fuzhou China

ISBN: (纸本)9798350390643

Lossy compression reduces the data amount in the video by sacrificing quality, which leads to severe distortion, especially when videos are overly compressed. Con-sequently, many restoration methods have been proposed to repair compressed videos. However, they are limited by their substantial computational demand and complexity. To conserve computational resources and enhance real-time capabilities, we implement a compressed video enhancement method based on Spatio-Temporal Deformable Fusion (STDF) and subsequently design a lightweight scheme for it. By integrating depthwise separable convolution and channel shuffle techniques, we design a lightweight version of STDF called STDF-DSCS. STDF-DSCS reduces computational complexity and improves model inference speed while maintaining enhancement quality. The depthwise separable convolution reduces the model's computational complexity, while channel shuffle compensates for the shortcomings of group convolution, enhancing inter-channel communication and the network's feature learning capability. Extensive experiments demonstrate that STDF-DSCS significantly enhances computational efficiency while maintaining comparable enhancement effects to existing methods, which can be utilized in real-time video processing scenarios. © 2024 IEEE.

关键词： image reconstruction

来源：评论

学校读者我要写书评

暂无评论

An adaptable physics-informed fault diagnosis approach via hybrid signal processing and transferable feature learning for structural/machinery health monitoring

引用

SIGNAL image AND video processing 2024年第12期18卷 9051-9066页

作者： Zarchi, Milad Shahgholi, Majid Tee, Kong Fah Shahid Rajaee Teacher Training Univ Fac Mech Engn Tehran *** Iran King Fahd Univ Petr & Minerals Dept Civil & Environm Engn Dhahran 31261 Saudi Arabia KFUPM Interdisciplinary Res Ctr Construct & Bldg Mat Dhahran 31261 Saudi Arabia

Structural damages, such as structural looseness and structural cracks, are commonly observed as the root causes of failures in industrial plants. These issues have been extensively studied, and deep diagnostic tools have shown promise in identifying and addressing them. However, these tools rely on large amounts of data, which leads to computational burdens and time consumption. To tackle this challenge, a groundbreaking technique is proposed within the context of this study. The key innovation of this approach lies in its ability to integrate information from various processing functions and utilize an efficient feature bank that facilitates the execution of an effective feature learning method based on a multisource strategy. This novel research also focuses on the selection of transferable features from multiple distributions for diagnostics involving unseen failure distributions. By minimizing the mean squared error function, which is based on various source domains, the accuracy of diagnostics is significantly improved. Furthermore, the joint minimization of diagnostics independence concerning failure distribution, as well as the dimension of the transferable feature space between the source domains, leads to enhanced diagnostics speed and feature visualization. To validate the effectiveness of this approach, a real case study of a structural/machinery vibration dataset is conducted to address the multi-fault diagnosis problem, encompassing machinery health conditions, foundation looseness, and cracks under various operational conditions. The results obtained from this study demonstrate that the proposed algorithm performs remarkably well in real diagnostics scenarios involving unseen failure distributions.

关键词： Structural/machinery health monitoring Multidomain data analysis Multisource information fusion Multiprocessing module Transfer learning Adaptable feature extraction

来源：评论

学校读者我要写书评

暂无评论

Resource-Efficient Design and Implementation of real-time Parking Monitoring System with Edge Device

引用

SENSORS 2025年第7期25卷 2181-2181页

作者： Kim, Jungyoon Jeong, Incheol Jung, Jungil Cho, Jinsoo Gachon Univ Dept IT Convergence Engn Seongnam 13120 South Korea PCT Co Ltd Seongnam 13449 South Korea Gachon Univ Dept Comp Engn Seongnam 13120 South Korea

Parking management systems play a crucial role in addressing parking shortages and operational challenges;however, high initial costs and infrastructure requirements often hinder their implementation. Edge computing offers a promising solution by reducing latency and network traffic, thus optimizing operational costs. Nonetheless, the limited computational resources of edge devices remain a significant challenge. This study developed a real-time vehicle occupancy detection system utilizing SSD-MobileNetv2 on edge devices to process video streams from multiple IP cameras. The system incorporates a dual-trigger mechanism, combining periodic triggers and parking space mask triggers, to optimize computational efficiency and resource usage while maintaining high accuracy and reliability. Experimental results demonstrated that the parking space mask trigger significantly reduced unnecessary AI model executions compared to periodic triggers, while the dual-trigger mechanism ensured consistent updates even under unstable network conditions. The SSD-MobileNetv2 model achieved a frame processing time of 0.32 s and maintained robust detection performance with an F1-score of 0.9848 during a four-month field validation. These findings validate the suitability of the system for real-time parking management in resource-constrained environments. Thus, the proposed smart parking system offers an economical, viable, and practical solution that can significantly contribute to developing smart cities.

关键词： dual-trigger system edge device smart parking system vehicle occupancy detection image processing

来源：评论

学校读者我要写书评

暂无评论

EGBNet: a real-time edge-guided bilateral network for nighttime semantic segmentation

引用

SIGNAL image AND video processing 2023年第6期17卷 3173-3181页

作者： An, Guanhua Guo, Jichang Wang, Yudong Ai, Yufeng Tianjin Univ Sch Elect & Informat Engn Tianjin Peoples R China

Due to poor illumination and low contrast, semantic segmentation of nighttime images faces major challenges. Various segmentation models with a large number of parameters are proposed to improve the performance but lead to an inability to process in real time. To tackle these problems, we propose a real-time edge-guided bilateral network (EGBNet) for nighttime semantic segmentation. Considering the blurred details and low contrast of nighttime images, we propose a lightweight multi-dilation dense aggregation module and introduce an efficient edge head to improve the ability to distinguish target features from the nighttime background. Moreover, a self-adaptive feature fusion module is proposed for the bilateral segmentation network to enhance the feature representation and generalization ability by fully using multi-scale feature maps. To capture more useful information from limited nighttime images, we further use the knowledge distillation strategy to improve the segmentation performance. Extensive experiments on ACDC and BDD datasets demonstrate the effectiveness of our EGBNet by achieving a satisfactory trade-off between segmentation accuracy and inference speed. Specifically, EGBNet achieves 55.56% mIoU on the ACDC test set with 9.4 M parameters and 60FPS speed for a 1080 x 1920 input image on a single NVIDIA 2080Ti.

关键词： Nighttime semantic segmentation real-time processing Edge-guided Deep learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：