Search Results - Inner Mongolia University Library

2nd IEEE International Conference on Data Science and Computer Application, ICDSCA 2022

Authors: Linjun, Li. Affiliation: Chengdu University of Technology, College of Computer Science and Cyber Security, Oxford Brookes College, Chengdu, China

ISBN (digital): 9781665472005

ISBN (print): 9781665472005

This paper develops a remote video monitoring system based on ARM. The hardware is built around the ARM S3C2440 embedded chip, and the circuit structure of the main hardware modules, such as the power supply, Ethernet interface, JTAG interface, and data storage, is designed. The software uses Linux as the operating system, Video4Linux for video image acquisition, JPEG image compression for video compression, and the Real-time Transport Protocol (RTP) for video image encapsulation, network transmission, and control. The experimental results show that the method can simultaneously perform real-time online detection of various anomalies that suddenly appear in remote video surveillance, such as occlusion, blurring, and scene switching, with an accuracy of up to 88.75%. © 2022 IEEE.

Keywords: image recognition
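
As an illustration of the RTP encapsulation step mentioned in the abstract above, the following Python sketch packs the fixed 12-byte RTP header in front of a JPEG payload. It is not taken from the paper: the function name `build_rtp_packet` and its parameters are hypothetical, payload type 26 is the static JPEG assignment from RFC 3551, and the JPEG-specific payload header defined in RFC 2435 is deliberately left out to keep the sketch short.

```python
import struct

RTP_VERSION = 2
PT_JPEG = 26  # static payload type for JPEG video (RFC 3551)


def build_rtp_packet(payload, seq, timestamp, ssrc,
                     marker=False, payload_type=PT_JPEG):
    """Prepend the fixed 12-byte RTP header to a payload (hypothetical helper)."""
    # Byte 0: version in the top two bits; padding, extension and CSRC count are 0.
    byte0 = RTP_VERSION << 6
    # Byte 1: marker bit followed by the 7-bit payload type.
    byte1 = (int(marker) << 7) | (payload_type & 0x7F)
    header = struct.pack(
        "!BBHII",                 # network byte order: 1 + 1 + 2 + 4 + 4 bytes
        byte0,
        byte1,
        seq & 0xFFFF,             # 16-bit sequence number
        timestamp & 0xFFFFFFFF,   # 32-bit media timestamp
        ssrc & 0xFFFFFFFF,        # 32-bit synchronization source identifier
    )
    return header + payload


packet = build_rtp_packet(b"\xff\xd8", seq=1, timestamp=90000,
                          ssrc=0x1234, marker=True)
print(len(packet))  # 12 header bytes plus 2 payload bytes
```

A receiver would use the sequence number to detect loss and reordering, and for video the marker bit conventionally flags the last packet of a frame.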

TFA Block: Temporal Feature Alignment Block for Video Frame Interpolation

2nd International Conference on Image Processing, Computer Vision and Machine Learning, ICICML 2023

Authors: Shang, Yingda; Lu, Zongqing; Zhang, Jingguo; Chen, Yuxiang. Affiliations: Tsinghua University, Tsinghua Shenzhen International Graduate School, Shenzhen, China; Tsinghua University, Tsinghua Shenzhen International Graduate School, China

ISBN (print): 9798350331417

Video frame interpolation is an important low-level task in image processing, widely applied in video restoration and enhancement, media players, and display devices. In this work, we design an optical flow module, termed the TFA block, for temporal feature alignment based on spatial-temporal modeling, targeting fast and accurate video frame interpolation. It first roughly matches adjacent input frames by spatial location, then uses spatial-temporal modeling to align spatial features in the time domain, and finally merges along the time dimension to obtain the optical flow between the consecutive input frames and the target frame. The TFA block needs neither a complex network structure nor a pre-trained optical flow network; instead, it obtains task-oriented bidirectional intermediate optical flow from the output frame to the given inputs for backward warping. Compared with state-of-the-art (SOTA) algorithms, our network demonstrates both improved performance and fast inference speed. © 2023 IEEE.

Keywords: optical flow; spatial-temporal modeling; temporal feature alignment; video frame interpolation

Application of producer-consumer pattern in real-time infrared image inversion at the missile range

2023 Advanced Fiber Laser Conference, AFL 2023

Authors: Ji, Bo; Tan, Shili; Guo, Junyu; Sun, Yifu; Xu, Wengang. Affiliations: Chinese People's Liberation Army 95841 Troops, Jiuquan, Gansu 735018, China; Chinese People's Liberation Army 93119 Troops, Jiuquan, Gansu 735018, China; COMAC Shanghai Aircraft Design and Research Institute, Shanghai 201206, China

ISBN (print): 9781510677661

The current infrared image inversion of ground-based and space-based test targets at the missile range adopts a sequential execution method of "image reading - target tracking extraction - inversion calculation - result storage", which is inefficient and requires post-processing. With new experimental tasks requiring real-time processing of infrared images, the current processing mode cannot meet the requirements. This article proposes a multi-threaded real-time improvement scheme based on the producer/consumer pattern. By establishing data buffers between producers and consumers, the two can execute independently, which decouples them and improves execution efficiency. Firstly, the core processing flow of the current inversion algorithm is analyzed and a flowchart is provided, covering four key steps: image reading, target tracking extraction, inversion calculation, and result storage. Using program instrumentation, the execution time of each step is obtained: image reading takes about 5.4 ms, target tracking extraction about 22.5 ms, inversion calculation about 1.0 ms, and result storage about 16.2 ms. Secondly, we propose a "producer/consumer-producer/consumer" pattern for infrared image inversion, in which each part can be executed concurrently by different threads or thread groups. Based on the measured execution times, we divide the steps into three weakly coupled modules: producer, consumer-producer, and consumer. The producer corresponds to image reading and the consumer to result storage. Since the target tracking extraction process takes much longer than the inversion calculation process and the two are closely related, we combine them into the consumer-producer, which is both a consumer (relative to the upstream producer) and a producer (relative to the downstream consumer). Thirdly, we determine the number of threads for producer, consumer-producer,

Keywords: Target tracking
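
The three-module split described in the abstract above can be sketched with Python's standard `queue` and `threading` modules. This is a minimal sketch under stated assumptions, not the authors' implementation: the stage functions, the `None` end-of-stream marker, the buffer size, and the placeholder arithmetic standing in for tracking and inversion are all hypothetical, and each stage runs on a single thread.

```python
import queue
import threading

SENTINEL = None  # hypothetical end-of-stream marker passed down the pipeline


def read_images(n_frames, out_q):
    """Producer: stands in for image reading."""
    for frame_id in range(n_frames):
        out_q.put(frame_id)
    out_q.put(SENTINEL)


def track_and_invert(in_q, out_q):
    """Consumer-producer: stands in for target tracking extraction plus inversion."""
    while True:
        frame = in_q.get()
        if frame is SENTINEL:
            out_q.put(SENTINEL)  # forward the marker to the final stage
            break
        out_q.put((frame, frame * 2))  # placeholder for the real computation


def store_results(in_q, results):
    """Consumer: stands in for result storage."""
    while True:
        item = in_q.get()
        if item is SENTINEL:
            break
        results.append(item)


def run_pipeline(n_frames, buffer_size=8):
    """Run the three stages concurrently, linked by two bounded buffers."""
    q1 = queue.Queue(maxsize=buffer_size)  # producer -> consumer-producer
    q2 = queue.Queue(maxsize=buffer_size)  # consumer-producer -> consumer
    results = []
    threads = [
        threading.Thread(target=read_images, args=(n_frames, q1)),
        threading.Thread(target=track_and_invert, args=(q1, q2)),
        threading.Thread(target=store_results, args=(q2, results)),
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


print(run_pipeline(5))
```

As a rough estimate from the timings quoted in the abstract, one sequential pass costs about 5.4 + 22.5 + 1.0 + 16.2 = 45.1 ms per frame, whereas a pipeline with one thread per stage is bounded by its slowest stage, the combined consumer-producer at about 23.5 ms, ignoring queueing and synchronization overhead. In CPython, threads do not run CPU-bound Python code in parallel, so the sketch shows the structure of the pattern rather than a speedup.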

Creation of Annotated Synthetic UAV Video Dataset for Object Detection and Tracking

31st IEEE Conference on Signal Processing and Communications Applications (SIU)

Authors: Yilmaz, Can; Maras, Bahri; Arica, Nafiz; Ertuzun, Aysin Baytan. Affiliations: Bahcesehir Univ, Yapay Zeka Muhendisligi Bolumu, Istanbul, Turkiye; Bogazici Univ, Elekt & Elekt Muhendisligi Bolumu, Istanbul, Turkiye; Piri Reis Univ, Bilisim Sistemleri Muhendisligi Bolumu, Istanbul, Turkiye

ISBN (print): 9798350343557

Object detection and tracking in videos obtained from unmanned aerial vehicles (UAVs) with deep convolutional neural networks (DCNNs) requires extensive ground truth optical flow, occlusion, and segmentation datasets of various objects or vehicles during training and testing. Such ground truth information is not widely available in the literature because it is difficult to label or extract from real-life recorded UAV video. In this study, ground truth optical flow, occlusion, and segmentation datasets were produced synthetically from the UAV point of view, for the first time and in a novel way, to fill this gap in the literature. The ground truth datasets were created for each vehicle by applying the homography method to the triangles (mesh) automatically generated by the Unity engine. With this method, synthetic datasets of size 1920x1080 and 250x250, consisting of 100 scenarios, were obtained.

Keywords: Optical Flow; Segmentation; Occlusion; Synthetic Dataset; Unmanned Air Vehicle (UAV); Deep Convolutional Neural Network (DCNN)

Real-time moving vehicle detection in satellite videos based on hole features and motion direction information

2024 International Conference on Remote Sensing, Mapping, and Geographic Information Systems, RSMG 2024

Authors: Yuan, Jieran; Song, Beibei; Sun, Wenfang. Affiliations: School of Information Engineering, Chang’an University, Xi’an 710018, China; School of Aerospace Science and Technology, Xidian University, Xi’an 710068, China

ISBN (print): 9781510685826

Capturing moving-vehicle information from satellite videos is crucial for real-time traffic monitoring and emergency response. However, vehicles in satellite videos are small, lack detailed textural features, and are easily obscured by complex backgrounds. Traditional frame differencing and background subtraction methods often produce many false positives and false negatives, while deep learning methods struggle to meet real-time processing requirements. To strike a balance between detection performance and processing efficiency, a real-time vehicle detection method is proposed that combines hole features extracted from frame differencing with motion direction estimation. Initially, a multiframe image accumulation (MIA) strategy is employed to enhance the visibility of vehicle targets and suppress background noise. Subsequently, the hole features of moving vehicles are extracted by differencing adjacent accumulated images, leading to a hole model for coarse detection of moving vehicles. Then, by estimating the motion direction of vehicles and enforcing direction consistency constraints, holes of neighboring moving vehicles are matched to improve detection accuracy. Finally, a novel region extraction algorithm that integrates target hole features and motion direction information is designed to effectively suppress false positives generated by background noise. The method exhibits superior detection performance on the VISO benchmark dataset, achieving a recognition accuracy of 89.5% while meeting real-time processing requirements with an average processing time of only 0.04 seconds per frame. © 2024 SPIE.

Keywords: image segmentation
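
The first two steps of the abstract above, accumulating several frames and then differencing adjacent accumulated images, can be sketched in plain Python. The abstract does not say how accumulation is done, so this sketch assumes a per-pixel average over a window of frames; the function names and the threshold are hypothetical, and the hole model, direction constraints, and region extraction are not shown.

```python
def accumulate(frames):
    """Average a window of equally sized grayscale frames pixel by pixel.

    Assumption of this sketch: accumulation is a plain mean over the window.
    """
    n = len(frames)
    height, width = len(frames[0]), len(frames[0][0])
    return [[sum(f[y][x] for f in frames) / n for x in range(width)]
            for y in range(height)]


def difference_mask(acc_a, acc_b, threshold):
    """Mark pixels whose accumulated intensity changed by more than threshold."""
    return [[abs(a - b) > threshold for a, b in zip(row_a, row_b)]
            for row_a, row_b in zip(acc_a, acc_b)]


# Toy 1x4 frames: a bright pixel moves one position to the right between windows.
window_a = [[[0, 9, 0, 0]], [[0, 9, 0, 0]]]
window_b = [[[0, 0, 9, 0]], [[0, 0, 9, 0]]]

mask = difference_mask(accumulate(window_a), accumulate(window_b), threshold=4)
print(mask)  # True where the accumulated images differ strongly
```

In the toy input, the mask is set both where the object left and where it arrived, which is the kind of paired response a later matching step would have to resolve.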

Data-independent low-complexity KLT approximations for image and video coding

SIGNAL PROCESSING-IMAGE COMMUNICATION, 2022, Vol. 101, No. 0, pp. 116585-116585

Authors: Radunz, Anabeth P.; da Silveira, Thiago L. T.; Bayer, Fabio M.; Cintra, Renato J. Affiliations: Univ Fed Pernambuco, Programa Posgrad Estat, Recife, PE, Brazil; Univ Fed Rio Grande do Sul, Inst Informat, Porto Alegre, RS, Brazil; Univ Fed Santa Maria, Dept Estat, Santa Maria, RS, Brazil; Univ Fed Santa Maria, LACESM, Santa Maria, RS, Brazil; Univ Fed Pernambuco, Dept Estat, Signal Proc Grp, Recife, PE, Brazil

The Karhunen-Loeve transform (KLT) is often used for data decorrelation and dimensionality reduction. The KLT is able to optimally retain the signal energy in only a few transform components, making it mathematically suitable for image and video compression. However, in practice, its high computational cost and dependence on the input signal preclude its application in real-time scenarios. This work proposes low-computational-cost approximations for the KLT. We focus on the blocklengths N ∈ {4, 8, 16, 32} because they are widely employed in image and video coding standards such as JPEG and High Efficiency Video Coding (HEVC). Extensive computational experiments demonstrate the suitability of the proposed low-complexity transforms for image and video compression.

Keywords: Approximate transform; Image compression; Karhunen-Loeve transform; Low-complexity transforms; Signed KLT

IMAGE AESTHETICS ASSESSMENT VIA LEARNABLE QUERIES

49th IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)

Authors: Xiong, Zhiwei; Zhang, Yunfan; Shen, Zhiqi; Ren, Peiran; Yu, Han. Affiliations: Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore; Nanyang Technol Univ, Alibaba NTU Singapore Joint Res Inst, Singapore, Singapore; Alibaba Grp, Hangzhou, Peoples R China

ISBN (print): 9798350344868; 9798350344851

Image aesthetics assessment (IAA) aims to estimate the aesthetics of images. Depending on the content of an image, diverse criteria need to be selected to assess its aesthetics. Existing works utilize pre-trained vision backbones based on content knowledge to learn image aesthetics. However, training those backbones is time-consuming and suffers from attention dispersion. Inspired by learnable queries in vision-language alignment, we propose the Image Aesthetics Assessment via Learnable Queries (IAA-LQ) approach. It adapts learnable queries to extract aesthetic features from pre-trained image features obtained from a frozen image encoder. Extensive experiments on real-world data demonstrate the advantages of IAA-LQ, which beats the best state-of-the-art method by 2.2% and 2.1% in terms of SRCC and PLCC, respectively.

Keywords: Aesthetics Assessment; Learnable Queries
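
For readers unfamiliar with the two metrics quoted in the abstract above, the sketch below computes the Pearson linear correlation coefficient (PLCC) and the Spearman rank correlation coefficient (SRCC) between predicted and reference scores. It is a plain-Python illustration, not the paper's evaluation code: the function names and the toy scores are hypothetical, and the ranking helper assumes there are no tied values.

```python
import math


def pearson(xs, ys):
    """Pearson linear correlation coefficient (PLCC) of two equal-length lists."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def ranks(values):
    """Ranks from 1 to n; this simple version assumes no ties."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0] * len(values)
    for rank, index in enumerate(order, start=1):
        result[index] = rank
    return result


def spearman(xs, ys):
    """Spearman rank correlation (SRCC): Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


predicted = [1.0, 2.0, 3.0, 4.0]   # toy model scores
reference = [1.0, 4.0, 9.0, 16.0]  # toy ground-truth scores

print(spearman(predicted, reference))  # ordering agrees perfectly
print(pearson(predicted, reference))   # relationship is monotonic but not linear
```

SRCC rewards getting the ordering of images right, while PLCC also penalizes departures from a linear relationship, which is why the two are usually reported together.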

LED Strip Quality Detection Based on OpenCV

22nd International Conference on Optical Communications and Networks, ICOCN 2024

Authors: Liu, Hao; Huang, Qihao; Liu, Honglin; Lang, Tingting. Affiliations: College of Optoelectronic Technology, China Jiliang University, Hangzhou, China; Hangzhou Leaper Technology Co. Ltd, Hangzhou, China

An image processing algorithm for real-time examination of LED light strips is proposed, which enables quick detection of blind LED beads in strips. It has been used successfully on a production line to replace manual inspect...

ISBN (print): 9798350367652

Keywords: image processing

Parallel semi-fragile color image watermarking authentication scheme using EXIF metadata

Real-time Processing of Image, Depth and Video Information 2023

Authors: Ortega-Rebollo, Rogelio A.; Ponomaryov, Volodymyr I.; Reyes-Reyes, Rogelio; Cruz-Ramos, Clara; Garcia-Salgado, Beatriz P. Affiliation: Instituto Politécnico Nacional, ESIME Culhuacán, Ciudad de México, Mexico

ISBN (digital): 9781510662636

ISBN (print): 9781510662629

With the growth of digital data, its protection has become a requirement for disseminating it through telecommunication networks. Nowadays, people can easily generate, edit, and share images on their own electronic devices using applications or software to process them. For this reason, it is sometimes necessary to prove the authenticity of digital images. This paper proposes a semi-fragile color image watermarking scheme for authentication. The proposed scheme embeds the EXIF (Exchangeable Image File) metadata of an image as a digital watermark into the Discrete Cosine Transform (DCT) coefficients using the LSB method. EXIF metadata stores relevant information about the image and the digital camera, such as date and time information, camera settings, and image characteristics, which is used to organize and classify images. The embedding algorithm modifies only one mid-frequency coefficient in each non-overlapping eight-by-eight DCT block, which offers a significant advantage in reduced processing time. The experimental results demonstrate watermark imperceptibility according to the objective quality measures PSNR and SSIM of the watermarked image (43 dB and 0.99, respectively). Additionally, the EXIF metadata can be extracted with 99% accuracy using a completely blind extraction process, performed without the original image, original watermark, original camera, or any other derivative information. Simulation results for a parallel implementation of the proposed method (multicore CPU and GPU) show effective real-time image watermarking. © 2023 SPIE.

Keywords: Metadata
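
The embedding idea in the abstract above, changing the least significant bit of one mid-frequency DCT coefficient per 8x8 block and reading it back without the original image, can be sketched in plain Python. This is an illustration under stated assumptions rather than the authors' scheme: the coefficient position `POS`, the rounding of the coefficient to an integer before setting its LSB, and all function names are hypothetical, and the watermarked pixels are kept as floats instead of being rounded back to 8-bit values.

```python
import math

N = 8         # block size named in the abstract
POS = (3, 4)  # hypothetical mid-frequency coefficient position


def _scale(k):
    """Normalization factor of the orthonormal DCT-II."""
    return math.sqrt(1.0 / N) if k == 0 else math.sqrt(2.0 / N)


def _basis(i, k):
    """Cosine basis value for sample index i and frequency index k."""
    return math.cos((2 * i + 1) * k * math.pi / (2 * N))


def dct2(block):
    """Orthonormal 2-D DCT-II of an N x N block."""
    return [[_scale(u) * _scale(v) * sum(block[x][y] * _basis(x, u) * _basis(y, v)
                                         for x in range(N) for y in range(N))
             for v in range(N)] for u in range(N)]


def idct2(coeffs):
    """Inverse of dct2."""
    return [[sum(_scale(u) * _scale(v) * coeffs[u][v] * _basis(x, u) * _basis(y, v)
                 for u in range(N) for v in range(N))
             for y in range(N)] for x in range(N)]


def embed_bit(block, bit):
    """Set the LSB of the rounded coefficient at POS to bit; return the new block."""
    coeffs = dct2(block)
    u, v = POS
    value = int(round(coeffs[u][v]))
    coeffs[u][v] = float((value & ~1) | bit)
    return idct2(coeffs)


def extract_bit(block):
    """Blind extraction: only the watermarked block is needed."""
    u, v = POS
    return int(round(dct2(block)[u][v])) & 1


block = [[(x * 8 + y * 3) % 256 for y in range(N)] for x in range(N)]
print(extract_bit(embed_bit(block, 1)), extract_bit(embed_bit(block, 0)))
```

Because only one coefficient per block is touched, the per-block work is independent, which is what makes the scheme a natural fit for the multicore CPU and GPU parallelization mentioned in the abstract. A practical implementation would also have to survive rounding the pixels back to integers, which this sketch avoids.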

Layered Convolutional Neural Networks for Multi-Class image Classification

Conference on Real-Time Image Processing and Deep Learning

Authors: Kasinets, Dzmitry; Saeed, Amir K.; Johnson, Benjamin A.; Rodriguez, Benjamin M. Affiliation: Johns Hopkins Univ, Whiting Sch Engn, 3400 N Charles St, Baltimore, MD 21218, USA

ISBN (print): 9781510673878; 9781510673861

In the context of the advancing digital landscape, there is a discernible demand for robust and defensible methodologies in addressing the challenges in multi-class image classification. The evolution of intelligent systems mandates swift evaluations of environmental variables to facilitate decision-making within an authorized workflow. Recognizing the imperative role of ensemble models, this paper undertakes an exploration into the efficacy of layered Convolutional Neural Network (CNN) architectures for the nuanced task of multi-class image classification, specifically applied to traffic signage recognition in the dynamic context of a moving vehicle. The research methodology employs a YOLO (You Only Look Once) model to establish a comprehensive training and testing dataset. Subsequently, a stratified approach is adopted, leveraging layered CNN architectures to categorize clusters of objects and, ultimately, extrapolate the pertinent speed limit values. Our endeavor aims to elucidate the procedural framework for integrating CNN models, providing insights into their accuracy within the application domain.

Keywords: convolutional neural networks; transforms; architectures; modeling; classification
