Search Results - Inner Mongolia University Library

On the Use of Bayesian Networks for Real-Time Urban Traffic Measurements: a Case Study with Low-Cost Devices

Journal of Signal Processing Systems for Signal Image and Video Technology, 2022, vol. 94, no. 3, pp. 293-304

Authors: Domenech-Asensi, Gines; Cano, Maria-Dolores; Morales-Esteras, Victor. Affiliations: Univ Politecn Cartagena, Dept Elect Tecnol Comp & Proyectos, Cartagena, Spain; Univ Politecn Cartagena, Dept Tecnol Informac & Comunicac, Cartagena, Spain

This paper describes a low-cost computer vision system able to obtain traffic metrics at urban intersections. The proposed system uses a reasoning model based on a Bayesian network. It evaluates different traffic metrics from data extracted by background subtraction and contrast analysis techniques applied to predefined regions of interest in the video sequences. The system has been designed to work with already installed urban cameras in order to reduce installation costs. It can therefore be configured to work with different image sizes and video frame rates, and to process images taken from different distances and perspectives. The validity of the proposed system has been proved using a Raspberry Pi platform and tested using two real surveillance video cameras managed by the local authority of Cartagena (Spain) under different environmental light conditions. Using this hardware, the system is able to process VGA grayscale images at a rate of 8 frames per second.

Keywords: Traffic signaling; Intelligent traffic lights; Image processing; Intelligent transportation; Bayesian networks

Defect detection with adjustable template for screw hole checking

11th IEEE International Conference on Consumer Electronics - Taiwan (ICCE-Taiwan) - Empower of Innovative Consumer Technology

Authors: Le, Lam Tuyen; Zhang, Bo-Shuo; Yu, Jing-Jay; Fang, Wen-Pinn. Affiliation: YuanZe Univ, Dept Informat & Commun, Taoyuan, Taiwan

ISBN: (print) 9798350386851; 9798350386844

This study focused on improving an existing automatic optical inspection application, which uses rule-based image processing, object detection, image segmentation and template matching, through dynamic database updates. A real-time screw defect detection method for various light source situations is proposed, with the ultimate aim of reducing defect detection cost and time. Unlike CNNs, which require a large amount of training material to achieve a specific accuracy rate, this method needs only a small amount. The effectiveness of the proposed method has been evaluated through many experiments.

Keywords: Image processing; Automated optical inspection; Template matching; Defect detection; Screw

3D human pose estimation with single image and inertial measurement unit (IMU) sequence

Pattern Recognition, 2024, vol. 149

Authors: Liu, Liujun; Yang, Jiewen; Lin, Ye; Zhang, Peixuan; Zhang, Lihua. Affiliations: Fudan Univ, Acad Engn & Technol, Shanghai, Peoples R China; Changchun Boli Technol CO Ltd, Changchun, Jilin, Peoples R China; Minist Educ, Engn Res Ctr AI & Robot, Shanghai, Peoples R China; Jilin Prov Key Lab Intelligence Sci & Engn, Changchun, Jilin, Peoples R China; TCL Ind Res Lab, Shenzhen, Guangdong, Peoples R China

Three-dimensional human pose estimation plays an important role in computer vision applications such as healthcare, sports, activity recognition, motion capture, and augmented reality. However, monocular image or video based methods are sensitive to occlusions, while multi-view methods usually require enormous computation resources. Currently, inertial measurement unit (IMU)-based methods have begun to overcome the occlusion problem and can potentially achieve real-time inference. Yet, they still suffer from insufficient precision and scale drift error over time. In this paper, we propose a novel, efficient framework to fuse a single image with a temporal sequence from IMU sensors to estimate human poses and reconstruct human shapes. Our method achieves 46 mm Mean Per Joint Positional Error (MPJPE) on the Total Capture dataset with a 30-frame time segment, and surpasses state-of-the-art pure IMU-based methods. Moreover, in comparison with other vision-based methods, the proposed method shows a great advantage in reducing the floating point operations per second (FLOPS) budget while still achieving competitive estimation precision. Our method achieves 74 FPS on an iPhone 12 for offline processing. In addition, our method can easily be generalized for outdoor cases.

Keywords: Artificial intelligence; 3D human pose estimation; Cross modals fusion; Light weight model

Enhanced Video Streaming Based Computing Encoder for Digital Visual Processing

2nd IEEE International Conference on Distributed Computing and Electrical Circuits and Electronics, ICDCECE 2023

Authors: Kumar, Tarun; Shukla, Amita. Affiliations: Galgotias University, Department of Computer Science and Engineering, Greater Noida, India; Noida Institute of Engineering and Technology, Department of Computer Science and Business System, Greater Noida, India

ISBN: (print) 9798350347456

The rise of video streaming for digital visual processing has been a boon for the industry of visual processing. Video streaming technology has made it easier for companies to capture, analyze, and interpret visual data faster than ever before. It has allowed for the storage and transmission of large amounts of visual data at high speeds, providing businesses with the ability to process and interpret this data in real-time. Video streaming technology can be used in a wide variety of applications, including facial recognition, 3D mapping, and object recognition. By streaming video data, companies can quickly and accurately identify individuals, recognize objects, and track movement. This technology can also be used in security applications, such as surveillance and monitoring, as well as in medical imaging, such as MRI and CT scans. Video streaming technology has also allowed companies to create more efficient visual processing systems. The streaming video data can be used to automate the process of image recognition and object classification. This has allowed companies to reduce the amount of time and effort needed to interpret visual data. Additionally, streaming video data can be used to create virtual reality experiences, providing users with an immersive experience when viewing digital images. © 2023 IEEE.

Keywords: Video streaming

Enhancing Video Encoding for Cloud Virtual Reality Gaming Based on User Types

2023 IEEE International Conference on Visual Communications and Image Processing, VCIP 2023

Authors: Song, Kai; Sun, Haonan; Huo, Junyan; Yang, Fuzheng; Yang, Kun; Chen, Gaoxing. Affiliations: Xidian University, Xi'an 710071, China; Alibaba Group, Hangzhou 311121, China

ISBN: (print) 9798350359855

Cloud virtual reality (VR) gaming is a novel technology that allows users to enjoy complex games on their thin clients by offloading the graphics rendering to cloud servers. The thin clients only need to perform basic decoding functions, which reduces the hardware requirements and costs. However, cloud VR gaming also faces the challenge of high bandwidth consumption when transmitting high-resolution game video streams. This paper presents a cloud VR gaming system that can transmit users' gaze point data to the server in real time to identify users' regions of interest. With this system, we verify the difference in spatial visual sensitivity caused by the different types of users. Then, a user-type-based video encoding method is proposed. A subjective test experiment shows that the proposed video encoding method can reduce the bitrate for players and viewers by at least 71% and 69%, respectively, without compromising the perceptual quality. © 2023 IEEE.

Keywords: Virtual reality

Keyframe Insights into Real-Time Video Tagging of Compressed UHD Content

21st International Conference on Image Analysis and Processing (ICIAP)

Author: Ruefenacht, Dominic. Affiliation: Mobius Labs GmbH, Berlin, Germany

ISBN: (print) 9783031064333; 9783031064326

We present a method that can analyze coded ultra-high resolution (UHD) video content an order of magnitude faster than real-time. We observe that the larger the resolution of a video, the larger the fraction of the overall processing time that is spent on decoding frames from the video. In this paper, we exploit the way video is coded to significantly speed up the frame decoding process. More precisely, we only decode keyframes, which can be decoded significantly faster than 'random' frames in the video. A key insight is that in modern video codecs, keyframes are often placed around scene changes (shot boundaries), and hence form a very representative subset of frames of the video. We show on the example of video genre tagging that keyframes nicely lend themselves to video analysis tasks. Unlike previous genre prediction methods which include a multitude of signals, we train a per-frame genre classification system using a CNN that solely takes (key-)frames as input. We show that the aggregated genre predictions are very competitive with much more involved methods at predicting the video genre(s), and even outperform state-of-the-art genre tagging methods that solely rely on video frames as input. The proposed system can reliably tag video genres of a compressed video between 12x (8K content) and 96x (1080p content) faster than real-time.

Keywords: Movie genre tagging; Real-time

Real-Time User-guided Adaptive Colorization with Vision Transformer

IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Authors: Lee, Gwanghan; Shin, Saebyeol; Na, Taeyoung; Woo, Simon S. Affiliations: Sungkyunkwan Univ, Dept Artificial Intelligence, Seoul, South Korea; Sungkyunkwan Univ, Coll Comp & Informat, Seoul, South Korea; SK Telecom, Seoul, South Korea

ISBN: (print) 9798350318920; 9798350318937

Recently, the vision transformer (ViT) has achieved remarkable performance in computer vision tasks and has been actively utilized in colorization. The vision transformer uses multi-head self-attention to effectively propagate user hints to distant relevant areas in the image. However, despite the success of vision transformers in colorizing images, the heavy underlying ViT architecture and the large computational cost hinder active real-time user interaction for colorization applications. Several studies have removed redundant image patches to reduce the computational cost of ViT in image classification tasks. However, the existing efficient ViT methods cause severe performance degradation in the colorization task since they completely remove the redundant patches. Thus, we propose a novel efficient ViT architecture for real-time interactive colorization, AdaColViT, which determines which redundant image patches and layers to reduce in the ViT. Unlike existing methods, our novel pruning method alleviates performance drop and flexibly allocates computational resources of input samples, effectively achieving actual acceleration. In addition, we demonstrate through extensive experiments on ImageNet-ctest10k, Oxford 102flowers, and CUB-200 datasets that our method outperforms the baseline methods.

Keywords: Algorithms: Generative models for image, video, 3D, etc.; Algorithms: Image recognition and understanding

Accelerated Reconstruction of Highly Undersampled 3D Cardiac MRI Image Navigators

Conference on Medical Imaging - Image Processing

Authors: Guo, Xinrui; Sheagren, Calder D.; Patel, Jaykumar H.; Li, Liwen; Wright, Graham A.; Guo, Fumin. Affiliations: Huazhong Univ Sci & Technol, Wuhan Natl Lab Optoelect, Wuhan 430074, Peoples R China; Univ Toronto, Sunnybrook Res Inst, Toronto, ON, Canada; Univ Toronto, Dept Med Biophys, Toronto, ON, Canada

ISBN: (print) 9781510671577; 9781510671560

Intraprocedural 3D real-time magnetic resonance imaging (MRI) provides a way for accurate and precise radiofrequency catheter targeting during ventricular tachycardia ablation. However, the limited data acquisition time needed to freeze cardiac motion results in highly undersampled k-space data that are challenging to reconstruct. In this work, we evaluated several deep learning (DL) based methods for real-time reconstruction of highly undersampled 3D real-time cardiac MRI. Algorithm reconstruction performance and speed were compared between classical algorithms and DL-based methods. Generative adversarial networks with attention layers in the generator were used to perform reconstructions in the image domain, which strived to balance reconstruction speed and image quality. In addition, variational networks were implemented by iterating data consistency in k-space and enforcing image smoothness via neural network-based regularization. In a preliminary study of heartbeat-resolved highly undersampled 3D cardiac MRI for 11 healthy volunteers, we observed that DL reconstruction methods provided good image quality with a significant increase in computational speed.

Keywords: Real-time cardiac MRI; Highly undersampled reconstruction; Deep learning

Proceedings of the 22nd International Conference on Image Analysis and Processing, ICIAP 2023

Proceedings of the 22nd International Conference on Image Analysis and Processing, ICIAP 2023

ISBN: (print) 9783031510229

The proceedings contain 92 papers. The special focus in this conference is on Image Analysis and Processing. The topics include: An Effective CNN-Based Super Resolution Method for Video Coding; Medical Transformers for Boosting Automatic Grading of Colon Carcinoma in Histological Images; FERMOUTH: Facial Emotion Recognition from the MOUTH Region; Consensus Ranking for Efficient Face Image Retrieval: A Novel Method for Maximising Precision and Recall; Towards Explainable Navigation and Recounting; Towards Facial Expression Robustness in Multi-scale Wild Environments; Depth Camera Face Recognition by Normalized Fractal Encodings; Automatic Generation of Semantic Parts for Face Image Synthesis; Improved Bilinear Pooling for Real-Time Pose Event Camera Relocalisation; Continual Source-Free Unsupervised Domain Adaptation; End-to-End Asbestos Roof Detection on Orthophotos Using Transformer-Based YOLO Deep Neural Network; OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data; UAV Multi-object Tracking by Combining Two Deep Neural Architectures; GLR: Gradient-Based Learning Rate Scheduler; A Large-scale Analysis of Athletes’ Cumulative Race Time in Running Events; Uncovering Lies: Deception Detection in a Rolling-Dice Experiment; Active Class Selection for Dataset Acquisition in Sign Language Recognition; MC-GTA: A Synthetic Benchmark for Multi-Camera Vehicle Tracking; A Differentiable Entropy Model for Learned Image Compression; Learning Landmarks Motion from Speech for Speaker-Agnostic 3D Talking Heads Generation; Self-Similarity Block for Deep Image Denoising; SCENE-pathy: Capturing the Visual Selective Attention of People Towards Scene Elements; Not with My Name! Inferring Artists’ Names of Input Strings Employed by Diffusion Models; Benchmarking of Blind Video Deblurring Methods on Long Exposure and Resource Poor Settings; LieToMe: An LSTM-Based Method for Deception Detection by Hand Movements; Spatial Transformer Generative Adversarial Network for Image Super

A Research on Fast Acquisition Technology of Spatio-Temporal Data of Railway Infrastructure Based on Measurable Real Image

2024 International Conference Optoelectronic Information and Optical Engineering, OIOE 2024

Authors: Xu, Xiaolei; Wang, Yaoyao; Feng, Boqing; Cui, Mengzhen. Affiliation: Institute of Computing Technology, China Academy of Railway Sciences Corporation Limited, China

ISBN: (digital) 9781510688209

ISBN: (print) 9781510688193

The spatiotemporal data of railway infrastructure plays an important role in the development of railway informatization, but existing collection technologies have problems such as low efficiency, high cost, and many limitations. Starting from different business application scenarios in railways, this article first conducts a comprehensive investigation and analysis of the business requirements for spatiotemporal data of railway infrastructure. Then, by studying new surveying and mapping technologies such as GNSS+IMU combined positioning technology, laser point cloud scanning technology, and real scene video acquisition technology, a railway measurable real scene image acquisition device is developed to achieve the integrated collection of device operation trajectory positioning data, point cloud data, and real scene video data. At the same time, real scene image calculation technology is used to obtain measurable real scene image data along the railway line, thereby achieving rapid collection of railway infrastructure spatiotemporal data based on measurable real scene images. Finally, experimental verification was conducted on the circular railway line of the National Railway Test Center, successfully collecting and obtaining the spatial and mileage coordinates of various professional infrastructure along the circular railway, as well as accurately measuring the geometric dimension information of various facilities and structures. © 2025 SPIE.

Keywords: Mapping
