检索结果-内蒙古大学图书馆

20th IEEE International conference on Advanced video and Signal-Based Surveillance (AVSS)

作者： Challagundla, Yagnesh Parvatham, Shreeya Dheera Mohanty, Sachi Nandan Ravindra, J. V. R. VIT AP Univ Sch Comp Sci & Engn SCOPE Amaravati India Vardhaman Coll Engn Dept Elect & Commun Engn Hyderabad Telangana India

ISBN: (纸本)9798350374292;9798350374285

In today's security-conscious environment, the need for effective real-time weapon detection systems is paramount, especially in public spaces and sensitive areas. The primary objective of this research is to create a real-time system for detecting weapons utilizing advanced deep learning techniques, namely VGG16 and Faster RCNN. The system's objective is to precisely detect and categorize weapons from photos and video data, offering prompt notifications and improving security procedures. The project's backdrop explores the difficulties encountered, including data gathering, precision standards, immediate processing, confidentiality, and deployment considerations. The system achieves excellent accuracy and real-time detection capabilities by building a comprehensive dataset manually and training the models using GPUs and modern technologies. Using Python as the programming language provides flexibility and simplicity in development, making use of Python's modules such as OpenCV for image processing and Keras for deep learning models. The Tkinter framework enables the creation of a graphical user interface (GUI) that supports various user operations, such as uploading datasets, generating models, processing images and videos, detecting weapons, and visualizing results. The methodology employs a systematic approach, encompassing stages such as data preprocessing, model building, training, testing, and result analysis. The combination of VGG16 and Faster RCNN algorithms demonstrates a compromise between speed and accuracy, with Faster RCNN exhibiting greater performance in real-time detection. The project aims to achieve several objectives, including the compilation of a dataset, training a model, evaluating accuracy, implementing real-time detection, and establishing alerting methods. The system's range encompasses a wide range of surroundings, lighting situations, and orientations, allowing it to be flexible and suitable for a variety of security applications. To sum

关键词： real-time weapon detection Deep learning algorithms VGG16 Faster RCNN Speed vs accuracy tradeoff real-time detection Performance analysis

来源：评论

学校读者我要写书评

暂无评论

Microscopic augmented reality calibration with contactless line-structured light registration for surgical navigation

引用

MEDICAL & BIOLOGICAL ENGINEERING & COMPUTING 2025年第5期63卷 1463-1479页

作者： Li, Yuhua Jiang, Shan Yang, Zhiyong Yang, Shuo Zhou, Zeyang Tianjin Univ Mech Engn Dept 135 Yaguan RdHaihe Educ Pk Tianjin 300350 Peoples R China

The use of AR technology in image-guided neurosurgery enables visualization of lesions that are concealed deep within the brain. Accurate AR registration is required to precisely match virtual lesions with anatomical structures displayed under a microscope. The purpose of this work was to develop a real-time augmented surgical navigation system using contactless line-structured light registration, microscope calibration, and visible optical tracking. Contactless discrete sparse line-structured light point cloud is utilized to construct patient-image registration. Microscope calibration optimization with dimensional invariant calibrator is employed to enable real-time tracking of the microscope. The visible optical tracking integrates a 3D medical model with surgical microscope video in real time, generating an augmented microscope stream. The proposed patient-image registration algorithm yielded an average root mean square error (RMSE) of 0.78 +/- 0.14 mm. The pixel match ratio error (PMRE) of the microscope calibration was found to be 0.646%. The RMSE and PMRE of the system experiments are 0.79 +/- 0.10 mm and 3.30 +/- 1.08%, respectively. Experimental evaluations confirmed the feasibility and efficiency of microscope AR surgical navigation (MASN) registration. By means of registration technology, MASN overlays virtual lesions onto the microscopic view of the real lesions in real time, which can help surgeons to localize lesions hidden deep in tissue.

关键词： Augmented reality Microscope calibration Surgical navigation system Point cloud processing video stream fusion

来源：评论

学校读者我要写书评

暂无评论

UNSUPERVISED COORDINATE-BASED video DENOISING 31

UNSUPERVISED COORDINATE-BASED VIDEO DENOISING

引用

2024 International conference on image processing

作者： Aiyetigbo, Mary Ravichandran, Dineshchandar Chalhoub, Reda Kalivas, Peter Luo, Feng Li, Nianyi Clemson Univ Sch Comp Clemson SC 29631 USA Med Univ South Carolina Dept Neurosci Charleston SC USA

ISBN: (纸本)9798350349405;9798350349399

In this paper, we introduce a novel unsupervised video denoising deep learning approach that can help to mitigate data scarcity issues and show robustness against different noise patterns, enhancing its broad applicability. Our method comprises three modules: a Feature generator creating feature maps, a Denoise-Net generating denoised but slightly blurry reference frames, and a Refine-Net re-introducing high-frequency details. By leveraging the coordinate-based network, we can greatly simplify the network structure while preserving high-frequency details in the denoised video frames. Extensive experiments on both simulated and real-captured videos demonstrate that our method can effectively denoise real-world calcium imaging video sequences without prior knowledge of noise models and data augmentation during training.

关键词： video denoising unsupervised implicit neural representation

来源：评论

学校读者我要写书评

暂无评论

Enhanced YOLOv8 framework for precision vehicle detection in high-resolution remote sensing images

引用

SIGNAL image AND video processing 2025年第3期19卷 1-10页

作者： Shao, Zhaowei He, Kunyu Yuan, Baohua Xu, Sheng Nanjing Forestry Univ Coll Informat Sci & Technol & Artificial Intellige Nanjing Jiangsu Peoples R China Changzhou Univ Coll Informat Sci & Engn Changzhou Jiangsu Peoples R China

Vehicle detection in high-resolution remote sensing imagery faces challenges such as varying scales, complex backgrounds, and high intra-class variability. We propose an enhanced YOLOv8 framework, incorporating three key advancements: the Adaptive Feature Pyramid Network (AFPN), Omni-Dimensional Convolution (ODConv), and a Slim Neck with Generalized Shuffle Convolution (GSConv). These enhancements improve vehicle detection accuracy, computational efficiency, and visual AI capabilities for applications such as computer animation and virtual worlds. Our model achieves a Mean Average Precision (mAP) of 0.7153, representing a 4.99% improvement over the baseline YOLOv8. Precision and recall increase to 0.9233 and 0.9329, respectively, while box loss is reduced from 1.213 to 1.054. This framework supports real-time surveillance, traffic monitoring, and urban planning. The NEPU-OWOD V2.0 dataset, used for evaluation, includes high-resolution images from multiple regions and seasons, along with diverse annotations and augmentations. Our modular approach allows for separate assessments of each enhancement. The dataset and source code are available for future research and development at (https://***/10.5281/zenodo.13075939).

关键词： Object detection High-resolution remote sensing imagery Multi-scale feature representation real-time processing

来源：评论

学校读者我要写书评

暂无评论

Adaptive real-time video Transmission through Underwater Acoustic Communication Link

Adaptive Real-Time Video Transmission through Underwater Aco...

引用

OCEANS conference

作者： Wu, Jie Fu, Yanbing Li, Jingxuan Qu, Fengzhong Wei, Yan Zhejiang Univ Key Lab Ocean Observat Imaging Testbed Zhejiang P Zhoushan 31600 Peoples R China Minist Educ Engn Res Ctr Ocean Sensing Technol & Equipment Zhoushan 31600 Peoples R China Zhejiang Univ Hainan Inst Sanya 572025 Peoples R China

ISBN: (纸本)9798350362077

Efficient video transmission enables a wide range of applications in underwater environments, such as seabed survey, subsea equipment maintenance, oil pipe/bridge inspection, and marine life sample collection. At present, it is a common belief that real-time underwater video transmission through underwater acoustic communication is challenging due to the influence of complex underwater environments and the limitation of underwater acoustic communication. In this paper, we propose an adaptive real-time underwater video transmission system using underwater communication. The system consists of three modules, i.e, video pre-processing module, video transmission module and video post-processing module. In the first two modules, the sender adaptively adjusts the compression bitrate and transmission rate according to the video quality and channel conditions. In the third module, the deep learning-based video reconstruction algorithm for underwater image information recovery is exploited. The efficacy of this system is verified by real underwater videos collected in several sea fields. The results prove the proposed system is able to transmit video successfully and efficiently in the underwater environment.

关键词： underwater video transmission underwater acoustic communication video super-resolution reconstruction video compressing

来源：评论

学校读者我要写书评

暂无评论

real-time Multi-Object Detection Using Enhanced Yolov5-7S on Multi-GPU for High-Resolution video

引用

INTERNATIONAL JOURNAL OF image AND GRAPHICS 2024年第2期24卷 2450019-2450019页

作者： Shaikh, Shakil A. Chopade, Jayant J. Sardey, Mohini Pramod Matoshri Coll Engn & Res Ctr Dept Elect & Telecommun Nasik India Savitribai Phule Pune Univ Pune Maharashtra India AISSMS IOIT Dept Elect & Telecommun Pune India

Multiple objects tracking in a video sequence can be performed by detecting and distinguishing the objects that appear in the sequence. In the context of computer vision, the robust multi-object tracking problem is a difficult problem to solve. Visual tracking of multiple objects is a vital part of an autonomous driving vehicle's vision technology. Wide-area video surveillance is increasingly using advanced imaging devices with increased megapixel resolution and increased frame rates. As a result, there is a huge increase in demand for high-performance computation system of video surveillance systems for real-time processing of high-resolution videos. As a result, in this paper, we used a single stage framework to solve the MOT problem. We proposed a novel architecture in this paper that allows for the efficient use of one and multiple GPUs are used to process Full High Definition video in real time. For high-resolution video and images, the suggested approach is real-time multi-object detection based on Enhanced Yolov5-7S on Multi-GPU Vertex. We added one more layer at the top in backbone to increase the resolution of feature extracted image to detect small object and increase the accuracy of model. In terms of speed and accuracy, our proposed approach outperforms the state-of-the-art techniques.

关键词： Multi-object tracking YOLOv5 GPU_NVIDIA object detection deep learning computer vision

来源：评论

学校读者我要写书评

暂无评论

MULTIPLE DESCRIPTION video CODING FOR real-time APPLICATIONS USING HEVC 30

MULTIPLE DESCRIPTION VIDEO CODING FOR REAL-TIME APPLICATIONS...

引用

30th IEEE International conference on image processing (ICIP)

作者： Trung Hieu Le Antonini, Marc Lambert, Marc Alioua, Karima Cote dAzur Univ I3S Lab Sophia Antipolis France CNRS UMR 7271 Sophia Antipolis France LEXTAN SAS Gemenos France

ISBN: (纸本)9781728198354

Remote control vehicles require the transmission of large amounts of data, and video is one of the most important sources for the driver. To ensure reliable video transmission, the encoded video stream is transmitted simultaneously over multiple channels. However, this solution incurs a high transmission cost. To address this issue, it is necessary to use more efficient video encoding methods that can make the video stream robust to noise. Moreover it should have a less complexity to adapt to the real time requirement. In this paper, we propose a low-complexity, low-latency 2-channel Multiple Description Coding (MDC) solution with an adaptive Instantaneous Decoder Refresh (IDR) frame period, which is compatible with the HEVC standard with adaptive redundancy adjustment. This method shows a better resistance to high packet loss rates with lower complexity.

关键词： Multiple Description coding HEVC Error Correction

来源：评论

学校读者我要写书评

暂无评论

Robust edge-preserving image smoothing based on complementary weighting scheme

引用

SIGNAL image AND video processing 2024年第8-9期18卷 5663-5675页

作者： Yang, Yang Xia, Minghui Wang, Xinyu Zeng, Lanling Zhan, Yongzhao Jiangsu Univ Dept Comp Sci Xuefu Rd 301 Zhenjiang 212013 Jiangsu Peoples R China

Edge-aware image smoothing refers to the removal of details with edges preserved. It is an essential topic in the field of image processing and computer graphics. In this paper, in order to achieve better edge preservation than the existing models, we propose a robust edge-preserving image filtering method based on a complementary weighting scheme. Both isotropic and anisotropic weights are involved in our model to adapt the fidelity and the regularization terms. To efficiently solve the proposed model, we introduce an effective algorithm based on additive half quadratic minimization, alternating direction of multipliers, and Fourier domain optimization strategies. We experimentally validate the proposed filter on several low-level vision tasks. Both quantitative and qualitative experimental results show significant superiority of our proposed filter compared to existing techniques. Furthermore, the filter exhibits high efficiency and is able to process 720P color images (over 10 fps) in real-time on an NVIDIA RTX 3070. Therefore, it is practical for real-world applications.

关键词： Edge-preserving Complementary weighting image processing Computer graphics

来源：评论

学校读者我要写书评

暂无评论

video conference System in Mixed reality Using a Hololens

引用

Computer Modeling in Engineering & Sciences 2023年第1期134卷 383-403页

作者： Baolin Sun Xuesong Gao Weiqiang Chen Qihao Sun Xiaoxiao Cui HaoGuo Cishahayo Remesha Kevin Shuaishuai Liu Zhi Liu School of Information Science and Engineering Shandong UniversityQingdao266000China State Key Laboratory of Digital Multi-Media Technology Hisense Co.Ltd.Qingdao266000China

The mixed reality conference system proposed in this paper is a robust,real-time video conference application software that makes up for the simple interaction and lack of immersion and realism of traditional video conference,which realizes the entire process of holographic video conference from client to cloud to the *** paper mainly focuses on designing and implementing a video conference system based on AI segmentation technology and mixed *** mixed reality conference system components are discussed,including data collection,data transmission,processing,and mixed reality *** data layer is mainly used for data collection,integration,and video and audio *** network layer uses Web-RTC to realize peer-to-peer data *** data processing layer is the core part of the system,mainly for human video matting and human-computer interaction,which is the key to realizing mixed reality conferences and improving the interactive *** presentation layer explicitly includes the login interface of the mixed reality conference system,the presentation of real-time matting of human subjects,and the presentation *** the mixed reality conference system,conference participants in different places can see each other in real-time in their mixed reality scene and share presentation content and 3D models based on mixed reality technology to have a more interactive and immersive experience.

关键词： Mixed reality AI segmentation hologram video conference Web-RTC

来源：评论

学校读者我要写书评

暂无评论

An innovative traffic flow detection model based on temporal video frame analysis and grayscale aggregation quantification

引用

IET image processing 2024年第14期18卷 4704-4715页

作者： Liu, Xin Meng, Qiao Li, Xin Wang, Zhijie Kong, Siyuan Li, Bingyu Qinghai Univ Sch Comp Technol & Applicat Xining Qinghai Peoples R China Qinghai Univ Intelligent Comp & Applicat Lab Qinghai Prov Xining Qinghai Peoples R China

Current traffic status detection methods heavily rely on historical traffic flow data and vehicle counts. However, these methods often fail to meet the stringent real-time requirements of state detection, especially on edge devices with limited computing *** address these challenges, this study develops a traffic alert model using temporal video frame analysis and grayscale aggregation quantization techniques. Initially, the model uses distance mapping between pixel features and frames of road traffic videos to construct a comprehensive road environment and vehicle segmentation model. The model also establishes a mapping between pixel equidistant lines and actual distances, enabling precise congestion detection. This approach significantly reduces costs associated with traditional traffic detection methods as it does not rely on historical data. Performance evaluation using fixed-point road monitoring data indicates that the proposed model outperforms traditional traffic state detection models, with a performance improvement of approximately 4.7% to 9.5%. Additionally, the model improves computing resource efficiency by approximately 72.5% and demonstrates substantial real-time detection capabilities.

关键词： alarm systems C plus plus language image processing road traffic road safety road vehicles

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：