检索结果-内蒙古大学图书馆

3rd International conference on image processing and Media Computing, ICIPMC 2024

ISBN: (纸本)9798350386660

The proceedings contain 58 papers. The topics discussed include: real-time heart rate detection based on body surface video data;a local dimming algorithm based on deep learning;multilevel interaction embedding for hyperspectral image super-resolution;a review of point target and extended target tracking algorithms;phase retrieval algorithm based on transport of intensity equation under the fusion of regularization and grating modulation;exploring data augmentation effects on a singular illumination distribution dataset with ColorJitter;unsupervised domain adaptation for cross-modality cardiac image segmentation based on contrastive image synthesis;a novel color image encryption scheme based on fractional-order chaotic system;and contextual transformer based small targets detection for cervical cell.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Anonymizing Big Data Streams Using In-memory processing: A Novel Model Based on One-time Clustering

引用

JOURNAL OF SIGNAL processing SYSTEMS FOR SIGNAL image AND video TECHNOLOGY 2024年第6-7期96卷 333-356页

作者： Shamsinejad, Elham Banirostam, Touraj Pedram, Mir Mohsen Rahmani, Amir Masoud Islamic Azad Univ Dept Comp Engn Cent Tehran Branch Tehran Iran Kharazmi Univ Fac Engn Dept Elect & Comp Engn Tehran Iran Islamic Azad Univ Dept Comp Engn Sci & Res Branch Tehran Iran

Big data privacy preservation is a critical challenge for data mining and data analysis. Existing methods for anonymizing big data streams using k-anonymity algorithms may cause high data loss, low data quality, and identity disclosure. In this paper, we propose a novel model for anonymizing big data streams using in-memory processing. The model uses a Spark framework to parallelize the anonymization process and a one-time clustering algorithm to avoid multiple iterations and allocate the data to optimal clusters. We evaluate the performance and effectiveness of the model using a real-world dataset and compare it with three popular k-anonymity algorithms: CRUE, Mean-Shift, and DBSCAN. The results show that the model has the lowest data loss and the highest data quality for different data sizes and k-values. The model is scalable, robust, adaptable, and flexible. The model can provide better data for data mining and data analysis while protecting data privacy and preventing data disclosure.

关键词： Big data Anonymity In-memory processing One-time clustering Data loss

来源：评论

学校读者我要写书评

暂无评论

VATE: Edge-Cloud System for Object Detection in real-time video Streams 8

VATE: Edge-Cloud System for Object Detection in Real-Time Vi...

引用

8th IEEE International conference on Fog and Edge Computing (ICFEC)

作者： Maresch, Maximilian Nastic, Stefan TU Wien Distributed Syst Grp Vienna Austria

ISBN: (纸本)9798350361360;9798350361353

In the realm of edge intelligence, emerging video analytics applications are often based on resource constrained edge devices. These applications need systems which are able to provide both low-latency and high-accuracy video stream processing, such as for object detection in real-time video streams. State-of-the-art systems tackle this challenge by leveraging edge computing and cloud computing. Such edge-cloud approaches typically combine low-latency results from the edge and high accuracy results from the cloud when processing a frame of the video stream. However, the accuracy achieved so far leaves much room for improvement. Furthermore, using more accurate object detection often requires having more capable hardware. This limits the edge devices which can be used. Applications related to autonomous drones, with the drone being the edge device, give one example. A wide variety of objects needs to be detected reliably for drones to operate safely. Drones with more computing capabilities are often more expensive and suffer from short battery life, as they consume more energy. In this paper, we introduce VATE, a novel edge-cloud system for object detection in real-time video streams. An enhanced approach for edgecloud fusion is presented, leading to improved object detection accuracy. A novel multi-object tracker is introduced, allowing VATE to run on less capable edge devices. The architecture of VATE enables it to be used when edge devices are capable of running on-device object detection frequently and when edge devices need to minimise on-device object detection to preserve battery life. Its performance is evaluated on a challenging, dronebased video dataset. The experimental results show that VATE improves accuracy by up to 27.5% compared to the state-of-theart system, while running on less capable and cheaper hardware.

关键词： video Analytics Edge Intelligence Edge Computing Edge-Cloud Systems Object Detection Object Tracking

来源：评论

学校读者我要写书评

暂无评论

Fast Software-Based real time Panoramic image processing 18

Fast Software-Based Real Time Panoramic Image Processing

引用

18th International conference on Ubiquitous Information Management and Communication, IMCOM 2024

作者： Gerlits, Matthew Moh, Melody Moh, Teng-Sheng San Jose State University Department of Computer Science San JoseCA United States

ISBN: (纸本)9798350331011

Panoramic or stitched image processing has wide applications in areas such as medical imaging, topographical mapping, and deep space exploration. Rapid development of high-speed communication and artificial intelligence technologies have enabled real-time panoramic image processing, essential to autonomous driving, robotics, drones, etc., and critical in the advancement of smart cities, smart hospitals, manufacturer automation, and intelligent military warfare. image stitching algorithms are able to join sets of images together and provide a wider field of a vision when compared with an image from a single standard camera. Traditional techniques are able to adequately produce a stitch for a static set of images, but suffer when differing lighting conditions exist between the two images, and from processing times too slow for real time use cases. We propose a solution which resolves these two issues encountered by traditional techniques. First, two advanced blending schemes, including the superpixel approach, have been implemented to resolve the lighting difference. Second, we develop a fast validation scheme to rapidly detect invalid solutions, and package all the system components in a parallel processing architecture to ensure a 0.008 second frame processing time, sufficiently small for most real-time applications. The proposed solution, including its fault-detection system, is implemented in the software level and the need for specialized hardware is completely eliminated. To the best of our knowledge, this is the first software-based solution fast enough for real-time panoramic image processing;it would contribute significantly to the advancement of real-time image processing, especially for its wide applications in the modern smart society. © 2024 IEEE.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

Miruoto : Sports event atmosphere visual rendering through real-time image and sound processing system

Miruoto : Sports event atmosphere visual rendering through r...

引用

SIGGRAPH conference on Emerging Technologies

作者： Gourmelen, Guillaume Toriya, Shutaro Miya, Eiko Shioura, Naohisa Iwata, Hiroyasu Waseda Univ Inst Mech Engn Frontiers Tokyo Japan Waseda Univ Grad Sch Creat Sci & Engn Tokyo Japan AISIN Corp Tokyo Japan Waseda Univ Fac Sci & Engn Tokyo Japan

ISBN: (纸本)9798400705243

Did you already imagine how would it be to watch a sport match without sounds? You would miss all this specific sport related sounds but also mostly miss a big part of the atmosphere present in the stadium, that is particular to live events. This is what happens to most Deaf and Hard of Hearing persons. Towards Tokyo 2025 Deaflympics, we developed an AI-based system able to recognize sounds and players motion to render in real time sound related Onomatopoeia over the match video as one could see in Comics or Manga.

关键词： Audition

来源：评论

学校读者我要写书评

暂无评论

OMRA: ONLINE MOTION RESOLUTION ADAPTATION TO REMEDY DOMAIN SHIFT IN LEARNED HIERARCHICAL B-FRAME CODING 31

OMRA: ONLINE MOTION RESOLUTION ADAPTATION TO REMEDY DOMAIN S...

引用

2024 International conference on image processing

作者： Gao, Zong-Lin Sang NguyenQuang Peng, Wen-Hsiao Xiem HoangVan Natl Yang Ming Chiao Tung Univ Comp Sci Dept Hsinchu Taiwan VNU Univ Engn & Technol Elect & Telecommun Hanoi Vietnam

ISBN: (纸本)9798350349405;9798350349399

Learned hierarchical B-frame coding aims to leverage bidirectional reference frames for better coding efficiency. However, the domain shift between training and test scenarios due to dataset limitations poses a challenge. This issue arises from training the codec with small groups of pictures (GOP) but testing it on large GOPs. Specifically, the motion estimation network, when trained on small GOPs, is unable to handle large motion at test time, incurring a negative impact on compression performance. To mitigate the domain shift, we present an online motion resolution adaptation (OMRA) method. It adapts the spatial resolution of video frames on a per-frame basis to suit the capability of the motion estimation network in a pre-trained B-frame codec. Our OMRA is an online, inference technique. It need not re-train the codec and is readily applicable to existing B-frame codecs that adopt hierarchical bi-directional prediction. Experimental results show that OMRA significantly enhances the compression performance of two state-of-the-art learned B-frame codecs on commonly used datasets.

关键词： Learned video Coding B-frame Coding and Domain Shift

来源：评论

学校读者我要写书评

暂无评论

Plume motion characterization in UAV aerial video and imagery

Plume motion characterization in UAV aerial video and imager...

引用

conference on real-time image processing and Deep Learning

作者： Mehrubeoglu, Mehrube Cammarata, Kirk Zhang, Hua McLauchlan, Lifford Texas A&M Univ Corpus Christi Dept Engn 6300 Ocean Dr Corpus Christi TX 78412 USA Texas A&M Univ Corpus Christi Dept Life Sci 6300 Ocean Dr Corpus Christi TX 78412 USA Texas A&M Univ Kingsville Dept Elect & Comp Engn MS 192 Kingsville TX 78363 USA

ISBN: (数字)9781510661714

ISBN: (纸本)9781510661707;9781510661714

Sediment plumes are generated from both natural and human activities in benthic environments, increasing the turbidity of the water and reducing the amount of sunlight reaching the benthic vegetation. Seagrasses, which are photosynthetic bioindicators of their environment, are threatened by chronic reductions in sunlight, impacting entire aquatic food chains. This research uses UAV aerial video and imagery to investigate the characteristics of sediment plumes generated by a model of anthropogenic disturbance. The extent, speed and motion of the plumes were assessed as these parameters may pertain to the potential impacts of plume turbidity on seagrass communities. In a case study using UAV video, the turbidity plume was observed to spread over 250 feet over 20 minutes of the UAV campaign. The directional speed of the plume was estimated to be between 10.4 and 10.6 ft/min. This was corroborated by observation of greatest plume turbidity and sediment load near the location of disturbance and diminishing with distance. Further temporal studies are necessary to determine long-term, if any, impacts of human activity-generated sediment plumes on seagrass beds.

关键词： sediment plume UAV imagery UAV video semantic segmentation semantic mapping plume distance plume speed plume spread image processing

来源：评论

学校读者我要写书评

暂无评论

Design and Implementation of video Surveillance Quality Inspection System

Design and Implementation of Video Surveillance Quality Insp...

引用

2023 International conference on Machine Vision, image processing and Imaging Technology, MVIPIT 2023

作者： Liu, Zhen Sun, He Zhou, Longxiang Li, Qingyu Satellite Maritime Tracking Control Department Jiangyin China

ISBN: (纸本)9798350306545

This paper presents the design and implementation of a camera surveillance picture quality inspection system. The system assesses the video stream from surveillance cameras and provides immediate feedback on image quality. Development encompasses Java multi-threading, database utilization and modular design dependent on requirements. The system's architecture and process analysis are completed, enabling the system function to be realized. The results of the application indicate that the system can promptly and precisely evaluate the real-time image quality of the video stream from surveillance equipment, precisely tally equipment asset usage information, enhance the management efficiency and inspection quality of operations and maintenance staff, and guarantee the surveillance screen's effectiveness. © 2023 IEEE.

关键词： image quality

来源：评论

学校读者我要写书评

暂无评论

FaceEngine: A Tracking-Based Framework for real-time Face Recognition in video Surveillance System

引用

SN Computer Science 2024年第5期5卷 609页

作者： Imran, Ahsan Ahmed, Riad Hasan, Md Mehedi Ahmed, M. Helal Uddin Azad, A.K.M. Alyami, Salem A. Department of Robotics and Mechatronics Engineering University of Dhaka Dhaka 1000 Bangladesh Department of Computer Science and Engineering BRAC University Dhaka 1212 Bangladesh Department of Management Information Systems University of Dhaka Dhaka 1000 Bangladesh Department of Mathematics and Statistics College of Science Imam Mohammad Ibn Saud Islamic University (IMSIU) Riyadh 11432 Saudi Arabia

The escalating concern over worldwide security and criminal activities has led to the emergence and significance of closed-circuit television video surveillance systems as an essential tool for diverse security purposes. These systems are extensively implemented and serve a crucial function in the surveillance and upkeep of security. The predominant purpose of video surveillance systems is to gather data primarily for evidentiary purposes subsequent to the occurrence of a criminal incident. The demand for video surveillance systems capable of autonomously monitoring and promptly identifying criminals or intruders in real-time is steadily increasing. Nevertheless, the existing facial recognition methods pose difficulties in reliably identifying individuals who are in motion within a video frame. Moreover, conventional approaches necessitate a substantial quantity of photographs in order to achieve precise recognition following the acquisition of an individual’s facial pattern. In order to tackle these concerns, we developed the implementation of input optimisation algorithms alongside a novel framework for real-time face recognition in the context of video surveillance. The input optimization algorithms, integrated with adaptive thresholding techniques, effectively reduce the need for manual outlier removal by actively identifying outliers for each specific case. The application of this optimisation strategy has demonstrated a substantial enhancement in both the efficiency and precision of our system in comparison to alternative baseline methodologies. Through the use of a reduced set of input image, our system is capable of attaining a heightened degree of improvements. Specifically, employing tracking and temporal voting techniques enables our system to accomplish a real-time face recognition accuracy of 90.91%. The findings of this study suggest that our approach has the potential to be a valuable tool in various applications that necessitate rapid and precise fac

关键词： Computer vision Deep learning Face recognition image processing Tracking video surveillance

来源：评论

学校读者我要写书评

暂无评论

Will Transformers change gastrointestinal endoscopic image analysis? A comparative analysis between CNNs and Transformers, in terms of performance, robustness and generalization

引用

MEDICAL image ANALYSIS 2025年 99卷 103348页

作者： Kusters, Carolus H. J. Jaspers, Tim J. M. Boers, Tim G. W. Jong, Martijn R. Jukema, Jelmer B. Fockens, Kiki N. de Groof, Albert J. Bergman, Jacques J. van der Sommen, Fons De With, Peter H. N. Eindhoven Univ Technol Dept Elect Engn Video Coding & Architectures Eindhoven Netherlands Univ Amsterdam Dept Gastroenterol & Hepatol Amsterdam UMC Amsterdam Netherlands

Gastrointestinal endoscopic image analysis presents significant challenges, such as considerable variations in quality due to the challenging in-body imaging environment, the often-subtle nature of abnormalities with low interobserver agreement, and the need for real-time processing. These challenges pose strong requirements on the performance, generalization, robustness and complexity of deep learning-based techniques in such safety-critical applications. While Convolutional Neural Networks (CNNs) have been the go-to architecture for endoscopic image analysis, recent successes of the Transformer architecture in computer vision raise the possibility to update this conclusion. To this end, we evaluate and compare clinically relevant performance, generalization and robustness of state-of-the-art CNNs and Transformers for neoplasia detection in Barrett's esophagus. We have trained and validated several top-performing CNNs and Transformers on a total of 10,208 images (2,079 patients), and tested on a total of 7,118 images (998 patients) across multiple test sets, including a high-quality test set, two internal and two external generalization test sets, and a robustness test set. Furthermore, to expand the scope of the study, we have conducted the performance and robustness comparisons for colonic polyp segmentation (Kvasir-SEG) and angiodysplasia detection (Giana). The results obtained for featured models across a wide range of training set sizes demonstrate that Transformers achieve comparable performance as CNNs on various applications, show comparable or slightly improved generalization capabilities and offer equally strong resilience and robustness against common image corruptions and perturbations. These findings confirm the viability of the Transformer architecture, particularly suited to the dynamic nature of endoscopic video analysis, characterized by fluctuating image quality, appearance and equipment configurations in transition from hospital to hospital. The

关键词： Endoscopic image analysis Convolutional neural networks Transformers Robustness Generalization

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：