Search Results - Inner Mongolia University Library

7th IEEE International Conference on Multimedia Information Processing and Retrieval, MIPR 2024

ISBN: (print) 9798350351422

The proceedings contain 106 papers. The topics discussed include: macro-AUC-driven active learning strategy for multi-label classification enhancement; mitigating privacy threats without degrading visual quality of VR applications: using re-identification attack as a case study; attenuation-aware weighted optical flow with medium transmission map for learning-based visual odometry in underwater terrain; GeoVQA: a comprehensive multimodal geometry dataset for secondary education; pulse of the crowd: quantifying crowd energy through audio and video analysis; automated recognition of optic disc and blood vessels in diabetic fundoscopy images using real-time image analysis; GeoSecure-B: a method for secure bearing calculation; and exploiting correlation between facial action units for detecting deepfake videos.

Bit Plane Segmentation and LBP-Based Coverless Video Steganography for Secure Data Transmission

8th International Conference on Computer Vision and Image Processing (CVIP)

Authors: Debnath, Sourabh; Mohapatra, Ramesh Kumar; Kulkarni, Tejas Shirish (Natl Inst Technol Rourkela Rourkela 769008 Odisha India)

ISBN: (print) 9783031581809; 9783031581816

Traditional information hiding techniques alter carriers to embed secret information, and steganalysis algorithms can detect these alterations. In the area of covert communication, coverless information concealment has been suggested as a way of preventing steganalysis. A new methodology is proposed for sharing secret data through coverless video steganography, applying Local Binary Pattern (LBP) before bit-plane segmentation. In this technique, a single frame is converted into multiple bit-planes, and various hash sequences are generated from these bit-planes. After extracting frames from the video, each frame is converted into grayscale using LBP and then split into multiple bit-planes using Bit-plane Complexity Segmentation (BPCS). The hash sequences are obtained by calculating the average median values of corresponding bit-plane sub-blocks. A retrieval database is created to relate the obtained hash sequences to the bit-plane features. In addition, a security analysis of the proposed technique is carried out for the first time. The experimental results demonstrate that this approach achieves better robustness against various attacks, has a larger capacity, requires less time to extract hash sequences, and has a higher success rate of concealing information than existing coverless video steganography techniques.
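The pipeline in this abstract (LBP coding, bit-plane splitting, hash bits from sub-block statistics) can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the abstract does not define how the "average median values" become hash bits, so the rule used here (compare the mean of each sub-block with the mean of the next one) is an assumption, and all function names and the block size are hypothetical.

```python
def lbp_image(gray):
    """Return the 8-neighbour LBP code for each interior pixel of a 2-D list."""
    h, w = len(gray), len(gray[0])
    # Clockwise neighbour offsets, starting at the top-left pixel.
    offsets = [(-1, -1), (-1, 0), (-1, 1), (0, 1), (1, 1), (1, 0), (1, -1), (0, -1)]
    out = []
    for y in range(1, h - 1):
        row = []
        for x in range(1, w - 1):
            code = 0
            for dy, dx in offsets:
                # Each neighbour contributes one bit: 1 if it is >= the centre.
                bit = 1 if gray[y + dy][x + dx] >= gray[y][x] else 0
                code = (code << 1) | bit
            row.append(code)
        out.append(row)
    return out


def bit_planes(img):
    """Split an 8-bit image into 8 binary planes; index 0 is the least significant bit."""
    return [[[(p >> b) & 1 for p in row] for row in img] for b in range(8)]


def block_means(plane, block):
    """Mean of each non-overlapping block x block tile, in row-major order."""
    h, w = len(plane), len(plane[0])
    means = []
    for y in range(0, h - block + 1, block):
        for x in range(0, w - block + 1, block):
            total = sum(plane[y + i][x + j] for i in range(block) for j in range(block))
            means.append(total / (block * block))
    return means


def hash_bits(plane, block):
    """Assumed rule: emit 1 when a block mean is >= the mean of the next block."""
    m = block_means(plane, block)
    return [1 if a >= b else 0 for a, b in zip(m, m[1:])]


# Hypothetical 6x6 grayscale frame; the LBP image covers its 4x4 interior.
frame = [[(3 * x + 5 * y * y) % 256 for x in range(6)] for y in range(6)]
codes = lbp_image(frame)
hashes = [hash_bits(plane, 2) for plane in bit_planes(codes)]
print(hashes)  # one short hash sequence per bit-plane
```

In a coverless scheme of this kind, such hash sequences would be indexed in a retrieval database so that secret bits are mapped to frames whose hashes already match, rather than being embedded by modifying the frames.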

Keywords: Steganography; Bit-plane Complexity Segmentation (BPCS); Local Binary Pattern (LBP); Coverless video Steganography; Cover Object

Enhancing Privacy-Utility Tradeoff with Few-Round Strategy in Heterogeneous Federated Learning

2024 Conference on Visual Communications and Image Processing

Authors: Wei, Qingbin; Zhang, Feilong; Bai, Yuanchao; Zhai, Deming; Jiang, Junjun; Liu, Xianming (Harbin Inst Technol Fac Comp Harbin Peoples R China)

ISBN: (print) 9798331529543; 9798331529550

Federated learning inherently provides a certain level of privacy protection, which, however, is often inadequate in many real-world scenarios. Existing privacy-preserving methods frequently incur unbearable time overheads or result in non-negligible deterioration of model performance, thus suffering from the tradeoff between performance and privacy. In this work, we propose a novel Federated Privacy-Preserving Knowledge Transfer framework, namely FedPPKT, which employs data-free knowledge distillation in a meta-learning manner to rapidly generate pseudo data and perform privacy-preserving knowledge transfer. FedPPKT establishes a protective barrier between the original private data and the federated model, thereby ensuring user privacy. Furthermore, by leveraging its few-round strategy, FedPPKT can reduce the number of communication rounds, further mitigating the risk of privacy exposure for user data. With the help of the meta generator, the problem of uneven local label distribution on clients is alleviated, mitigating data heterogeneity and improving model performance. Experiments show that FedPPKT outperforms the state-of-the-art privacy-preserving federated learning methods. Our code is publicly available at https://***/HIT-weiqb/FedPPKT.

Keywords: Generative image Compression; Learned image Compression

2024 13th Mediterranean Conference on Embedded Computing, MECO 2024

13th Mediterranean Conference on Embedded Computing, MECO 2024

ISBN: (print) 9798350387568

The proceedings contain 114 papers. The topics discussed include: application of artificial neural networks for processing some biomedical data; distinguishing between AI images and real images with hybrid image classification methods; securing Durres Port's digital transformation: cybersecurity strategy for maritime industry; linguistic encryption for underwater communication; a toolset for blood pressure visualization and measurement in time, frequency and time-frequency domains; using a shape from polarization to determine the 3D surface of objects with thermal radiation; on the influence of cell libraries and other parameters to SCA resistance of crypto IP cores; integration of PXROS-HR with micro-ROS in robotic systems; and traffic-aware video streaming topology reconfiguration for smart city applications.

Human stability assessment and fall detection based on dynamic descriptors

IET Image Processing, 2023, vol. 17, no. 11, pp. 3177-3195

Authors: Gutierrez, Jesus; Martin, Sergio; Rodriguez, Victor (Univ Nacl Educ Distancia UNED Elect & Comp Engn Dept Juan Del Rosal 12 Madrid Spain; EU Politecn Zaragoza EduQTech Maria De Luna 3 Zaragoza Spain)

Fall detection systems use a number of different technologies to achieve their goals. This way, they contribute to better living conditions for the elderly community. Artificial vision is one of these technologies and, within this field, it has gained momentum over the last few years as a consequence of the incorporation of different artificial neural networks (ANNs). These ANNs share a common characteristic: they are used to extract descriptors from images and video clips that, properly processed, will determine whether a fall has taken place. These descriptors, which capture kinematic features associated with the fall, are inferred from datasets recorded by young volunteers or actors who simulate falls. Systems based on this concept offer excellent performance in tests which use that kind of dataset. However, given the well-documented differences between these falls and the real ones, concerns about system performances when processing falls of elderly people are *** work implements an alternative approach to the classical use of kinematic descriptors. To do so, for the first time to the best of the authors' knowledge, the authors propose the introduction of human dynamic stability descriptors used in other fields to determine whether a fall has taken place. These descriptors approach the human body in terms of balance and stability; this way, differences between real and simulated falls become irrelevant, as all falls are a direct result of failures in the continuous effort of the body to keep balance, regardless of other considerations. The descriptors are determined by using the information provided by a neural network able to estimate the body centre of mass and the feet projections onto the ground plane, as well as the feet contact *** theory behind this new approach and its validity is studied in this article with very promising results, as it is able to match or exceed the performances of previous systems using kinematic de

Keywords: convolutional neural nets; image processing

An analytical research using image processing to create an architectural virtual scene

2nd International Conference on Physics, Photonics, and Optical Engineering, ICPPOE 2023

Authors: Ma, Bin; Luo, Xianyong; An, Xu; Guo, Jiabin; Liu, Peng (China Construction Seventh Engineering Division. Corp. LTD. Chongqing China; Kunming University of Science and Technology Yunnan China)

ISBN: (print) 9781510674684

The aim of this research is to explore methods for building architectural virtual scenes based on image processing techniques to meet the needs of the fields of architectural design, visualization and simulation. With the rapid development of digital technologies, the creation of architectural virtual scenes has become increasingly important as they provide a real-time, immersive way to present and evaluate architectural projects. Firstly, this study discusses the fundamentals of architectural virtual scenes, including the creation of virtual building models, the capture and fusion of real scenes, and the implementation of user interaction. Through image processing techniques, real architectural environments can be combined with virtual architectural elements to provide users with a highly realistic visual experience. Secondly, the study covers the application of image processing methods that help to process real scene images to make them suitable for embedding and interacting with virtual architectural models. This study highlights the potential of architectural virtual scenes for application in architectural design and visualization. With virtual scenes, architects and designers can make real-time design and layout adjustments, while all aspects of the architectural project can be visualized, including the exterior, structural and interior design. Finally, this study summarizes the importance of image processing in the construction of architectural virtual scenes, while exploring future research directions such as the development of augmented reality technology, improvements in virtual-reality fusion and user interactivity. Research in these areas will continue to drive innovation in architectural virtual scenes, providing more possibilities and flexibility in the field of architecture. © COPYRIGHT 2024 SPIE. Downloading of the abstract is permitted for personal use only.

Keywords: Architectural design

Grid Sample Based Temporal Iteration for Fully Pipelined 1-ms SLIC Superpixel Segmentation System

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2024, vol. E107D, no. 4, pp. 515-524

Authors: Li, Yuan; Hu, Tingting; Fuchikami, Ryuji; Ikenaga, Takeshi (Waseda Univ Grad Sch Informat Prod & Syst Kitakyushu 8080135 Japan; Panasonic Connect Co Ltd Fukuoka 8128531 Japan)

A 1 millisecond (1-ms) vision system, which processes videos at 1000 frames per second (FPS) within 1 ms/frame delay, plays an increasingly important role in fields such as robotics and factory automation. Superpixel segmentation, one of the most extensively employed image oversegmentation methods, is a crucial pre-processing step for reducing computations in various computer vision applications. Among the different superpixel methods, simple linear iterative clustering (SLIC) has gained widespread adoption due to its simplicity, effectiveness, and computational efficiency. However, the iterative assignment and update steps in SLIC make it challenging to achieve high processing speed. To address this limitation and develop a SLIC superpixel segmentation system with a 1 ms delay, this paper proposes grid sample based temporal iteration. By leveraging the high frame rate of the input video, the proposed method distributes the iterations into the temporal domain, ensuring that the system's delay stays within one frame. Additionally, grid sample information is added as initialization information to the obtained superpixel centers to enhance the stability of superpixels. Furthermore, a selective label propagation based pipeline architecture is proposed for parallel computation of all the possibilities of label propagation. This eliminates data dependency between adjacent pixels and enables a fully pipelined system. The evaluation results demonstrate that the proposed superpixel segmentation system achieves boundary recall and under-segmentation error comparable to the original SLIC algorithm. When considering label consistency, the proposed system surpasses the performance of state-of-the-art superpixel segmentation methods. Moreover, in terms of hardware performance, the proposed system processes 1000 FPS images with 0.985 ms/frame delay.
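The idea of distributing SLIC iterations over time can be sketched as follows. This is a simplified software illustration under stated assumptions, not the paper's hardware design: it works on a 1-D row of intensities instead of a 2-D image, searches all centres rather than a local window, and omits both the grid-sample re-initialisation and the selective label propagation pipeline. The function names, the compactness value `m`, and the grid `spacing` are hypothetical.

```python
def slic_step(frame, centers, m=10.0, spacing=4):
    """One SLIC-style assignment + update pass over a 1-D row of intensities.

    centers is a list of (position, intensity) pairs.
    """
    labels = []
    for x, value in enumerate(frame):
        # Distance combines the intensity difference with a scaled spatial difference.
        best = min(
            range(len(centers)),
            key=lambda k: (value - centers[k][1]) ** 2
            + (m / spacing) ** 2 * (x - centers[k][0]) ** 2,
        )
        labels.append(best)
    new_centers = []
    for k, old in enumerate(centers):
        members = [(x, frame[x]) for x in range(len(frame)) if labels[x] == k]
        if not members:
            new_centers.append(old)  # an empty cluster keeps its previous centre
            continue
        n = len(members)
        new_centers.append((sum(p for p, _ in members) / n, sum(v for _, v in members) / n))
    return labels, new_centers


def grid_centers(frame, spacing):
    """Initial centres sampled on a regular grid (middle of each cell)."""
    return [(x, frame[x]) for x in range(spacing // 2, len(frame), spacing)]


def run_temporal(frames, spacing=4):
    """Spend exactly one iteration per frame, carrying the centres across frames."""
    centers = grid_centers(frames[0], spacing)
    results = []
    for frame in frames:
        labels, centers = slic_step(frame, centers, spacing=spacing)
        results.append(labels)
    return results


# Hypothetical input: five identical frames containing one intensity edge.
frames = [[0, 0, 0, 0, 100, 100, 100, 100] for _ in range(5)]
print(run_temporal(frames)[-1])
```

Because consecutive frames at 1000 FPS differ very little, the centres carried over from the previous frame are already close to converged, which is why a single pass per frame can stand in for the usual inner iteration loop.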

Keywords: image processing system; real-time; SLIC; superpixel; FPGA

Automated grading of oleaster fruit using deep learning

SCIENTIFIC REPORTS, 2025, vol. 15, no. 1, pp. 1-11

Authors: Azadpour, Aram; Mollazade, Kaveh; Ramezani, Mohsen; Samimi-Akhijahani, Hadi (Univ Kurdistan Fac Agr Dept Biosyst Engn Sanandaj Iran; Univ Kurdistan Fac Engn Dept Comp Engn Sanandaj Iran)

The agriculture sector is crucial to many economies, particularly in developing regions, with post-harvest technology emerging as a key growth area. The oleaster, valued for its nutritional and medicinal properties, has traditionally been graded manually based on color and appearance. As global demand rises, there is a growing need for efficient automated grading methods. Therefore, this study aimed to develop a real-time machine vision system for classifying oleaster fruit at various grading velocities. Initially, in the offline phase, a dataset containing video frames of four different quality classes of oleaster, categorized based on the Iranian national standard, was acquired at different linear conveyor belt velocities (ranging from 4.82 to 21.51 cm/s). The Mask R-CNN algorithm was used to segment the extracted frames to obtain the position and boundary of the samples. Experimental results indicated that, with a 100% detection rate and an average instance segmentation accuracy error ranging from 4.17 to 5.79%, the Mask R-CNN algorithm is capable of accurately segmenting all classes of oleaster at all the examined grading velocity levels. The results of the fivefold cross validation indicated that the general YOLOv8x and YOLOv8n models, created using the dataset obtained from all conveyor belt velocity levels, have a similarly reliable classification performance. Therefore, given its simpler architecture and lower processing time requirements, the YOLOv8n model was used to evaluate the grading system in real-time mode. The overall classification accuracy of this model was 92%, with a sensitivity range of 87.10-94.89% for distinguishing different classes of oleaster at a grading velocity of 21.51 cm/s. The results of this study demonstrate the effectiveness of deep learning-based models in developing grading machines for the oleaster fruit.
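The overall accuracy and per-class sensitivity figures quoted in this abstract are standard confusion-matrix metrics. The sketch below shows how such values are typically computed; the four-class counts are made up for illustration and are not the study's data, and the function names are hypothetical.

```python
def per_class_sensitivity(confusion):
    """confusion[i][j] = number of class-i samples predicted as class j.

    Sensitivity (recall) of class i is the diagonal count over the row total.
    """
    return [row[i] / sum(row) if sum(row) else 0.0 for i, row in enumerate(confusion)]


def overall_accuracy(confusion):
    """Fraction of all samples that lie on the diagonal."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total if total else 0.0


# Hypothetical counts for four quality classes (rows: true class, columns: predicted).
confusion = [
    [45, 3, 1, 1],
    [4, 42, 3, 1],
    [1, 2, 46, 1],
    [0, 1, 2, 47],
]
print(per_class_sensitivity(confusion))
print(overall_accuracy(confusion))
```

Reporting a sensitivity range alongside the overall accuracy, as the abstract does, shows whether any single quality class is classified noticeably worse than the others.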

Keywords: image segmentation; Mask R-CNN; Quality evaluation; real-time classification; YOLOv8

TOWARDS ZERO-LATENCY VIDEO TRANSMISSION THROUGH FRAME EXTRAPOLATION

IEEE International Conference on Image Processing (ICIP)

Authors: Vijayaratnam, Melan; Cagnazzo, Marco; Valenzise, Giuseppe; Trioux, Anthony; Kieffer, Michel (Inst Polytech Paris LTCI Telecom ParisTech Paris France; Univ Padua Dept Informat Engn Padua Italy; Univ Paris Saclay CNRS Cent Supelec Lab Signaux & Syst Gif Sur Yvette France; Univ Polytech Hauts de France CNRS UMR 8520 DOAEIEMN Valenciennes France)

ISBN: (digital) 9781665496209

ISBN: (print) 9781665496209

In the past few years, several efforts have been devoted to reducing individual sources of latency in video delivery, including acquisition, coding and network transmission. The goal is to improve the quality of experience in applications requiring real-time interaction. Nevertheless, these efforts are fundamentally constrained by technological and physical limits. In this paper, we investigate a radically different approach that can arbitrarily reduce the overall latency by means of video extrapolation. We propose two latency compensation schemes where video extrapolation is performed either at the encoder or at the decoder side. Since a loss of fidelity is the price to pay for compensating latency arbitrarily, we study the latency-fidelity compromise using three recent video prediction schemes. Our preliminary results show that, with the best extrapolator, a typical latency of 100 ms can be compensated at the cost of an 8 dB loss in PSNR. This approach is promising but also suggests that further work should be done in video prediction to pursue zero-latency video transmission.
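To put the reported 8 dB PSNR loss in perspective, the standard definition PSNR = 10 log10(peak^2 / MSE) implies that a loss of d dB multiplies the mean squared error by 10^(d/10). The sketch below computes PSNR for flat lists of 8-bit samples and that multiplier; the function names and sample values are hypothetical, and nothing here reproduces the paper's extrapolators.

```python
import math


def psnr(reference, test, peak=255.0):
    """Peak signal-to-noise ratio in dB between two equal-length sample lists."""
    mse = sum((a - b) ** 2 for a, b in zip(reference, test)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals
    return 10.0 * math.log10(peak ** 2 / mse)


def mse_ratio_for_psnr_loss(loss_db):
    """Factor by which the MSE grows when PSNR drops by loss_db decibels."""
    return 10.0 ** (loss_db / 10.0)


# Hypothetical pixel rows: the extrapolated row deviates slightly from the true one.
true_row = [10, 20, 30, 40]
extrapolated_row = [12, 18, 33, 40]
print(psnr(true_row, extrapolated_row))
print(mse_ratio_for_psnr_loss(8.0))  # roughly 6.3 times the squared error
```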

Keywords: Extrapolation low-latency video delivery video deep learning

Perceptive Driving Assistant System for Opencast Mines During Foggy Weather

MINING METALLURGY & EXPLORATION, 2022, vol. 39, no. 6, pp. 2431-2447

Authors: Choudhary, Monika; Kumari, Sushma; Chaulya, Swades Kumar; Prasad, Girendra Mohan; Kumar, Vikash; Kumar, Naresh (CSIR Cent Inst Min & Fuel Res Dhanbad 826001 Bihar India)

During harsh weather conditions, fog and dust in the environment degrade image quality, which reduces visibility for drivers of heavy earth-moving machinery in opencast mines. Due to low visibility, mining operations cannot be carried out, as drivers are prone to accidents. To overcome this problem, this paper proposes a vision enhancement system, called the perceptive driving assistant system, that increases the visibility of real-time video of the road in front of the vehicle for operators of heavy earth-moving machinery at opencast mines during harsh weather conditions. The system consists of high-quality Internet Protocol cameras and thermal cameras for real-time image processing, together with other well-defined devices, and is capable of enhancing the visibility of the image, outlining the edges of the road, and detecting obstacles on the operator's path for smooth driving and a reduced threat of accidents. A high-speed graphics processing unit is used for parallel computing, which is well suited to fast real-time operation. The calculated frame rates of image enhancement, object detection, and edge detection are 17.91, 15.91, and 25.09 fps, respectively. The actual frame rate is 26.07 fps, and after applying the algorithm, the final frame rate is 19.65 fps. The calculated accuracy of the object detection model is 81.23%. Field trials indicate that the developed system has performed adequately during foggy weather.

Keywords: Convolution neural network; Edge detection; image defogging; Object detection; Foggy weather
