Search Results - Inner Mongolia University Library

8th International Conference on Electrical, Mechanical and Computer Engineering, ICEMCE 2024

Authors: Dong, Xianfa; Wang, Xuezhong. Affiliation: Beijing Institute of Petrochemical Technology, College of New Materials and Chemical Engineering, Beijing, China

ISBN: (print) 9798331506230

Froth flotation is an important process in the mineral processing industry for extracting valuable materials. This work investigates online microscopic imaging and machine learning based image analysis methods for real-time monitoring of the process. Previous limited work explored imaging the foam at the top surface layer of the froth flotation process. The new process imaging system in this work uses a corrosion-resistant online real-time imaging probe that can be put into the inside of the slurry and capture real-time images of bubbles. The acquired images are analyzed online using deep learning algorithms to automatically obtain key parameters of the bubbles, providing valuable data for froth flotation research and process control. © 2024 IEEE.

Keywords: Froth flotation

Exploring the frontiers of image super-resolution: a review of modern techniques and emerging applications

Neural Computing and Applications, 2025, pp. 1-49

Authors: Hassan, Esraa; El-Rashidy, Nora; Elbedwehy, Samar; Abd El-Hafeez, Tarek; Saber, Abeer; Shams, Mahmoud Y. Affiliations: Faculty of Artificial Intelligence, Kafrelsheikh University, Kafrelsheikh 33516, Egypt; Computer Science Unit, Deraya University, Minia 61765, Egypt; Department of Computer Science, Faculty of Science, Minia University, Minia 61519, Egypt; Department of Information Technology, Faculty of Computers and Artificial Intelligence, Damietta University, New Damietta, Egypt

Super-resolution (SR) aims to reconstruct high-resolution images from low-resolution inputs, with deep learning advancements driving substantial improvements in SR performance. This paper presents a comprehensive review of single- and multi-image SR techniques, analyzing findings from 12,873 research papers published between 2015 and 2025 in the computer science field. Key insights are derived from fifteen summary tables covering various SR tasks, including natural, medical, video, burst, depth map, and underwater image SR. The analysis highlights several major findings: (1) the integration of specialized modules, such as attention mechanisms, has led to consistent yearly improvements in performance metrics like PSNR and SSIM; (2) domain-specific architectures often outperform general models, particularly in medical and underwater SR applications; (3) while benchmark datasets enable objective comparisons, real-world validation remains limited, reducing the generalizability of current approaches; (4) inconsistent metric reporting across studies hampers reproducibility and fair evaluation; and (5) practical deployment considerations, including computational efficiency and real-time processing, are rarely addressed. Despite significant progress, challenges such as the need for more diverse training datasets, robust validation, and better interpretability persist. This review synthesizes these critical findings, offering an updated perspective on SR advancements, emerging trends, and future research directions. © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2025.

Keywords: Medical computing
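
For readers unfamiliar with the PSNR metric named in the abstract above, the following is a minimal sketch of its standard definition, 10 * log10(MAX^2 / MSE). It assumes 8-bit grayscale images stored as nested lists; the function name `psnr` and the `max_value` parameter are illustrative choices and are not taken from the reviewed paper.

```python
import math


def psnr(reference, restored, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two equally sized grayscale images.

    Both images are nested lists of pixel intensities (an assumption made
    for this sketch so that it needs only the standard library).
    """
    ref = [p for row in reference for p in row]
    out = [p for row in restored for p in row]
    if not ref or len(ref) != len(out):
        raise ValueError("images must be non-empty and have the same size")
    # Mean squared error over all pixels.
    mse = sum((a - b) ** 2 for a, b in zip(ref, out)) / len(ref)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(max_value ** 2 / mse)
```

Higher values indicate a reconstruction closer to the reference; identical images give infinity under this definition.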

Student Behavior Identification During Practice and Training Based on Video Image

TRAITEMENT DU SIGNAL, 2023, vol. 40, no. 1, pp. 249-256

Authors: Chen, Wei; Fan, Xinqiao; Dai, Fengwei; Chen, Tingting. Affiliation: Hangzhou Vocat & Tech Coll, Coll Commerce & Tourism, Hangzhou 310018, Peoples R China

Enriching and developing the connotation and value of labor education theories can help students in higher vocational colleges form a correct viewpoint and attitude towards labor. Higher vocational colleges should put more effort into education through practice based on the features of each discipline. Accurately identifying students' behavior in complex practice and training scenarios is very important for teachers to understand students' status during practice and training. However, existing research results are not applicable to such scenarios, since they have considered neither how to improve the accuracy of static image identification while keeping the model lightweight, nor the time series information of students' behavior in the collected video images. For this reason, this paper took the property management major as the subject to study the identification of student behavior during practice and training based on video images. In the paper, the students' practice and training content was divided into three aspects, a task requiring students to cooperate in handling an equipment failure emergency was adopted for the research, and a research approach was determined: helping teachers understand students' status during practice and training by identifying their actions and intentions during these activities. Then, several pre-processing operations were performed on the captured video images of student behavior, including removing abnormal image frames, filtering, and aligning. After that, based on the collected video image data, the dynamic convolution kernel was improved and optimized, and a lightweight convolution network model was built for identifying student behavior during practice and training. Finally, experimental results verified the validity of the proposed identification model.

Keywords: student behavior; practice and training; behavior identification; video image; convolution network model

The Application of AI Video Generation Technology in Virtual Reality (VR) and Augmented Reality (AR)

2024 International Conference on Artificial Intelligence, Deep Learning and Neural Networks, AIDLNN 2024

Author: Deng, Biqin. Affiliation: Guangzhou Institute of Science and Technology, Guangzhou 510540, China

ISBN: (print) 9798331520816

AI has become one of the key forces driving social progress, particularly in the fields of VR and AR, where the application of AI video generation technology is spearheading a technological revolution. This article delves into the specific applications, potential impacts, and future development trends of AI video generation technology in VR and AR. VR technology enables users to fully immerse themselves and interact with virtual environments. AR technology, on the other hand, overlays virtual information onto the real world, integrating virtual objects and information into the user's actual surroundings. AI video generation technology plays a crucial role in these two domains. In the VR sector, AI video generation technology, leveraging deep learning techniques, can learn and mimic human creative styles, automatically generating video content. Through image processing and machine learning technologies, this article explores how AI can generate high-quality virtual objects and scenes, making them more realistic and seamlessly integrated with the real world. Additionally, this article analyzes the challenges and opportunities faced by AI video generation technology in VR and AR applications, such as technological bottlenecks, data privacy, and security issues. AI video generation technology is poised to play an increasingly significant role in the VR and AR sectors. © 2024 IEEE.

Keywords: Virtual environments

Processing of Airborne Video SAR Data Using the Modified Back Projection Algorithm

IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2022, vol. 60, p. 1

Authors: Cheng, Yu; Ding, Jinshan; Sun, Zhengyang; Zhong, Chao. Affiliation: Xidian Univ, Natl Lab Radar Signal Proc, Xian 710071, Peoples R China

This article proposes an algorithm framework recently developed for video synthetic aperture radar (ViSAR) and presents the processing results of the airborne data collected by a W-band radar. This framework aims to accelerate the time-domain algorithm and produce synthetic aperture radar (SAR) video that has no change of view angle, which is highly desired in target tracking. The presented framework includes a few modified processing algorithms, where the strategy of subaperture recursion is proposed. The modified acceleration algorithm is used to avoid the interpolation and adapt to high squint angles without rotation of image grids. In addition, the conventional autofocus method has been improved to deal with urban data that usually contain strong scatterers. The algorithm framework can achieve SAR video without rotation of view angle with a lower computational load compared to other existing time-domain methods, which has been verified on airborne real data. Its processing efficiency becomes impressive if the apertures are highly overlapped in some cases that require a very high frame rate.

Keywords: Cartesian factorized back projection (CFBP); radar imaging; recursive back projection (RBP); synthetic aperture radar (SAR); video SAR (ViSAR)

Drowsiness Detection and Head Pose Estimation in Online Learning Platforms with Image Processing

4th IEEE Interdisciplinary Conference on Electrics and Computer (INTCEC)

Authors: Unsal, Gurcan; Tekerek, Adem. Affiliation: Gazi Univ, Technol Fac, Comp Engn Dept, Ankara, Turkiye

ISBN: (print) 9798350349467; 9798350349450

The concept of education, which has existed for hundreds of years, is being moved to online environments, especially with the increase in internet use. Although the use of internet-based applications in education increases accessibility to information, it makes it difficult to evaluate students' performances fairly. In recent years, the number of image processing studies on this subject has been increasing in order to take online education platforms to the next level. In this study, fatigue estimation was made to measure the performance of students in distance education systems. Some machine learning and image processing methods were used for fatigue prediction. In the proposed study, two different data sets consisting of 15182 images were used. Python and Flask framework are used in model training. A web-based application was developed with Flask that performs real-time fatigue detection and head position estimation via webcam.

Keywords: Image processing; Machine learning; Object detection

Real-Time Video Frame Interpolation Using GANs for Enhanced Streaming Service

4th International Conference on Electrical, Computer, Communications and Mechatronics Engineering, ICECCME 2024

Authors: Divya, N.; Arjun, U.; Vardhan, V. Vishnu; Raju, M. Tirumala; Sai, T. Yuvanth; Gopal, M. Syam. Affiliations: Computational Science, University of Southern Mississippi, Data Analyst in UMMC, United States; Gloom Dev Pvt Ltd, Penamaluru, Andhra Pradesh, Vijayawada 521139, India; Gloom Dev Pvt Ltd, Andhra Pradesh, Vijayawada 521139, India; GloomDev Pvt Ltd, Penamaluru, Andhra Pradesh, Vijayawada 521139, India; Machine Learning Architect at GloomDev Pvt Ltd, Penamaluru, Andhra Pradesh, Vijayawada 521139, India

ISBN: (print) 9798350391183

This work presents a novel approach to real-time video frame interpolation using Generative Adversarial Networks (GANs) to enhance streaming services. We developed a custom GAN architecture comprising a generator, which predicts intermediate frames through transposed convolutions, and a discriminator, which evaluates frame authenticity. The model processes video frames by resizing them to 64 × 64 pixels and normalizing pixel values. The training alternates between updating the generator and discriminator to minimize adversarial loss, improving the generator's ability to create realistic frames. Our results, tracked through loss curves and visual comparisons, demonstrate high-quality frame interpolation, enhancing visual smoothness and continuity in video streams. This method showcases the potential of GANs for real-time video processing and future innovations in the field. © 2024 IEEE.

Keywords: Generative adversarial networks
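
The abstract above mentions resizing frames to 64 × 64 pixels and normalizing pixel values before they reach the GAN. A minimal sketch of such a step could look like the following. The abstract does not state the interpolation method or the normalization range, so nearest-neighbour sampling and scaling to [-1, 1] are assumptions made here for illustration, and `preprocess_frame` is a hypothetical name rather than the authors' code.

```python
def preprocess_frame(frame, size=64):
    """Resize a grayscale frame to size x size and scale pixels to [-1, 1].

    `frame` is a list of rows with 8-bit intensities (0-255). Nearest-neighbour
    sampling and the [-1, 1] range are assumptions of this sketch.
    """
    height, width = len(frame), len(frame[0])
    resized = []
    for y in range(size):
        src_y = y * height // size  # source row for this output row
        row = []
        for x in range(size):
            src_x = x * width // size  # source column for this output column
            row.append(frame[src_y][src_x] / 127.5 - 1.0)
        resized.append(row)
    return resized
```

Scaling to [-1, 1] is a common pairing with a generator that ends in a tanh activation; a [0, 1] range would be an equally plausible reading of the abstract.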

Context Enhanced Transformer for Single Image Object Detection in Video Data

38th AAAI Conference on Artificial Intelligence (AAAI) / 36th Conference on Innovative Applications of Artificial Intelligence / 14th Symposium on Educational Advances in Artificial Intelligence

Authors: An, Seungjun; Park, Seonghoon; Kim, Gyeongnyeon; Baek, Jeongyeol; Lee, Byeongwon; Kim, Seungryong. Affiliations: Korea Univ, Seoul, South Korea; SK Telecom, Seoul, South Korea

ISBN: (print) 1577358872

With the increasing importance of video data in real-world applications, there is a rising need for efficient object detection methods that utilize temporal information. While existing video object detection (VOD) techniques employ various strategies to address this challenge, they typically depend on locally adjacent frames or randomly sampled images within a clip. Although recent Transformer-based VOD methods have shown promising results, their reliance on multiple inputs and additional network complexity to incorporate temporal information limits their practical applicability. In this paper, we propose a novel approach to single image object detection, called Context Enhanced TRansformer (CETR), by incorporating temporal context into DETR using a newly designed memory module. To efficiently store temporal information, we construct a class-wise memory that collects contextual information across data. Additionally, we present a classification-based sampling technique to selectively utilize the relevant memory for the current image. At test time, we introduce a memory adaptation method that updates individual memory functions by considering the test distribution. Experiments with the CityCam and ImageNet VID datasets exhibit the efficiency of the framework on various video systems. The project page and code will be made available at: https://***/CETR.

Keywords: Object detection

Real-time Image Enhancement for Emergency Rescue Scenarios in Smart Grids

4th IEEE International Conference on Electronic Technology, Communication and Information, ICETCI 2024

Authors: Dou, Zeng; Tao, Jingyi; Lv, Ning; Cong, Li; Huang, Chengbin; Su, Tao. Affiliations: Power Company Limited Information Communication Company, State Grid Jilin Province Electric, Changchun, China; Xidian University, School of Electronic Engineering, Xidian Hangzhou Institute of Technology, Hangzhou, China; Xidian University, School of Electronic Engineering, Xi'an, China

ISBN: (print) 9798350361643

Image enhancement technology plays an important role in the practical application of detecting underwater tunnels. By improving image quality, it enhances the accuracy and reliability of detection results. However, in deep-sea scenes, image color distortion and detail loss are common. To address this issue, a real-time underwater image enhancement algorithm based on weighted multi-branch correction is proposed in this paper. The algorithm uses a combination of multi-scale and multi-branch techniques to process the image. The multi-branch correction module has been improved by adding a weighting and downsampling strategy, which not only improves the processing speed but also preserves the processing effect. In the air-frequency domain interactive processing module, zero padding and window functions have been introduced to reduce the ringing effect and block effect. In addition, the smart grid system provides stable power support for underwater tunnel search and rescue, which ensures the safety of the search and rescue environment and also provides a good experimental lighting environment for the image enhancement algorithm. The model has been trained and validated using self-constructed real deep-sea datasets. The existing algorithms and the algorithm proposed in this paper have been compared and analyzed comprehensively. The experimental results show that the proposed algorithm outperforms the existing algorithms in terms of processing effect, and the processing speed is improved by about 12.7%. © 2024 IEEE.

Keywords: Image enhancement

Edge Computing-Enabled Lightweight Deep Neural Network for Real-Time Video Surveillance in Maritime Cyber-Physical Systems

9th International Conference on Intelligent Computing and Signal Processing, ICSP 2024

Authors: Luo, Caofei; Wu, Fumin; Xiong, Renhao; Xu, Guanhua; Liu, Wen. Affiliations: Maritime Electronics Research Institute Co. Ltd., Ningbo 315040, China; College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310058, China; School of Navigation, Wuhan University of Technology, Wuhan 430063, China

ISBN: (print) 9798350376548

With the rapid development of low-end IoT devices and artificial intelligence (AI), heterogeneous and dynamic data have been increasing drastically, especially in the era of AI-enabled maritime cyber-physical systems (CPS). It is intractable to provide efficient and flexible computer vision applications through traditional cloud-centric data processing strategies, which commonly suffer from limitations such as high cost, high-latency communication, and high storage occupancy. To meet these challenges, edge computing has evolved into a flexible and powerful distributed architecture. It contributes to improvements in bandwidth savings and response time in practical applications. To guarantee efficient video surveillance in AI-powered maritime CPS, an edge computing-based lightweight deep network (termed ShipNet-YOLOv3), combined with Kalman filtering and the Hungarian algorithm, is proposed to implement real-time detection and tracking of moving ships. In particular, ShipNet-YOLOv3 adopts Darknet-53 as the backbone network (i.e., feature extractor) with three prediction boxes. A channel pruning and fine-tuning strategy is further introduced to balance the trade-off between computational cost, network size, and detection (or tracking) accuracy. Experimental results on both synthetic and realistic scenarios have illustrated that the proposed method could provide superior ship detection and tracking results under different adverse imaging conditions. It thus has the capacity to promote real-time video surveillance in AI-enabled maritime CPS. © 2024 IEEE.

Keywords: Kalman filters
