ISBN (digital): 9798331542726
ISBN (print): 9798331542733
Recent years have seen rapid development in machine learning, which has profoundly influenced many areas of science and engineering. Computer vision takes a leading place among them, where a central task is image classification powered by CNNs. Despite their strong performance in complicated scenarios, CNNs remain sensitive to so-called adversarial attacks: deliberate perturbations that lead them to incorrect predictions. Beyond more innocuous consequences, this has serious security implications for critical applications, including medical diagnostics, where misclassifications might result in disastrous outcomes. This work discusses adversarial attacks on CNNs and other DNNs in computer vision, studying a full range of generation and detection methods in detail while discussing intrinsic vulnerability and robustness. It also proposes a learning framework to enhance the robustness and security of DNNs and CNNs against such adversarial perils. The ultimate goal is to improve the reliability of such models in safety-critical scenarios, enabling safe deployment in applications where accuracy is crucial.
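To make the threat model concrete, here is a minimal sketch of the fast gradient sign method (FGSM), a standard attack from the literature rather than the framework proposed in this work; the model, image, and label arguments are placeholders.

```python
import torch
import torch.nn.functional as F

def fgsm_attack(model, image, label, epsilon=8 / 255):
    """One-step FGSM: perturb each pixel by +/-epsilon in the direction
    that increases the classification loss. `model`, `image` (1xCxHxW in
    [0, 1]) and `label` are placeholders, not objects from the paper."""
    image = image.clone().detach().requires_grad_(True)
    loss = F.cross_entropy(model(image), label)
    loss.backward()
    adversarial = image + epsilon * image.grad.sign()
    return adversarial.clamp(0.0, 1.0).detach()
```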
Optical analog computing based on flat optical structures offers significant advantages in system miniaturization, loss reduction, and computational speed compared to traditional systems requiring complex optical conf...
As brain-inspired optical computing architectures, diffractive optical neural networks (DONNs) harness light's wave nature for high-speed, energy-efficient, and parallel information processing, enabling applications such as image classification and wavefront shaping. However, conventional spatially encoded DONNs struggle with robustness in complex and unpredictable environments, where occlusions and distortions degrade processing accuracy. To address these challenges, we propose a robust all-optical feature extraction framework based on orbital angular momentum (OAM). This approach converts optical information into target OAM modes using a diffractive processing framework trained via deep learning, enabling stable and efficient information representation in the OAM domain. Unlike conventional DONNs, our method maintains high performance across diverse and irregular occlusions without requiring network retraining. This self-adaptive occlusion immunity operates with zero additional training samples, effectively enhancing optical computing tasks under dynamic and uncertain conditions. By fully utilizing the helical wavefront and the orthogonality of OAM modes, our approach improves the robustness and scalability of DONNs, demonstrating superior performance in challenging optical environments. Our work paves the way for next-generation optical computing systems that can operate reliably in unpredictable, occlusion-rich environments, unlocking what we believe to be new possibilities for robust, real-time processing in a variety of applications.
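For intuition about why the OAM domain is a stable encoding basis, the following sketch (not the paper's diffractive framework; the mode profile, beam waist, and grid size are illustrative assumptions) generates two OAM modes and verifies their orthogonality numerically.

```python
import numpy as np

def oam_mode(l, size=256, w0=0.3):
    """Gaussian beam carrying OAM charge l: helical phase exp(i*l*phi)."""
    x = np.linspace(-1, 1, size)
    X, Y = np.meshgrid(x, x)
    r, phi = np.hypot(X, Y), np.arctan2(Y, X)
    field = (r / w0) ** abs(l) * np.exp(-(r / w0) ** 2) * np.exp(1j * l * phi)
    return field / np.sqrt((np.abs(field) ** 2).sum())  # unit power

a, b = oam_mode(1), oam_mode(3)
# Distinct OAM charges are orthogonal, which is what makes the OAM
# domain a stable basis for encoding information under occlusion.
print(abs(np.vdot(a, a)))  # ~1.0: self-overlap
print(abs(np.vdot(a, b)))  # ~0.0: orthogonal modes
```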
Deep learning models based on graph neural networks have emerged as a popular approach for solving computer vision problems. They encode the image into a graph structure and can be beneficial for efficiently capturing...
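As a generic illustration of the image-to-graph encoding this entry alludes to (the specific construction is not visible in the truncated abstract), the sketch below builds a simple grid graph whose nodes are patch features and whose edges connect 4-adjacent patches.

```python
import numpy as np

def image_to_grid_graph(image, patch=8):
    """Encode an image as a graph: nodes are mean-color patch features,
    edges connect 4-adjacent patches. A generic illustration, not a
    specific paper's construction."""
    H, W, C = image.shape
    gh, gw = H // patch, W // patch
    nodes = image[:gh * patch, :gw * patch].reshape(
        gh, patch, gw, patch, C).mean(axis=(1, 3)).reshape(-1, C)
    edges = []
    for i in range(gh):
        for j in range(gw):
            if j + 1 < gw: edges.append((i * gw + j, i * gw + j + 1))
            if i + 1 < gh: edges.append((i * gw + j, (i + 1) * gw + j))
    return nodes, np.array(edges).T  # features (N, C), edge index (2, E)
```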
ISBN (print): 9783031777370; 9783031777387
Object detection in aerial imagery presents significant challenges in computer vision due to the varied orientations and complex backgrounds of objects such as buildings and vehicles. Current annotation tools often fail to accurately delineate these objects, relying on manual bounding box methods that are both time-consuming and inconsistent. Our novel methodology automates the conversion of axis-aligned annotations into polygonal and rotated annotations, prioritising systematic and scalable enhancements to data quality rather than modifying the model itself. Precise annotations, crucial for determining object locations and boundaries, are fundamental to this approach. We evaluated this methodology through a case study involving electrical transmission towers in aerial images, using advanced object detectors based on variations of the YOLOv8 algorithm. Preliminary results indicate that our automated method not only improves annotation accuracy but also significantly reduces the manual effort required, thereby lowering overall costs and time for data preparation in object detection training. The success of this methodology underscores its potential for broader applications and further advancements in automated annotation technologies.
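The paper's exact conversion pipeline is not spelled out in the abstract, but a common recipe, sketched below under the assumption that a binary object mask can be obtained inside each axis-aligned box, derives a polygon and a minimum-area rotated box with OpenCV.

```python
import cv2
import numpy as np

def mask_to_rotated_annotation(mask):
    """From a binary object mask to (polygon, rotated-box corners).
    A generic recipe, not necessarily the paper's exact pipeline."""
    contours, _ = cv2.findContours(mask.astype(np.uint8),
                                   cv2.RETR_EXTERNAL, cv2.CHAIN_APPROX_SIMPLE)
    contour = max(contours, key=cv2.contourArea)    # keep the largest object
    polygon = cv2.approxPolyDP(contour, 2.0, True)  # simplified outline
    rect = cv2.minAreaRect(contour)                 # ((cx, cy), (w, h), angle)
    corners = cv2.boxPoints(rect)                   # 4 corners of rotated box
    return polygon.reshape(-1, 2), corners
```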
ISBN (digital): 9798331510831
ISBN (print): 9798331510848
Longitudinal medical image processing is a significant task for understanding the dynamic changes of disease by taking and comparing image series over time, providing insights into how conditions evolve and enabling more accurate diagnosis and treatment planning. While recent advancements in biomedical Vision-Language Pre-training (VLP) have enabled label-efficient representation learning with paired medical images and reports, existing methods primarily pair a single image with the corresponding textual report, limiting their ability to capture temporal relationships. To address this limitation, it is essential to learn temporal-aware cross-modal representations from sequential medical images and text reports that highlight the temporal changes occurring between examinations. Specifically, we introduce TempA-VLP, a temporal-aware vision-language pre-training framework with a cross-exam encoder that integrates information from both prior and current examinations. This approach enables the model to capture dynamic representations that reflect disease progression over time, which allows us to (i) achieve state-of-the-art performance in disease progression classification, (ii) localize dynamic progression regions across consecutive examinations, as demonstrated in our new task, dynamic phrase grounding on the Chest ImaGenome Gold dataset, and (iii) highlight progression-localized regions, often relevant to lesion areas, which in turn improves disease classification tasks on a single image.
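A minimal sketch of what a cross-exam encoder could look like, assuming the common cross-attention formulation (layer sizes and names are illustrative, not the paper's configuration):

```python
import torch
import torch.nn as nn

class CrossExamEncoder(nn.Module):
    """Hypothetical cross-exam encoder: current-exam image tokens attend
    to prior-exam tokens so fused features encode the change between
    examinations. Dimensions are assumptions, not the paper's values."""
    def __init__(self, dim=768, heads=8):
        super().__init__()
        self.cross_attn = nn.MultiheadAttention(dim, heads, batch_first=True)
        self.norm = nn.LayerNorm(dim)

    def forward(self, current_tokens, prior_tokens):
        # Query with the current exam; key/value with the prior exam.
        delta, _ = self.cross_attn(current_tokens, prior_tokens, prior_tokens)
        return self.norm(current_tokens + delta)  # temporal-aware tokens
```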
ISBN (digital): 9798331517649
ISBN (print): 9798331517656
Summarization approaches aim to meaningfully reduce different types of data such as text, audio, and video. Many techniques, including machine learning, signal processing, image processing, computer vision, and deep learning, can be used to develop summarization approaches. In this study, we performed object detection on videos suitable for smart city applications using a pretrained YOLOv8 model. Based on the detections, we created a feature vector for each image frame using the location information covered by the classes involved in the object detection process. Then, we used several different approaches to determine a reference feature vector for the video. Finally, we calculated the cosine similarity of each frame's feature vector to this reference feature vector using different methods. With the method we developed, we presented a similarity-focused summary created by selecting the video frames with maximum similarity. We also developed an evaluation approach for the summaries we produced, comparing the overall heat maps of the video with the heat maps of the summary videos. Experimental results demonstrate the efficiency of our summarization approaches.
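A condensed sketch of this pipeline, assuming the ultralytics YOLOv8 API and using the mean vector as one possible reference; the video path, feature definition, and summary length are placeholders, not the paper's exact choices.

```python
import numpy as np
from ultralytics import YOLO  # assumes the ultralytics package

NUM_CLASSES = 80  # COCO classes detected by the pretrained model

def frame_vector(result):
    """Per-class sum of normalized detection-box areas for one frame,
    a plausible reading of the location-based feature vector."""
    vec = np.zeros(NUM_CLASSES)
    for box in result.boxes:
        x1, y1, x2, y2 = box.xyxyn[0].tolist()  # normalized coordinates
        vec[int(box.cls)] += (x2 - x1) * (y2 - y1)
    return vec

model = YOLO("yolov8n.pt")
results = model("city_video.mp4", stream=True)   # placeholder input video
vectors = np.stack([frame_vector(r) for r in results])
reference = vectors.mean(axis=0)                 # one choice of reference
sims = vectors @ reference / (
    np.linalg.norm(vectors, axis=1) * np.linalg.norm(reference) + 1e-9)
summary_frames = np.argsort(sims)[-50:]          # most-similar frames
```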
The absence of standardized evaluation methodologies for single-layer dimensional accuracy significantly hinders the broader implementation of direct ink writing (DIW) technology. Addressing the critical need for prec...
Image loading represents a critical bottleneck in modern machine learning pipelines, particularly in computer vision tasks where JPEG remains the dominant format. This study presents a systematic performance analysis ...
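The libraries and metrics under test are not visible in this truncated entry; the snippet below is only a generic way to benchmark JPEG decoding with two common loaders, with 'photo.jpg' a placeholder path.

```python
import time
import cv2
from PIL import Image

def time_decode(fn, path, repeats=100):
    """Average wall-clock time to decode one JPEG with loader `fn`."""
    start = time.perf_counter()
    for _ in range(repeats):
        fn(path)
    return (time.perf_counter() - start) / repeats

pil_load = lambda p: Image.open(p).convert("RGB")      # forces full decode
cv2_load = lambda p: cv2.imread(p, cv2.IMREAD_COLOR)

for name, fn in [("PIL", pil_load), ("OpenCV", cv2_load)]:
    print(name, f"{time_decode(fn, 'photo.jpg') * 1e3:.2f} ms")
```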
ISBN (digital): 9798350368741
ISBN (print): 9798350368758
Recent Compositional Zero-Shot Learning (CZSL) methods increasingly adopt pre-trained vision-language models to capture the contextual relations between image and text spaces. However, the single-class-token design of the Transformer-based encoder inevitably captures contextual information from unrelated objects and background, hindering the modeling of fine-grained class-specific visual features. Suffering from the cross-modal gap, prior methods also struggle to improve compositional recognition performance. To address these issues, we propose a fine-grained cross-modal concepts refinement framework, termed Refiner, which comprises two pivotal components: (i) fine-grained concept refinement of image embeddings to capture state-object context within visual scenes, and (ii) cross-modal information fusion to mitigate the modality gap. By leveraging learnable query vectors to capture region-specific semantic information pertinent to composition labels, our approach refines visual representations with fine-grained state-object context information. For cross-modal information fusion, we construct a robust image-to-text mapping by aligning visual embeddings with states, objects, and compositions, respectively. Extensive experiments demonstrate that our Refiner achieves new state-of-the-art performance across all popular benchmarks in both closed- and open-world settings.
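A hedged sketch of the learnable-query refinement idea, assuming standard cross-attention from queries to patch tokens (dimensions, head counts, and names are illustrative, not taken from the paper):

```python
import torch
import torch.nn as nn

class ConceptRefiner(nn.Module):
    """Sketch of refining image embeddings with learnable queries: a
    small set of query vectors cross-attends to patch tokens to pool
    region-specific state/object context. Sizes are assumptions."""
    def __init__(self, num_queries=8, dim=512, heads=8):
        super().__init__()
        self.queries = nn.Parameter(torch.randn(num_queries, dim) * 0.02)
        self.attn = nn.MultiheadAttention(dim, heads, batch_first=True)

    def forward(self, patch_tokens):          # (B, N_patches, dim)
        q = self.queries.expand(patch_tokens.size(0), -1, -1)
        refined, _ = self.attn(q, patch_tokens, patch_tokens)
        return refined                        # (B, num_queries, dim)
```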