The robustness of object detection models is a major concern when they are applied to real-world scenarios. The performance of most models tends to degrade when confronted with images affected by corruptions, since they are usually trained and evaluated on clean datasets. While numerous studies have explored the robustness of object detection models on natural images, there is a paucity of research on models applied to aerial images, which feature complex backgrounds, substantial variations in scale, and diverse object orientations. This article addresses the challenge of assessing the robustness of object detection models on aerial images, with a specific emphasis on scenarios where images are affected by clouds. In this study, we introduce two novel benchmarks based on DOTA-v1.0. The first benchmark encompasses 19 prevalent corruptions, while the second focuses on the cloud-corrupted condition, a phenomenon uncommon in natural images yet frequent in aerial photography. We systematically evaluate the robustness of mainstream object detection models and perform the necessary ablation experiments. Through our investigations, we find that rotation-invariant modeling and enhanced backbone architectures can improve the robustness of models. Furthermore, increasing the capacity of Transformer-based backbones can strengthen their robustness. The benchmarks we propose and our comprehensive experimental analyses can facilitate research on robust object detection on aerial images.
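A corruption benchmark of this kind pairs clean images with synthetically corrupted variants at graded severities. As a hedged illustration only (the paper's 19 corruption types and severity scales are not specified here), one common corruption, additive Gaussian noise, can be sketched in pure Python:

```python
import random

def gaussian_noise(image, severity=1, seed=0):
    """Apply additive Gaussian noise, a common benchmark corruption.

    `image` is a 2D list of grayscale values in [0, 255]. The severity
    scale (1-5) and sigma values are illustrative assumptions, loosely
    mirroring how corruption benchmarks grade intensity.
    """
    sigma = [4, 8, 12, 18, 26][severity - 1]
    rng = random.Random(seed)
    return [
        [min(255, max(0, round(px + rng.gauss(0, sigma)))) for px in row]
        for row in image
    ]

# A flat gray test image; every severity keeps pixels in valid range.
clean = [[128] * 8 for _ in range(8)]
corrupted = gaussian_noise(clean, severity=3)
```

Evaluating a detector on such corrupted copies, relative to its clean-image score, is what quantifies the robustness gap the abstract discusses.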
Reducing defects in components additively manufactured using the Laser-Directed Energy Deposition (L-DED) process is important for ensuring structural integrity, surface quality, and functional performance. The first step required to reduce defects in L-DED-manufactured components is the identification and understanding of the types of defects using an object detection approach. This paper aims to use YOLO-based object detection models to classify and detect defects in the horizontal wall, vertical wall, and cuboid structures manufactured using various combinations of L-DED process parameters. The objectives involve training, testing, and validating YOLOv7, YOLOv8, YOLOv9, and YOLOv9-GELAN models on an independent dataset of defects such as flash formation, voids, and rough texture; identifying the best YOLO model capable of detecting multiple defects of both small and large sizes within a single image; and comparing the defects captured by the YOLO model with those from a previously used conventional CNN model, VGG16. The results revealed that YOLOv9-GELAN exhibited better performance indicators than the other YOLO models. The increasing trend for mAP0.5:0.95 marks YOLOv9-GELAN as a good choice for detecting multiple defects in a single image. It also achieved a mAP of 95.7%, a precision of 94%, a recall of 96%, and an F1-score of 90%, indicating accurate defect localisation and classification with minimal false positives and negatives. These high values indicate YOLOv9-GELAN's capability to accurately highlight defects with bounding boxes compared to the previously proposed VGG16 model. In addition, YOLOv9-GELAN's ability to process 62 images per second shows its potential for higher frame throughput than the other YOLO models. This research will advance the development of AI-based in-situ defect monitoring for the L-DED process.
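The mAP0.5:0.95 metric reported above scores detections across a sweep of Intersection-over-Union (IoU) thresholds. As a minimal sketch of the IoU computation at its core (box format and names are illustrative, not taken from the paper):

```python
def iou(a, b):
    """Intersection-over-Union of two axis-aligned boxes (x1, y1, x2, y2).

    A predicted defect box counts as a true positive only when its IoU
    with a ground-truth box exceeds the evaluation threshold.
    """
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union else 0.0
```

For example, two unit-offset 2x2 boxes overlap in a 1x1 region, giving an IoU of 1/7; a perfect match gives 1.0.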
ISBN: (Print) 9789819608041; 9789819608058
Modern applications, such as autonomous vehicles, require deploying deep learning algorithms on resource-constrained edge devices for real-time image and video processing. However, there is limited understanding of the efficiency and performance of various object detection models on these devices. In this paper, we evaluate the performance of several state-of-the-art object detection models, including YOLOv8 (Nano, Small, Medium), EfficientDet Lite (Lite0, Lite1, Lite2), and SSD (SSD MobileNet V1, SSDLite MobileDet), on popular edge devices such as the Raspberry Pi 3, 4, and 5 (with and without TPU accelerators), as well as the Jetson Orin Nano. We collect key performance metrics, including energy consumption, inference time, and Mean Average Precision (mAP). Our findings highlight that lower-mAP models such as SSD MobileNet V1 are more energy-efficient and faster at inference, whereas higher-mAP models like YOLOv8 Medium generally consume more energy and infer more slowly, though with exceptions when accelerators like TPUs are used. Among the edge devices, the Jetson Orin Nano stands out as the fastest and most energy-efficient option for request handling, despite having the highest idle energy consumption.
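The edge-device comparison above reduces each run to a few derived quantities. A minimal sketch of that bookkeeping, assuming the raw measurements are a frame count, wall-clock time, and metered energy (the paper's exact measurement protocol is not given here):

```python
def efficiency_metrics(num_images, total_seconds, total_joules):
    """Derive per-image throughput, latency, and energy cost
    from one benchmarking run on an edge device."""
    return {
        "images_per_second": num_images / total_seconds,
        "latency_ms": 1000.0 * total_seconds / num_images,
        "joules_per_image": total_joules / num_images,
    }

# Hypothetical run: 500 frames in 25 s drawing 150 J total.
m = efficiency_metrics(num_images=500, total_seconds=25.0, total_joules=150.0)
```

Comparing `joules_per_image` rather than average power is what makes a fast accelerator look efficient even when its instantaneous draw is higher.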
The objective of this study was to develop an interpretable system that could detect specific lung features in neonates. A challenging aspect of this work was that normal lungs showed the same visual features as those of Pneumothorax (PTX). M-mode is typically necessary to differentiate between the two cases, but its generation in clinics is time-consuming and requires expertise for interpretation, which remains limited. Therefore, our system automates M-mode generation by extracting Regions of Interest (ROIs) without a human in the loop. Object detection models, namely the Faster Region-Based Convolutional Neural Network (fRCNN) and RetinaNet, were employed to detect seven common Lung Ultrasound (LUS) features. fRCNN predictions were then stored and further used to generate M-modes. Beyond static feature extraction, we used a Hough-transform-based statistical method to detect "lung sliding" in these M-modes. Results showed that fRCNN achieved a greater mean Average Precision (mAP) of 86.57% (Intersection-over-Union (IoU) = 0.2) than RetinaNet, which displayed a mAP of only 61.15%. The calculated accuracy for the generated ROIs was 97.59% for Normal videos and 96.37% for PTX videos. Using this system, we successfully classified 5 PTX and 6 Normal video cases with 100% accuracy. Automating the detection of these seven prominent LUS features addresses the time-consuming manual evaluation of lung ultrasound in a fast-paced environment. Clinical impact: Our research provides a more accurate and efficient method for diagnosing lung diseases in neonates.
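An M-mode trace samples one scan line of the B-mode clip in every frame and stacks the samples over time. A minimal sketch of that reshaping step, assuming the ROI has already been reduced to a single pixel column (the paper's actual ROI-to-M-mode pipeline is not detailed in the abstract):

```python
def extract_m_mode(frames, column):
    """Build an M-mode image from a B-mode clip.

    `frames` is a list of 2D grayscale frames (depth rows x width cols).
    The chosen pixel column is sampled in every frame, so in the output
    rows correspond to depth and columns correspond to time.
    """
    depth = len(frames[0])
    return [[frame[d][column] for frame in frames] for d in range(depth)]

# Synthetic 4-frame clip of 3x3 frames: pixel value encodes 100*t + depth.
frames = [[[100 * t + d for _ in range(3)] for d in range(3)] for t in range(4)]
m_mode = extract_m_mode(frames, column=1)
```

On a real clip, the "seashore" versus "barcode" texture of such a stack is what the Hough-transform step then inspects for lung sliding.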
ISBN: (Print) 9798350384901; 9798350384895
In numerous nations, traffic monitoring systems play an imperative role in overseeing and controlling vehicular and pedestrian traffic. In recent years, various techniques have been presented for automated detection to optimize traffic, and the methods in the literature have their own pros and cons. This paper proposes two different traffic detection models using YOLOv5 and YOLOv8. In addition, it proposes an efficient data pre-processing algorithm to achieve better accuracy in detecting various classes of vehicles in traffic, including pedestrians. An efficient loss optimization strategy is proposed and adopted while training the models to reduce the training loss. This paper discusses the choice between the two deep learning models, YOLOv5 and YOLOv8, for identifying different types of objects of interest on roads in urban areas. The efficiency of the proposed models is evaluated using multiple performance metrics, including accuracy. A comparative analysis with existing models indicates that the proposed models are on par with existing strategies in terms of accuracy while also handling the additional complexity of pedestrian detection.
ISBN: (Print) 9798331505264; 9798331505271
Modern strides in autonomous vehicles and embedded Advanced Driver Assistance Systems (ADAS) have created the need for an efficient and accurate system for road lane and vehicle detection. This study introduces a new proposal that combines YOLOv8 (You Only Look Once) for road lane detection with YOLO for vehicle detection, creating a complete package for road understanding and navigation. For accurate road lane detection, YOLOv8, the fastest and most accurate model to date, is used to track lanes in real time with consistent accuracy, even when the camera operates under insufficient lighting and occlusions. At the same time, YOLO is used for vehicle detection, which improves the recognition and interpretation of the presence and motion of vehicles in real time. The system uses Streamlit, an open-source app framework for Machine Learning and Data Science projects, to provide an intuitive user experience. Finally, we build an interface that presents lane and vehicle detection results in real time, allowing easy monitoring and evaluation of the overall system. The combination of YOLOv8 and YOLO with Streamlit provides a powerful, scalable, and deployable solution for practical computer vision applications. By combining well-established object detection algorithms with a simplified deployment platform, the proposed system tackles some of the hardest and most time-consuming areas of autonomous driving. The goal is to contribute significantly to both the safety and efficiency of autonomous vehicles and pave the way for further advances in ADAS.
ISBN: (Print) 9798350361711; 9798350361704
Food waste presents a significant issue, contributing extensively to greenhouse gas emissions and climate change. When food waste decomposes in landfills, it generates methane, a greenhouse gas significantly more damaging to the atmosphere than carbon dioxide. Additionally, the production, transportation, and disposal of food waste significantly contribute to global greenhouse gas emissions. This study explores an innovative approach to mitigating the environmental impact of food waste by using food scraps to create compost for animal feed, specifically utilizing the Black Soldier Fly Larva (BSFL). Accurate control of the food quantity for the various larval stages is essential, necessitating precise stage classification. This process is complex due to the larvae's similar appearance and small color variations. In this paper, we introduce a mobile application designed to classify and detect the growth stages of BSFL, ranging from stages 1 to 4, which are high in protein and beneficial for animal feed, to stages 5 and 6, which are ideal for preparing pupae that can be used in skincare products. Our approach employs the YOLOv8 model for larval stage classification and detection, achieving an impressive mAP50-95 of 0.812, surpassing the performance of YOLOv7 (mAP50-95 of 0.781) and YOLOv5 (mAP50-95 of 0.789).
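The mAP50-95 figures compared above average the AP measured at each IoU threshold from 0.50 to 0.95 in steps of 0.05 (the COCO-style convention; the paper's exact evaluation code is not shown here). A minimal sketch of that averaging:

```python
def map_50_95(ap_by_threshold):
    """Average AP over the ten IoU thresholds 0.50:0.05:0.95.

    `ap_by_threshold` maps each IoU threshold (rounded to 2 decimals)
    to the AP measured there; all ten thresholds must be present.
    """
    thresholds = [round(0.50 + 0.05 * i, 2) for i in range(10)]
    return sum(ap_by_threshold[t] for t in thresholds) / len(thresholds)

# Hypothetical per-threshold APs, declining as the IoU bar tightens.
aps = {round(0.50 + 0.05 * i, 2): 0.9 - 0.04 * i for i in range(10)}
score = map_50_95(aps)
```

Because the strictest thresholds drag the average down, a model with sloppy box boundaries scores far lower on mAP50-95 than on plain mAP50, which is why the metric is a good discriminator for visually similar larval stages.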
ISBN: (Print) 9781728111797
Over the last few decades, Lung Ultrasound (LUS) has been increasingly used to diagnose and monitor different lung diseases in neonates. It is a noninvasive tool that allows a fast bedside examination while minimally handling the neonate. Acquiring a LUS scan is easy, but understanding the artifacts associated with each respiratory disease is challenging. Mixed artifact patterns found in different respiratory diseases may limit LUS readability by the operator. While machine learning (ML), especially deep learning, can assist in automated analysis, simply feeding the ultrasound images to an ML model for diagnosis is not enough to earn the trust of medical professionals. The algorithm should instead output LUS features that are familiar to the operator. Therefore, in this paper we present a unique approach for extracting seven meaningful LUS features that can be easily associated with a specific pathological lung condition: normal pleura, irregular pleura, thick pleura, A-lines, coalescent B-lines, separate B-lines, and consolidations. These artifacts can lead to early prediction of infants developing later respiratory distress symptoms. A single multi-class region-proposal-based object detection model, Faster-RCNN (fRCNN), was trained on lower posterior lung ultrasound videos to detect these LUS features, which are further linked to four common neonatal diseases. Our results show that fRCNN surpasses single-stage models such as RetinaNet and can successfully detect the aforementioned LUS features with a mean average precision of 86.4%. Instead of a fully automatic diagnosis from images without any interpretability, detection of such LUS features leaves the ultimate control of diagnosis to the clinician, which can result in a more trustworthy intelligent system.
In the rapidly evolving construction industry, timely and accurate monitoring of construction activities is paramount. This paper introduces a novel approach to quantifying construction activity using high-resolution ...
Purpose: Object detection models have gained considerable popularity as they aid many applications, like monitoring, video surveillance, etc. Object detection through video tracking faces many challenges, as most videos obtained as real-time streams are affected by environmental factors.
Design/methodology/approach: This research develops a system for crowd tracking and crowd behaviour recognition using a hybrid tracking model. The input to the proposed crowd tracking system is high-density crowd videos containing hundreds of people. The first step is to detect humans through visual recognition algorithms. Here, a priori knowledge of the location point is given as input to the visual recognition algorithm, which identifies humans through the constraints defined within a Minimum Bounding Rectangle (MBR). Then, the spatial tracking model tracks the path of each human object's movement in the video frame, with tracking carried out by extraction of colour histogram and texture features. A temporal tracking model based on a NARX neural network is also applied, which is effectively utilized to detect the locations of moving objects. Once the path of a person is tracked, the behaviour of every human object is identified using the Optimal Support Vector Machine (OSVM), newly developed by combining SVM with an optimization algorithm, namely MBSO. The proposed MBSO algorithm is developed through the integration of the existing techniques BSA and MBO.
Findings: The data for object tracking are taken from the Tracking in High Crowd Density dataset. The proposed OSVM classifier attained improved performance, with a value of 0.95 for accuracy.
Originality/value: This paper presents a hybrid high-density video tracking model and a behaviour recognition model. The proposed hybrid tracking model tracks the path of the object in the video through temporal tracking and spatial tracking. The features train th
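The spatial tracker above matches appearance features, including a colour histogram, between frames. A minimal sketch of such a histogram feature for a grayscale patch (the paper's actual feature extraction, which also includes texture, is not specified in the abstract):

```python
def intensity_histogram(patch, bins=8):
    """Normalized intensity histogram of an image patch, the kind of
    appearance descriptor a spatial tracker compares across frames.

    `patch` is a 2D list of grayscale values in [0, 255]; the output
    sums to 1, so patches of different sizes remain comparable.
    """
    counts = [0] * bins
    total = 0
    for row in patch:
        for px in row:
            counts[min(px * bins // 256, bins - 1)] += 1
            total += 1
    return [c / total for c in counts]

# Four pixels spread across the intensity range fall into four bins.
hist = intensity_histogram([[0, 255], [128, 64]])
```

Two patches can then be compared with any histogram distance (e.g. intersection or chi-squared) to decide whether they show the same person.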