检索结果-内蒙古大学图书馆

22nd IEEE/CVF Winter Conference on applications of Computer vision (WACV)

作者： Ashrafee, Alif Khan, Akib Mohammed Irbaz, Mohammad Sabik Al Nasim, Md Abdullah Islamic Univ Technol Dept Comp Sci & Engn Gazipur Bangladesh Pioneer Alpha Ltd Machine Learning Team Dhaka Bangladesh

ISBN: (纸本)9781665458245

Automatic License Plate Recognition systems aim to provide a solution for detecting, localizing, and recognizing license plate characters from vehicles appearing in video frames. However, deploying such systems in the real world requires real-time performance in low-resource environments. In our paper, we propose a two-stage detection pipeline paired with vision API that provides real-time inference speed along with consistently accurate detection and recognition performance. We used a haar-cascade classifier as a filter on top of our backbone MobileNet SSDv2 detection model. This reduces inference time by only focusing on high confidence detections and using them for recognition. We also impose a temporal frame separation strategy to distinguish between multiple vehicle license plates in the same clip. Furthermore, there are no publicly available Bangla license plate datasets, for which we created an image dataset and a video dataset containing license plates in the wild. We trained our models on the image dataset and achieved an AP(0.5) score of 86% and tested our pipeline on the video dataset and observed reasonable detection and recognition performance (82.7% detection rate, and 60.8% OCR F1 score) with real-time processing speed (27.2 frames per second).

关键词： Computer vision image recognition Conferences Pipelines Focusing Licenses Streaming media

来源：评论

学校读者我要写书评

暂无评论

Evaluating the Impact of Lossy Compression on ADAS Deep Learning Models using Fisheye Cameras 26

Evaluating the Impact of Lossy Compression on ADAS Deep Lear...

引用

26th Irish machine vision and image processing Conference, IMVIP 2024

作者： Simha, Srinidhi Mukanahallipatna Molloy, Dara Fahy, Darren Valeo Vision Systems Ireland University of Galway Ireland

ISBN: (纸本)9781837242672

The increasing deployment of Advanced Driver Assistance Systems (ADAS) alongside the continual rise in camera sensor resolution has led to high bandwidth, and generally high cost, computation, and intra-vehicle communication. While the sensor bandwidth impacts the vehicle architecture, it also affects the data collection, storage, deep learning model training, and validation infrastructures. However, if the bandwidth was low, while still achieving the goal of high accuracy ADAS perception, the time and cost associated with creating and deploying the system would be greatly reduced. This study investigates the influence of lossy compression on multi-task deep learning models for real-time perception in ADAS employing fisheye cameras. We leverage a large-scale dataset and train a representative multi-task ADAS perception model for pedestrian, kerb, line, and soiling classification. The testing dataset is subjected to compression using the popular H.264 video codec at varying compression ratios. Through rigorous evaluation, we analyse the effects of compression on model performance, providing insights into the feasibility of employing lossy compression techniques in ADAS applications. Our results reveal that lossy compression could be deployed in automotive perception applications and that a compression ratio of up to 98% (720Mb/s to 12Mb/s), could be utilised with negligible performance degradation. © This is an open access article published by the IET under the Creative Commons Attribution License (http://***/licenses/by/3.0/)

关键词： Advanced driver assistance systems

来源：评论

学校读者我要写书评

暂无评论

Automatic Data processing for Space Robotics machine Learning 74

Automatic Data Processing for Space Robotics Machine Learnin...

引用

74th International Astronautical Congress, IAC 2023

作者： Sheppard, Anja Skinner, Katherine A. Department of Robotics University of Michigan 2505 Hayward St Ann ArborMI48109 United States

Autonomous terrain classification is an important problem in planetary navigation, whether the goal is to identify scientific sites of interest or to traverse treacherous areas safely. Past Martian rovers have relied on human operators to manually identify a navigable path from transmitted imagery. Our goals on Mars in the next few decades will eventually require rovers that can autonomously move farther, faster, and through more dangerous landscapes-demonstrating a need for improved terrain classification for traversability. Autonomous navigation through extreme environments will enable the search for water on the Moon and Mars as well as preparations for human habitats. Advancements in machine learning techniques have demonstrated potential to improve terrain classification capabilities for ground vehicles on Earth. However, classification results for space applications are limited by the availability of training data suitable for supervised learning methods. This paper contributes an open source automatic data processing pipeline that uses camera geometry to co-locate Curiosity and Perseverance Mastcam image products with Mars overhead maps via ray projection over a terrain model. In future work, this automated data processing pipeline will be leveraged for development of machine learning methods for terrain classification. Copyright © 2023 by the International Astronautical Federation (IAF). All rights reserved.

关键词： computer vision geographic information systems open source robotics space

来源：评论

学校读者我要写书评

暂无评论

Early vision on the Focal-Plane with High Dynamic Range Pixels

Early Vision on the Focal-Plane with High Dynamic Range Pixe...

引用

International Workshop on Compressed Sensing Theory and its applications to Radar, Sonar and Remote Sensing (CoSeRa)

作者： Marko Jaklin D. García-Lesta P. López V.M. Brea Centro Singular de Investigación en Tecnoloxías Intelixentes (CiTIUS) Universidade de Santiago de Compostela Santiago de Compostela Spain

ISBN: (数字)9798350365504

ISBN: (纸本)9798350365511

This paper introduces a high dynamic range pixel for early vision processing. Early vision is the first stage to subsequently extract semantic information for image processing or video analytics. This paper proposes to bring said processing to the focal plane, next to a high dynamic range image sensor working on the principle of lateral overflow capacitor. This brings the benefits of processing scenes with a wide dynamic range in a power efficient manner. Circuit simulations for edge detection, as an example of early vision processing conveyed in this paper, show that our proposal meets the accuracy typically found in applications like machine vision. Simulations are in XFAB’s XS018 technology.

关键词： image sensors Accuracy Power demand image edge detection Visual analytics Multimodal sensors Semantics Radar imaging High dynamic range Proposals

来源：评论

学校读者我要写书评

暂无评论

SPA 2024 Tutorial

SPA 2024 Tutorial

引用

Signal processing Algorithms, Architectures, Arrangements and applications (SPA)

来源：评论

学校读者我要写书评

暂无评论

An optimized automated recognition of infant sign language using enhanced convolution neural network and deep LSTM

引用

MULTIMEDIA TOOLS AND applications 2023年第18期82卷 28043-28065页

作者： Enireddy, Vamsidhar Anitha, J. Mahendra, N. Kishore, G. Koneru Lakshmaiah Educ Fdn Dept Comp Sci & Engn Guntur 522502 Andhra Pradesh India Malla Reddy Engn Coll Dept Comp Sci & Engn Hyderabad 500100 Telangana India Miracle Educ Soc Grp Inst Miracle City 535216 Andhra Pradesh India RISE Krishna Sai Prakasam Grp Inst Dept CSE Ongole 523272 Andhra Pradesh India

In the world, several sign languages (SL) are used, and BSL (Baby Sign Language) is the process of communication between the parents and baby using gestures. Communication by gestures is a non-verbal process that utilizes motion to pass on realities, expressions and feelings to people. SL is the communication mode in which the information is conveyed via movement of body parts like cheeks, eyebrows and head. Even though many research works based on SL are available, research in BSL remains a challenge. Hence, this paper presents an optimization-based automated recognition of the deep BSL system, which determines the gesture signalled by the kids. Initially, the image frames are extracted from the videos and data augmentation processes are performed. After pre-processing, the features are extracted from the frames using the Enhanced Convolution Neural Network (ECNN). The optimal characteristics are then selected by a new Life Choice Based Optimizer (LCBO). Finally, the classification is carried out by the Deep Long Short-Term Memory (DLSTM) scheme. The implementation is performed on the Python platform, and the performances are evaluated using several performance metrics such as accuracy, precision, kappa, f1-score and recall. The performance of the proposed approach (ECNN-DLSTM) is compared with several deep and machine learning approaches and obtains an accuracy of 99% and a kappa of 96%.

关键词： Baby sign language Automated recognition Computer vision Optimization

来源：评论

学校读者我要写书评

暂无评论

A real-time SC²S-based open-set recognition in remote sensing imagery

引用

JOURNAL OF REAL-TIME image processing 2022年第5期19卷 867-880页

作者： Gyaneshwar, Dubacharla Nidamanuri, Rama Rao Indian Inst Space Sci & Technol Thiruvananthapuram 695547 Kerala India

Accuracy and computational time are two crucial parameters influencing the efficacy of classification algorithms for remote sensing applications. machine learning algorithms are known for achieving notable success for several classification problems in various domains, including remote sensing. However, they are well-recognized and considered accurate and efficient for closed-set recognition (CSR) but may provide suboptimal and erroneous results for open-set recognition (OSR) tasks. Many practical image-driven and computer vision applications have open-set and dynamic scenarios with unknown data where classification algorithms have not yet achieved significant prediction performance. This paper presents a group of class-aware (CA) classification algorithms based on a supervised cascaded classifier system ((SCS)-S-2), called CA-(SCS)-S-2, which is accurate for OSR and CSR tasks. We evaluate the prediction accuracy of the proposed methods against the state-of-the-art methods in a multiclass setting using multiple image classification scenarios of OSR and CSR. The test case scenarios use six multispectral and hyperspectral datasets from different sensing platforms. And to assess the computational performance of the methods, we designed various field-programmable gate array (FPGA) architectures of the proposed methods. We evaluated their real-time performance on a low-cost, low-power Artix-7 35 T FPGA.

关键词： Supervised cascaded classifier system ((SCS)-S-2) Class-aware (SCS)-S-2 (CA-(SCS)-S-2) image classification Open-set recognition (OSR) Close-set recognition (CSR) Field-programmable gate array (FPGA) Remote sensing imagery

来源：评论

学校读者我要写书评

暂无评论

Application of the image processing Technique for Powerline Robot 11th

Application of the Image Processing Technique for Powerline ...

引用

11th EAI International Conference on Context-Aware Systems and applications, ICCASA 2022

作者： Ngo, Ha Quang Thinh Bui, The Tri 268 Ly Thuong Kiet Street District 10 Ho Chi Minh City Viet Nam Linh Trung Ward Thu Duc City Ho Chi Minh City Viet Nam

ISBN: (纸本)9783031288159

Applying image processing to electromechanical systems is a problem of interest to scientists, in order to serve humans in many fields. To do that, there needs to be a connection between image processing and mechanical construction to create complete mobile cameras. One of the research directions is about mobile cameras, specifically a system consisting of dual cameras that detect and track moving objects, and at the same time calculate the distance from the dual camera system to the target, this system can be application in object tracking robot. In this paper, the research object includes the camera system designed according to the pan-tilt structure, the algorithm used for object detection is YOLO-based on CNN, estimating the distance from the camera system to the object. By means of stereo vision, control the pan-tilt system to automatically track objects. © 2023, ICST Institute for Computer Sciences, Social Informatics and Telecommunications Engineering.

关键词： machine learning

来源：评论

学校读者我要写书评

暂无评论

Pigment Epithelial Detachment Detection: A Review of Imaging Techniques and Algorithms

Pigment Epithelial Detachment Detection: A Review of Imaging...

引用

2022 International Conference on Advanced Computing Technologies and applications, ICACTA 2022

作者： Sheeba, T.M. Albert Antony Raj, S. Anand, M. SRM Institute of Science and Technology College of Science and Humanities Department of Computer Applications Chennai India SRM Institute of Science and Technology College of Engineering and Technology Department of Networking and Communications Chennai India

ISBN: (纸本)9781665495158

Pigment epithelial detachment(PED) is a disorder in retina that happens when RPE layers of cells at the back side of the eye come apart, or get teared. The bend of layers in the retina, as well as fluid, proteins, tissue, or blood vessels, is a defining feature of PED disease, which occurs most frequently in the macula. PED can disturb the vision of the people which is often depict dark shadow, blurry vision or partial loss of vision. The optical coherence tomography (OCT) is a trend set of high resolution and non-invasive imaging modality that expedite the structure of the retina. OCT non-invasively yields cross-sectional volume of images with tissues. The major objective of this research paper is to study, state of art and to classify the retinal layer segmentation techniques, PED fluid segmentation and classification of diseases in retinal OCT images. The medical industry is suffering with more critical patients and the cases are increasing in eye diseases double the number as of now. The artificial intelligence (AI) techniques help the health sector with a great and accurate automatic detection of disease. The image classification and pattern recognition are transforming the industry with artificial intelligence techniques. Many studies are being conducted employing image processing to aid in the early diagnosis of this disease. image processing techniques have advanced as a result of the introduction of artificial intelligence and machine learning. In this review paper, the structure classification methods and the image segmentation method that are best available existing research is discussed. This review summarizes all the recent algorithms that suits for the application of machine learning algorithms for predicting retinal diseases in OCT images. The algorithms discussed from existing research paper, produce the readers to identify the best accurate algorithm for retinal classification of infected eye and normal eye, precision and less processing time for la

关键词： Optical tomography

来源：评论

学校读者我要写书评

暂无评论

Multilevel Crop image Segmentation Using Firefly Algorithm and Recursive Minimum Cross Entropy 4th

Multilevel Crop Image Segmentation Using Firefly Algorithm a...

引用

4th International Conference on machine Intelligence and Signal processing, MISP 2022

作者： Kumar, Arun Kumar, A. Vishwakarma, Amit PDPM Indian Institute of Information Technology Design and Manufacturing Jabalpur India

ISBN: (纸本)9789819900466

image segmentation plays an important role in computer vision technology and agriculture is one of their applications. The crop images present near the vicinity are complex and dense. Hence, multilevel thresholding of such crop images is a tedious task. In this paper, we propose multilevel thresholding of crop images using recursive minimum cross entropy and firefly algorithm. The firefly is based on the social behavior of the swarm of fireflies and bioluminescent information-sharing phenomena. It is a swarm-based algorithm, which offers a better search mechanism to find the optimum threshold value. The performance of the proposed method is observed over ten complex background crop images and compared with the wind-driven optimization algorithm. The better fidelity parameters evidence the superiority of the proposed method. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

关键词： image segmentation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：