Deep learning-based approaches, such as Convolutional Neural Nets (CNNs), have shown high performance in classifying contents of images. CNNs, however, have the notable drawbacks of potentially high computing costs, p...
详细信息
ISBN:
(纸本)9781510674219;9781510674202
Deep learning-based approaches, such as Convolutional Neural Nets (CNNs), have shown high performance in classifying contents of images. CNNs, however, have the notable drawbacks of potentially high computing costs, poor explainability, and wide performance variance if the underlying imagery data deviates from the training baseline. As advanced imageprocessing capabilities are matured, the on-board detection of objects in space-based imagery is increasingly proposed. On-board satellite processingapplications, which may be resource-limited, can drive the need for simpler models that reduce the necessary computing burden for edge computing applications. This raises the question of how well classic computer vision techniques can compete with more modern approaches. This paper characterizes and compares the performance of multiple computer vision models for the application of distinguishing maritime vessels from typical clutter in commercial electrooptical (EO) satellite imagery. A Support Vector machine (SVM) model using manually curated features is compared to multiple DL-based models spanning a range of model sizes, with the goal of determining whether classical approaches can compete favorably with DL when computational resources are taken into consideration. Differences in performance and processing resources are characterized between the approaches. Findings include that the SVM-based model may approach the accuracy of some CNN-based models for classifying images of clouds in satellite EO imagery for smaller DL-based models. However, even the smallest DL-based models, which take about the same computational resources as the SVM-based model, generally out-perform the SVMbased model. This finding may have implications for the operational use of on-board processing techniques for satellite payloads.
With the rapid advances of deep learning-based computer vision (CV) technology, digital images are increasingly consumed, not by humans, but by downstream CV algorithms. However, capturing high-fidelity and high-resol...
详细信息
ISBN:
(纸本)9798400700958
With the rapid advances of deep learning-based computer vision (CV) technology, digital images are increasingly consumed, not by humans, but by downstream CV algorithms. However, capturing high-fidelity and high-resolution images is energy-intensive. It not only dominates the energy consumption of the sensor itself (i.e. in low-power edge devices), but also contributes to significant memory burdens and performance bottlenecks in the later storage, processing, and communication stages. In this paper, we systematically explore a new paradigm of in-sensor processing, termed "learned compressive acquisition" (LeCA). Targeting machinevisionapplications on the edge, the LeCA framework exploits the joint learning of a sensor autoencoder structure with the downstream CV algorithms to effectively compress the original image into low-dimensional features with adaptive bit depth. We employ column-parallel analog-domain processing directly inside the image sensor to perform the compressive encoding of the raw image, resulting in meaningful hardware savings, and energy efficiency improvements. Evaluated within a modern machinevisionprocessing pipeline, LeCA achieves 4x, 6x, and 8x compression ratios prior to any digital compression, with minimal accuracy loss of 0.97%, 0.98%, and 2.01% on imageNet, outperforming existing methods. Compared with the conventional full-resolution image sensor and the state-of-the-art compressive sensing sensor, our LeCA sensor is 6.3x and 2.2x more energy-efficient while reaching a 2x higher compression ratio.
The main purpose of this study is to explore the issues of real-time, accurate, and unmarked recognition of sports movements in recent years. By reviewing the relevant research on machine learning or deep learning for...
详细信息
ISBN:
(纸本)9798350374407
The main purpose of this study is to explore the issues of real-time, accurate, and unmarked recognition of sports movements in recent years. By reviewing the relevant research on machine learning or deep learning for specific sports or target actions based on computer visionimage data input, the aim is to provide references for the application of unmarked motion capture technology in the field of sports motion recognition. The research employed a literature review methodology, conducting searches in six databases, namely Web of Science, PubMed, Scopus, Google Scholar, IEEE Xplore, and China National Knowledge Infrastructure (CNKI), covering publications from January 2000 to June 2020. Through boolean logic operations on the retrieved literature, key information such as first author/publication year, types/targets of motion, participant information, camera parameters, image feature extraction techniques, action recognition algorithms, evaluation methods for action recognition quality, training and validation methods for image data, and performance metrics for action recognition were extracted. After screening, a total of 23 articles were included in the study. The findings revealed that $39 \%$ of the studies utilized machine learning algorithms based on support vector machines, while $35 \%$ employed deep learning algorithms based on convolutional neural networks. Commonly used evaluation metrics for action recognition quality included classification accuracy, confusion matrix, and displacement error. The development of computer vision motion capture, models, and algorithms demonstrated promising applications in areas such as action technique recognition and sports performance analysis. Traditional machine learning algorithms like support vector machines and principal component analysis remain dominant in action recognition technology;however, in certain scenarios, the performance of deep learning algorithms surpassed that of traditional machine learning methods.
This paper introduces the structure and operation mode of automatic production line based on the actual situation of laser quenching automatic production line of tool in enterprises. Robot vision integrates workpiece ...
详细信息
ISBN:
(纸本)9781665464680
This paper introduces the structure and operation mode of automatic production line based on the actual situation of laser quenching automatic production line of tool in enterprises. Robot vision integrates workpiece positioning coordinates with robot coordinates to realize the positioning and grasping function of robot through machinevision. Focus on OpenCV imageprocessing methods. This paper describes its principle and possible problems from the aspects of system structure, robot coordinate calibration, visual identification and positioning and software design.
This paper investigates the optimization and deployment of YOLOv7 deep learning model on NVIDIA Jetson Nano, an AI-focused edge computing platform for object detection in various computer visionapplications. The work...
详细信息
Recent studies point to an accuracy gap between humans and Artificial Neural Network (ANN) models when classifying blurred images, with humans outperforming ANNs. To bridge this gap, we introduce a spectral channel-ba...
详细信息
image classification is one of the main parts of computer vision, which is important in applications like self-driving automotives/vehicle systems. While working with image/video data it needs huge amount of resources...
详细信息
The integration of human-robot interaction (HRI) technologies with industrial automation has become increasingly essential for enhancing productivity and safety in manufacturing environments. In this paper, we propose...
详细信息
The proceedings contain 39 papers. The topics discussed include: performance analysis of several CNN based models for brain MRI in tumor classification;MRI-based lumbar sagittal alignment classification system;3D mapp...
ISBN:
(纸本)9798350352368
The proceedings contain 39 papers. The topics discussed include: performance analysis of several CNN based models for brain MRI in tumor classification;MRI-based lumbar sagittal alignment classification system;3D mapping and landing zone identification in complex terrains using DSM and photogrammetry;vision language models for oil palm fresh fruit bunch ripeness classification;towards no shadow: region-based shadow compensation on low-altitude urban aerial images;comparative analysis of deep learning architectures for blood cancer classification;exploration of group and shuffle module for semantic segmentation of sea ice concentration;on handcrafted machine learning features for art authentication;and acoustic signature modelling of marine vessels in various environmental and operational conditions.
image stabilization plays a crucial role in providing accurate and reliable visual information for machinevisionapplications. In maritime applications, such as unmanned ship navigation, where six degrees of freedom ...
详细信息
ISBN:
(纸本)9798350388350;9798350388343
image stabilization plays a crucial role in providing accurate and reliable visual information for machinevisionapplications. In maritime applications, such as unmanned ship navigation, where six degrees of freedom (DOF) motion and harsh maritime conditions prevail, the efficacy of image stabilization technology is vital for robust imageprocessing algorithms. This paper offers a comprehensive review of image stabilization techniques tailored for maritime environments, developed over the past two decades. We analyzed a total of 39 research articles on the subject, sourced from Web-of-Science, SCOPUS, and the Engineering Index databases, discussing potential research directions to address the limitations of current image stabilization methods, with special consideration for the unique requirements of ship-borne cameras. It provides an up-to-date overview of the techniques, limitations, and algorithms of ship-borne cameras for maritime applications, identifying current knowledge gaps and areas requiring further research. This review aims to guide the development of new technologies and methods to improve the performance of image stabilization systems in maritime contexts.
暂无评论