In recent years, object detection algorithms have achieved great success in the field of machinevision. To pursue the detection accuracy of the model, the scale of the network is constantly increasing, which leads to...
详细信息
In recent years, object detection algorithms have achieved great success in the field of machinevision. To pursue the detection accuracy of the model, the scale of the network is constantly increasing, which leads to the continuous increase in computational cost and a large requirement for memory. The larger network scale allows their execution to take a longer time, facing the balance between the detection accuracy and the speed of execution. Therefore, the developed algorithm is not suitable for real-time applications. To improve the detection performance of small targets, we propose a new method, the real-time object detection algorithm based on transfer learning. Based on the baseline Yolov3 model, pruning is done to reduce the scale of the model, and then migration learning is used to ensure the detection accuracy of the model. The object detection method using transfer learning achieves a good balance between detection accuracy and inference speed and is more conducive to the real-time processing of images. Through the evaluation of the dataset voc2007 + 2012, the experimental results show that the parameters of the Yolov3-Pruning(transfer): model are reduced by 3X compared with the baseline Yolov3 model, and the detection accuracy is improved, realizes real-time processing, and improves the detection accuracy.
Remote Sensing image Captioning (RSIC) is crucial for many researchers since it has many applications in environmental monitoring, disaster management, urban planning, image retrieval, performance of building planes, ...
详细信息
We propose a CNN-based framework for "real-time object detection and tracking using deep learning" in this paper, which includes a spatial–temporal mechanism. The impact of efficient data on performance ben...
详细信息
Aerospace, civil, energy, and mechanical engineering structures continue to be used despite reaching their design lifetime. Developing sensing and data analytics to assess the structural condition of the targeted syst...
详细信息
ISBN:
(纸本)9783031072581;9783031072574
Aerospace, civil, energy, and mechanical engineering structures continue to be used despite reaching their design lifetime. Developing sensing and data analytics to assess the structural condition of the targeted systems is crucial. Traditional contact-based techniques may produce inconsistent results and are labor-intensive to be considered a valid alternative for monitoring large-scale structures such as bridges, large buildings, and wind turbines. Advancements in image-processing algorithms made techniques such as three-dimensional digital image correlation (3D-DIC), infrared thermography (IRT), motion magnification (MM), and structure from motion (SfM) appealing tools for structural health monitoring and non-destructive testing. Besides, as those techniques are implemented within unmanned aerial vehicles (UAVs), the measurement process is expedited while reducing interference with the targeted structure. This paper summarizes the research experience performed at the University of Massachusetts Lowell. The results of these activities show that the combination of autonomous flight with 3D-DIC, IRT, and SfM can provide precious insights into the structural conditions of the inspected systems while reducing downtime and costs. The study includes future research directions to make those approaches suitable for real-world applications.
Artificial vision systems will be essential in intelligent machine-visionapplications such as autonomous vehicles, bionic eyes, and humanoid robot eyes. However, conventional digital electronics in these systems face...
详细信息
Artificial vision systems will be essential in intelligent machine-visionapplications such as autonomous vehicles, bionic eyes, and humanoid robot eyes. However, conventional digital electronics in these systems face limitations in system complexity, processing speed, and energy consumption. These challenges have been addressed by biomimetic approaches utilizing optoelectronic synapses inspired by the biological synapses in the eye. Nano- materials can confine photogenerated charge carriers within nano-sized regions, and thus offer significant potential for optoelectronic synapses to perform in-sensor image-processing tasks, such as classifying static multicolor images and detecting dynamic object movements. We introduce recent developments in optoelectronic synapses, focusing on use of photosensitive nanomaterials. We also explore applications of these synapses in recognizing static and dynamic optical information. Finally, we suggest future directions for research on optoelectronic synapses to implement neuromorphic artificial vision.
Low-light image enhancement is an effective solution for improving image recognition by both humans and machines. Due to low illuminance, images captured in such conditions possess less color information compared to t...
详细信息
Low-light image enhancement is an effective solution for improving image recognition by both humans and machines. Due to low illuminance, images captured in such conditions possess less color information compared to those taken in daylight, resulting in occluded images characterized by distortion, low contrast, low brightness, a narrow gray range, and noise. Low-light image enhancement techniques play a crucial role in enhancing the effectiveness of object detection. This paper reviews state-of-the-art low-light image enhancement techniques and their developments in recent years. Techniques such as gray transformation, histogram equalization, defogging, Retinex, image fusion, and wavelet transformation are examined, focusing on their working principles and assessing their ability to improve image quality. Further discussion addresses the contributions of deep learning and cognitive approaches, including attention mechanisms and adversarial methods, to image enhancement.
Light propagating through a nonuniform medium scatters as it interacts with particles with different refractive properties such as cells in the tissue. In this work we aim to utilize this scattering process to learn a...
详细信息
Light propagating through a nonuniform medium scatters as it interacts with particles with different refractive properties such as cells in the tissue. In this work we aim to utilize this scattering process to learn a volumetric reconstruction of scattering parameters, in particular particle densities. We target microscopy applications where coherent speckle effects are an integral part of the imaging process. We argue that the key for successful learning is modeling realistic speckles in the training process. To this end, we build on the development of recent physically accurate speckle simulators. We also explore how to incorporate speckle statistics, such as the memory effect, in the learning framework. Overall, this paper contributes an analysis of multiple aspects of the network design including the learning architecture, the training data and the desired input features. We hope this study will pave the road for future design of learning based imaging systems in this challenging domain. (C) 2022 Optica Publishing Group under the terms of the Optica Open Access Publishing Agreement
In the rapidly evolving landscape of computer vision and artificial intelligence, transfer learning has emerged as a powerful tool for efficiently applying pre-trained models to new tasks. This article delves into the...
详细信息
One of the main causes of cancer-related deaths is lung cancer, and increasing survival rates requires early detection. The use of sophisticated machine learning (ML) algorithms to improve the identification of lung c...
详细信息
Nonlinear estimation in robotics and vision is typically plagued with outliers due to wrong data association or incorrect detections from signal processing and machine learning methods. This article introduces two uni...
详细信息
Nonlinear estimation in robotics and vision is typically plagued with outliers due to wrong data association or incorrect detections from signal processing and machine learning methods. This article introduces two unifying formulations for outlier-robust estimation, generalized maximum consensus ($\textrm{G}$-$\textrm{MC}$) and generalized truncated least squares ($\textrm{G-TLS}$), and investigates fundamental limits, practical algorithms, and applications. Our first contribution is a proof that outlier-robust estimation is inapproximable: In the worst case, it is impossible to (even approximately) find the set of outliers, even with slower-than-polynomial-time algorithms (particularly, algorithms running in quasi-polynomial time). As a second contribution, we review and extend two general-purpose algorithms. The first, adaptive trimming ($\textrm{ADAPT}$), is combinatorial and is suitable for $\textrm{G}$-$\textrm{MC}$;the second, graduated nonconvexity ($\textrm{GNC}$), is based on homotopy methods and is suitable for $\textrm{G-TLS}$. We extend $\textrm{ADAPT}$ and $\textrm{GNC}$ to the case where the user does not have prior knowledge of the inlier-noise statistics (or the statistics may vary over time) and is unable to guess a reasonable threshold to separate inliers from outliers (as the one commonly used in RANdom SAmple Consensus $(\textrm{RANSAC})$. We propose the first minimally tuned algorithms for outlier rejection, which dynamically decide how to separate inliers from outliers. Our third contribution is an evaluation of the proposed algorithms on robot perception problems: mesh registration, image-based object detection (shape alignment), and pose graph optimization. $\textrm{ADAPT}$ and $\textrm{GNC}$ execute in real time, are deterministic, outperform $\textrm{RANSAC}$, and are robust up to 80-90% outliers. Their minimally tuned versions also compare favorably with the state of the art, even though they do not rely on a noise bound for the inliers.
暂无评论