Author:
Lu, Yufan
Zhejiang Gongshang University, Hangzhou, China
This research aims to improve the visual target detection and recognition capabilities of shopping robots in various sales environments by optimizing and improving the YOLO algorithm, in order to improve accuracy and ...
We characterized manufacturing-induced defects in 316L stainless steels - fabricated by direct metal laser sintering (DMLS) - and investigated their roles in the fatigue behavior of steel parts. The primary defects ta...
ISBN: 9781665473507 (print)
Industrial automation is undergoing a tremendous change due to the proliferation of concepts such as the Internet of Things (IoT), Cyber-Physical Systems (CPS) and the tactile internet, which enable the interconnection of factory floor devices and the enterprise network on a wider and more fine-grained scale. Vision sensor deployments are gaining great momentum in factories, as they improve the quality and productivity of the systems being inspected. Smart vision sensors [1] remove the need for additional infrastructure to run image processing algorithms and vision applications by running the vision logic directly on the device and controlling or monitoring various field parameters based on the image processing outputs. The industrial vision sensor (IVIS) is an industrial smart camera with a CMOS image sensor [2] and a powerful on-board processing system capable of supporting machine vision applications, which improves product and process quality and thereby yield and profit. IVIS can extract application-specific information from the captured images and make decisions based on the image processing algorithms implemented on the system, realizing a stand-alone, intelligent, decision-making automation system. In this paper we present the design and development of IVIS, its application domains and preliminary test results.
An intelligent optimization algorithm is an advanced computing technique that simulates the biological evolution process in nature or the logical thinking of human beings to find a solution to a problem. In computer...
Micro-expressions (MEs) have emerged as a viable strategy for affective estimation due to their high reliability in emotion detection. In recent years, deep learning methods have been successfully applied to the field ...
Today's computer vision industry makes extensive use of image recognition. A popular method of image recognition is digit recognition. The recognition of handwritten numbers is one of the most well-known difficult...
Solution-processed photodetectors have garnered great attention in applications such as machine vision perception, neuromorphic computing and opto-electronic memory storage. Although such photodetectors offer several advantages, such as ease of fabrication, high scalability, low thermal budget, low-cost processing and multi-modal functionality, they suffer from the major drawback of inferior device performance, such as low responsivity and slow rise time, particularly due to the intrinsically poor crystallinity of the photoactive material. In this work, we demonstrate a solution-processed photodetector with impressive performance at comparatively low processing temperatures (<150 °C) based on a mixed-dimensional heterostructure configuration of 1D TiO2 nanorods and 3D CdS nanoflowers. The TiO2 nanorods were synthesized by a hydrothermal technique, whereas their CdS sensitization was done by chemical bath deposition. Low-cost carbon paste is used as the electrode instead of conventional, costly noble metal electrodes. X-ray diffraction studies validated the excellent crystallinity of the photoactive material even under low-temperature processing conditions. The type-II heterojunction (TiO2 and CdS) photodetector shows an efficient response at zero bias, thus yielding a self-powered device. The detector responds in the UV and visible regions, with an excellent responsivity of 110 mA/W (5 V), 563 A/W (0 V) and a quicker rise time of 81 ms. Despite the simple fabrication scheme and low processing temperatures, the detector exhibited promising figures of merit, which aids the fabrication of novel solution-processed photodetectors.
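For readers unfamiliar with the responsivity figures quoted in this abstract, responsivity is conventionally defined as the photocurrent generated per unit of incident optical power. The relation below is the standard textbook definition, not a formula taken from the abstract. The symbols $I_{\mathrm{light}}$, $I_{\mathrm{dark}}$, $P_{\mathrm{d}}$ and $A$ are introduced here for illustration only, and the abstract does not state the illumination conditions under which the values were measured.

```latex
R \;=\; \frac{I_{\mathrm{ph}}}{P_{\mathrm{in}}}
  \;=\; \frac{I_{\mathrm{light}} - I_{\mathrm{dark}}}{P_{\mathrm{d}}\, A}
```

Here $I_{\mathrm{light}}$ and $I_{\mathrm{dark}}$ are the currents under illumination and in the dark, $P_{\mathrm{d}}$ is the incident power density and $A$ is the active device area, so $R$ carries units of A/W.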
The increasing popularity of attention mechanisms in deep learning algorithms for computer vision and natural language processing has made these models attractive to other research domains. In healthcare, there is a strong need for tools that may improve the routines of clinicians and patients. Naturally, attention-based algorithms were readily adopted for medical applications. However, since healthcare is a domain that depends on high-stakes decisions, the scientific community must consider whether these high-performing algorithms fit the needs of medical applications. With this in mind, this paper extensively reviews the use of attention mechanisms in machine learning methods (including Transformers) for several medical applications, organized by the types of tasks that may be integrated into the work pipelines of the medical domain. This work distinguishes itself from its predecessors by proposing a critical analysis of the claims and potentialities of attention mechanisms presented in the literature through an experimental case study on medical image classification with three different use cases. These experiments focus on the process of integrating attention mechanisms into established deep learning architectures, the analysis of their predictive power, and a visual assessment of the saliency maps generated by post-hoc explanation methods. This paper concludes with a critical analysis of the claims and potentialities presented in the literature about attention mechanisms and proposes future research lines in medical applications that may benefit from these frameworks.
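The abstract does not say which attention variants the reviewed works use. As a point of reference only, the sketch below shows scaled dot-product attention, the form commonly associated with Transformers, in plain Python. The function names and the toy query, key and value vectors are assumptions made for illustration and are not taken from the paper.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def scaled_dot_product_attention(queries, keys, values):
    """Return (outputs, weights) for lists of equal-length vectors.

    Each query is compared with every key; the scaled dot products are
    turned into weights by softmax, and the output is the weighted sum
    of the value vectors.
    """
    d_k = len(keys[0])
    outputs, weights = [], []
    for q in queries:
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
            for k in keys
        ]
        w = softmax(scores)
        out = [
            sum(wj * v[c] for wj, v in zip(w, values))
            for c in range(len(values[0]))
        ]
        weights.append(w)
        outputs.append(out)
    return outputs, weights


# Toy data (hypothetical): one query that matches the first key.
queries = [[1.0, 0.0]]
keys = [[1.0, 0.0], [0.0, 1.0]]
values = [[10.0, 0.0], [0.0, 10.0]]
outputs, weights = scaled_dot_product_attention(queries, keys, values)
print(weights[0], outputs[0])
```

The per-query weights sum to one, which is what makes them usable as a rough indication of where the model "looks"; whether such weights are a faithful explanation is exactly the kind of claim the paper sets out to examine.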
To address the low detection accuracy, high computational complexity and long processing time of visual perception models in complex mining environments, this research designs a visual information perception system for the coal mine comprehensive excavation working face, targeting an edge computing terminal. First, the C3-Fast feature extraction module, the spatial pyramid pooling with cross-stage partial connection (SPPCSPC) pooling module, a bi-directional feature pyramid network and a lightweight decoupled detection head are used to optimize the YOLOv5s model, yielding the FSBD-YOLOv5s multi-object detection model. Second, a pruning and distillation algorithm is used to lighten the FSBD-YOLOv5s model, greatly reducing model complexity while maintaining detection accuracy. Further, the lightweight FSBD-YOLOv5s model is migrated and deployed to the edge computing terminal platform, and the TensorRT engine is used to accelerate model inference. Finally, experiments are carried out on a data set of the coal mine comprehensive excavation working face. The experimental results show that, on the edge computing terminal platform, the parameters and computational volume of the lightweight FSBD-YOLOv5s model are reduced by 50.8% and 34.0%, while its detection accuracy and speed reach 94.0% and 43.7 fps, which satisfies the accuracy and real-time requirements of coal mine engineering applications. In the complex operating scenes of a coal mine, adverse environmental factors such as uneven illumination, high dust and mixed man-machine multi-target scenes sharply reduce the speed and measurement accuracy of traditional visual perception models. To solve these problems, this study proposes building a visual information perception system for the coal mine comprehensive excavation working face on an edge computing terminal, and combines a channel pruning algorithm, a knowledge extraction algorithm and TensorRT acceleration e
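The figures reported in this abstract can be restated in more directly comparable terms. The short sketch below converts the reported 43.7 fps into a per-frame time and the reported 50.8% and 34.0% reductions into the fraction of parameters and computation that remains. The helper names are hypothetical, and the baseline sizes of the model are not given in the abstract, so only relative values are computed.

```python
def frame_time_ms(fps):
    """Average time spent per frame, in milliseconds, at a given frame rate."""
    return 1000.0 / fps


def remaining_fraction(reduction_percent):
    """Fraction of a quantity left after a percentage reduction."""
    return 1.0 - reduction_percent / 100.0


# Values as reported in the abstract.
reported_fps = 43.7
param_reduction_pct = 50.8
compute_reduction_pct = 34.0

print(f"per-frame time: {frame_time_ms(reported_fps):.1f} ms")
print(f"parameters remaining: {remaining_fraction(param_reduction_pct):.1%}")
print(f"computation remaining: {remaining_fraction(compute_reduction_pct):.1%}")
```

In other words, the lightweight model keeps roughly half of the original parameters and about two thirds of the original computation while processing a frame in a little under 23 ms.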
Perceiving the shape and structure of the real three-dimensional world through sensors and cameras is indispensable across various domains. 3D reconstruction technology is dedicated to realizing this process. It serves as a transformative tool, enriching our ability to perceive the genuine shape and stereo structure of objects and scenes in the real world. By combining advanced sensors, image processing algorithms and 3D reconstruction methods, it captures the shape and structural information of targets from multiple perspectives and dimensions, and creates highly realistic 3D models in a virtual environment. With the rapid modernization of agriculture and ongoing technological progress, the demand for more efficient and precise management and monitoring methods in agricultural production is increasing. Traditional observation and measurement methods face challenges such as low efficiency and incomplete data. 3D reconstruction technology provides more accurate and intelligent management tools for smart agriculture. This paper provides a detailed introduction to research progress on 3D reconstruction technology in smart agriculture. It examines the characteristics and development of various sensors and sensing systems and discusses various methods of implementing 3D reconstruction. Unlike industrial environments, agricultural environments and crops are usually complex and variable, so diverse factors must be considered when selecting suitable sensors and reconstruction methods. Therefore, several application areas are summarized, such as agricultural robotics, crop phenotyping, livestock, and the food industry. Finally, the challenges and potential future trends of 3D reconstruction in agriculture are given.