As an important research topic in recent years,semantic segmentation has been widely applied to image understanding problems in various *** the successful application of deep learning methods in machinevision,the sup...
详细信息
As an important research topic in recent years,semantic segmentation has been widely applied to image understanding problems in various *** the successful application of deep learning methods in machinevision,the superior performance has been transferred to agricultural imageprocessing by combining them with traditional *** segmentation methods have revolutionized the development of agricultural automation and are commonly used for crop cover and type analysis,pest and disease identification,*** frst give a review of the recent advances in traditional and deep learning methods for semantic segmentation of agricultural images according to different segmentation *** we introduce the traditional methods that can effectively utilize the original image information and the powerful performance of deep learningbased ***,we outline their applications in agricultural image *** our literature,we identify the challenges in agricultural image segmentation and summarize the innovative developments that address these *** robustness of the existing segmentation methods for processing complex images still needs to be improved urgently,and their generalization abilities are also *** particular,the limited number of labeled samples is a roadblock to new developed deep learning methods for their training and *** this,segmentation methods that augment the dataset or incorporate multimodal information enable deep learning methods to further improve the segmentation *** review provides a reference for the application of image semantic segmentation in the field of agricultural informatization.
video coding has traditionally been developed to support services such as video streaming, videoconferencing, digital Tv, and so on. The main intent was to enable human viewing of the encoded content. However, with th...
详细信息
video coding has traditionally been developed to support services such as video streaming, videoconferencing, digital Tv, and so on. The main intent was to enable human viewing of the encoded content. However, with the advances in deep neural networks (DNNs), encoded video is increasingly being used for automatic video analytics performed by machines. In applications such as automatic traffic monitoring, analytics such as vehicle detection, tracking and counting, would run continuously, while human viewing could be required occasionally to review potential incidents. To support such applications, a new paradigm for video coding is needed that will facilitate efficient representation and compression of video for both machine and human use in a scalable manner. In this manuscript, we introduce an end-to-end learnable video codec that supports a machinevision task in its base layer, while its enhancement layer, together with the base layer, supports input reconstruction for human viewing. The proposed system is constructed based on the concept of conditional coding to achieve better compression gains. Comprehensive experimental evaluations conducted on four standard video datasets demonstrate that our framework outperforms both state-of-the-art learned and conventional video codecs in its base layer, while maintaining comparable performance on the human vision task in its enhancement layer.
The mitigation of material defects from additive manufacturing (AM) processes is critical to reliability in their fabricated parts and is enabled by modeling the complex relations between available build monitoring si...
详细信息
The mitigation of material defects from additive manufacturing (AM) processes is critical to reliability in their fabricated parts and is enabled by modeling the complex relations between available build monitoring signals and final mechanical performance. To this end, the present study investigates a machine learning approach for predicting mechanical properties for Ti-6Al-4v fabricated through laser powder bed fusion (PBF-LB) AM using in situ photodiode processing signals. Samples were fabricated under different processing parameters, varying laser powers and scan speeds for the purpose of probing a wide range of microstructure and property variations. Photodiode data were collected during fabrication, later to be arranged in image format and extracted to information-dense vectors by the transferal of deep convolutional neural network (DCNN) structures and weights pretrained on a large computer vision benchmark image database. The extracted features were then used to train and test a newly designed regression model for mechanical properties. Average cross-validation accuracies were found to be 98.7% (r(2) value of 0.89) for the prediction of ultimate tensile strength, which ranged from 900 to 1150 MPa in the samples studied, and 93.1% (r(2 )value of 0.96) for the prediction of elongation to fracture, which ranged from 0 to 17%. Thus, with high accuracy and hardware-accelerated inference speeds, we demonstrate that a transfer learning framework can be used to predict strength and ductility of metal AM components based on processing signals in PBF-LB, illustrating a potential route toward real-time closed-loop control and process optimization of PBF-LB in industrial applications.
Steel plays an important role in industry, and the surface defect detection for steel products based on machinevision has been widely used during the last two decades. This paper attempts to review state-of-art of vi...
详细信息
Steel plays an important role in industry, and the surface defect detection for steel products based on machinevision has been widely used during the last two decades. This paper attempts to review state-of-art of vision-based surface defect inspection technology of steel products by investigating about 170 publications. This review covers the overall aspects of vision-based surface defect inspection for steel products including hardware system, automated vision-based inspection method, existing problems and latest development. The types of steel product surface defects composition of visual inspection system are briefly described, and image acquisition system is introduced as well. The imageprocessing algorithms for surface defect detection of steel products are reviewed, including image pre-processing, region of interest (ROI) detection, image segmentation for ROI, feature extraction and selection and defect classification. The important problems such as small sample and real time of steel surface defect detection are discussed. Finally, the challenge and development trend of steel surface defect detection are prospected.
This article aims to identify core research directions and provide a comprehensive overview of major advancements in the field of hypercomplex signal and imageprocessing techniques using network graph theory. The met...
详细信息
This article aims to identify core research directions and provide a comprehensive overview of major advancements in the field of hypercomplex signal and imageprocessing techniques using network graph theory. The methodology employs community detection algorithms on research networks to uncover relationships among researchers and topic fields in the hypercomplex domain. This is accomplished through a comprehensive academic database search and metadata analysis from pertinent papers. The article focuses on the utility of these techniques in various applications and the value of mathematically rich frameworks. The results demonstrate how optimized network-based approaches can determine common topics and emerging lines of research. The article identifies distinct core research directions, including significant advancements in image/video processing, computer vision, signal processing, security, navigation, and machine learning within the hypercomplex domain. Current trends, challenges, opportunities, and the most promising directions in hypercomplex signal and imageprocessing are highlighted based on a thorough literature analysis. This provides actionable insights for researchers to advance this domain.
images are a vital part of our everyday life and imageprocessing is the heart of all the modern technologies, including machinevision, artificial intelligence, robotics, deep learning. It would not be wrong to say t...
images are a vital part of our everyday life and imageprocessing is the heart of all the modern technologies, including machinevision, artificial intelligence, robotics, deep learning. It would not be wrong to say that imageprocessing is one of the many reasons for achieving success in any industrial domain, whether it be medical, food, textile, or any other automation industry. It is next to impossible to work in these domains without having sufficient knowledge and skills about imageprocessing techniques. In this thesis document you will find the significance of imageprocessing used in three diverse projects. Each one of the projects is described as a separate chapter in this document. The first project is focused on reducing the power consumption in OLED-based devices. Actually there are two main goals of this project, first one, as the name suggests, is to minimize the power consumed by an OLED device to display images, and the second goal is to simultaneously enhance the color contrasts in images. OLED display panels have become increasingly popular in recent years, thanks to their numerous advantages over the traditional LCD displays. Power consumption in OLED displays depends on the contents where as the backlight is responsible for power consumption in LCD displays. This image-dependent or content-dependent power consumption model of OLED displays have encouraged numerous researchers to create possibilities for reducing the power consumption in OLED-based devices. One such possibility has been explored in this Ph. D. research work. Another industrial application has been presented in the second part of the thesis document. It is a part of the "Food Digital Monitoring" project, funded by Regione Piemonte. The major aim of this project is to identify the healthy and contaminated hazelnuts by using fluorescence and spectral imaging techniques. Two types of contamination are discussed in this work, one, caused by bacterial and fungal infections, called "rot
This study presents a vision-based closed-loop tracking system designed specifically for robotic laser beam welding of curved and closed square butt joints. The proposed system is compared against 11 existing solution...
详细信息
This study presents a vision-based closed-loop tracking system designed specifically for robotic laser beam welding of curved and closed square butt joints. The proposed system is compared against 11 existing solutions reported in the literature, which employ various sensor principles for the same application. The system employs a non-contact, non-intrusive machinevision approach, seamlessly integrated into the laser beam welding head to mitigate challenges associated with sensor forerun. Key features include an off-axis LED illumination, an optical filter, and a movable actuator, facilitating real-time imageprocessing and closed-loop control during the welding process. Experimental validation was conducted on stainless-steel plates with complex closed square butt joints. The system achieved a mean absolute joint-to-beam offset of 0.14 mm across four test cases, with a maximum offset of 0.85 mm, demonstrating its robustness and precision. Comparative analysis underscores the proposed method's advantages, showcasing its potential for industrial applications in laser beam welding of geometrically challenging joints.
Human skin classification is an essential task for several machinevisionapplications such as human -machine interfaces, people/object tracking, and classification. In this paper, we describe a hybrid CMOS/memristor ...
详细信息
Human skin classification is an essential task for several machinevisionapplications such as human -machine interfaces, people/object tracking, and classification. In this paper, we describe a hybrid CMOS/memristor vision sensor architecture embedding skin detection over a wide dynamic range. In -sensor RGB to rg -chromaticity colorspace conversion is executed on -the -fly through a pixel -level automatic exposure time control. Each pixel of the array delivers two pre -filtered analog signals, the r and g values, suitable for being efficiently classified as skin or non -skin through an analog memristive neural network (NN), without the need for any further signal processing. Moreover, we study the NN performance and theorize how it should be added in the hardware. The skin classifier is organized in an array of column -level memristor-based NN to exploit the nano -scale device characteristics and non-volatile analog memory capabilities, making the proposed sensor architecture highly flexible, customizable for various use -case scenarios, and low -power. The output is a skin bitmap that is robust against variations of the illuminant color and intensity. (c) 2024 Optica Publishing Group
The contemporary industry has witnessed a significant transformative development with the integration of artificial intelligence (AI) in various industrial systems, resulting in an enhanced automation for heightened p...
详细信息
The contemporary industry has witnessed a significant transformative development with the integration of artificial intelligence (AI) in various industrial systems, resulting in an enhanced automation for heightened productivity and efficiency. However, mastering this level of automation can be challenging for some applications, such as manufacturing inspection, which can be delicate while maintaining a precise cadence for an in-line manufacturing scale. In this paper, a systematic machinevision-based approach for on-machine inspection is proposed in order to automate and improve inspection process towards computer numerical control (CNC) machined parts. The approach incorporates remapping algorithm and imageprocessing operations to accurately extract desired features. Subsequently, these features will undergo dimensional inspection based on their generated point clouds. Tests were applied on a sample part using a complementary metal-oxide-semiconductor (CMOS) camera mounted on the spindle of 5-axis CNC machining center. The paper explores numerous aspects related to different stages of the approach and their impact on the resulting inspected features evaluations. It also highlights significant findings regarding critical factors for conducting well-structured experiments at various stages. Promising results have shown the significance of the presented work regarding industrial automation technology, ultimately improving manufacturing efficiency throughout the production line.
Conventional photography can only provide a two-dimensional image of the scene, whereas emerging imaging modalities such as light field enable the representation of higher dimensional visual information by capturing l...
详细信息
Conventional photography can only provide a two-dimensional image of the scene, whereas emerging imaging modalities such as light field enable the representation of higher dimensional visual information by capturing light rays from different directions. Light fields provide immersive experiences, a sense of presence in the scene, and can enhance different vision tasks. Hence, research into light field processing methods has become increasingly popular. It does, however, come at the cost of higher data volume and computational complexity. With the growing deployment of machine-learning and deep architectures in imageprocessingapplications, a paradigm shift toward learning-based approaches has also been observed in the design of light field processing methods. various learning-based approaches are developed to process the high volume of light field data efficiently for different vision tasks while improving performance. Taking into account the diversity of light field vision tasks and the deployed learning-based frameworks, it is necessary to survey the scattered learning-based works in the domain to gain insight into the current trends and challenges. This paper aims to review the existing learning-based solutions for light field imaging and to summarize the most promising frameworks. Moreover, evaluation methods and available light field datasets are highlighted. Lastly, the review concludes with a brief outlook for future research directions.
暂无评论