Among the instruments for early detection are those for analysing gases in people's faeces, as it has been found that the presence of the intensity of certain compounds is related to the presence of cancer, diabet...
详细信息
ISBN:
(纸本)9783031133213;9783031133206
Among the instruments for early detection are those for analysing gases in people's faeces, as it has been found that the presence of the intensity of certain compounds is related to the presence of cancer, diabetes or Alzheimer. The availability of sensor devices in recent years, together with the Internet of Things (IoT) paradigm, has made it possible to create low-cost systems that allow initial solutions to be tested for various real applications. Therefore, the aim of this contribution is to present the use case of a stool gas monitoring system in order to be the beginning of a solution for the early detection of this type of diseases. The proposed prototype integrates a thermal camera and MOX sensors to collect temperature and gas measurements immediately after a person has made deposition in their home. The measurements are monitored through an IoT platform and stored on a cloud server.
Curved ceramic materials possess corrosion-resistant, heat-resistant, and exceptional hardness, rendering them indispensable in many critical fields, including automotive, aerospace, biomedical, architectural, and art...
详细信息
Curved ceramic materials possess corrosion-resistant, heat-resistant, and exceptional hardness, rendering them indispensable in many critical fields, including automotive, aerospace, biomedical, architectural, and artistic applications. The detection of defects in these materials is of paramount importance;but traditional inspection methods fall short due to their unique structural characteristics. Presently, the assessment of curved ceramics predominantly relies on manual inspection, which is fraught with subjective errors and inefficiencies. This paper provides a comprehensive overview of the advancements in machinevision technology as applied to defect detection across various types of curved ceramic materials, including industrial, daily-use, and artistic ceramics. The paper highlights the key factors influencing the development of surface defect detection technology in curved ceramics. It summarises the advantages and disadvantages of traditional machinevision and deep learning approaches for defect detection in curved ceramics and analyzes their respective application scopes. Concluding with an examination of the technical challenges inherent in defect detection, the paper also outlines a prospective trajectory for the advancement of inspection technology in curved ceramic materials.
Automatic image description generation is an active research area in Computer vision and Natural Language processing. Objects, their attributes, actions, and spatial relationships are identified in the image descripti...
详细信息
Automatic image description generation is an active research area in Computer vision and Natural Language processing. Objects, their attributes, actions, and spatial relationships are identified in the image description generation system. Earlier, these systems used classical machine learning approaches. Later majority of these works follow deep learning strategies. The essential goal of these systems is to produce syntactically and semantically correct sentences. This review aims to synthesize the studies conducted from 2010 to 2023 to get a deeper view of various image description generation systems and their applications. The prominent contribution of this review is that it covers the different aspects of image captioning systems, such as the methods used, the various applied domains, evaluation measures, and the datasets used. A single synthesized study directs scholars regarding developing image captioning systems to date utilizing machine learning approaches. It also offers suggestions for researchers in this sector for the future. image captioning is applied in many fields like natural images, medical images, remote sensing images, videos, etc. This review paper reviews the various taxonomies of image description generation systems. We also analyzed multiple methods used in the architecture of image captioning systems. The datasets and the evaluation metrics used in these systems are discussed. We studied the performance of the system under various circumstances.
With the development of computer vision, natural language processing, and machine learning technologies, a great number of joint visual-textual applications, such as image captioning, visual question answering, visual...
详细信息
With the development of computer vision, natural language processing, and machine learning technologies, a great number of joint visual-textual applications, such as image captioning, visual question answering, visual grounding, image-text cross-modal retrieval, and text-based image generation, emerged in recent years. They leverage machine learning models as the core module to tackle problems related to the intersection of vision and language. For all these joint visual-textual applications, vision and text modalities interact in three fundamental modes. The first is the "joint learning" mode, which considers both modalities as parallel inputs to jointly predict a target. The second is the "retrieval" mode, which explores the correspondence relation between the two modalities and aims to find the corresponding items that belong to different modalities. The third is the "generation" mode, which focuses on creating and modifying the items of one modality using the input of another modality as guidance. For all the joint visual-textual applications of the three modes, how to effectively "capture" and "attend" to the significant information of the visual and textual inputs is crucial. This thesis develops new "capturing" and "attending" methods to effectively model joint visual-textual applications in the three modes. For the first mode, we focus on a significant social media classification application. A novel bilateral attention model is proposed to classify whether a WeChat Moment is related to business or not based on the Moment's image and text information. For the second mode, we comprehensively investigate the application of image-text cross-modal retrieval on both general and domain-specific tasks. We first explore the general image-text matching task and propose approaches that capture high-performance cross-modal information. We then focus on two domain-specific tasks related to font retrieval and person search. We design methods to further utilize the specia
This article centers on foreground-background image segmentation techniques, with a particular focus on applications using two shifted or highly similar images. Our work critically examines prevalent imageprocessing ...
详细信息
ISBN:
(数字)9798350375428
ISBN:
(纸本)9798350375435
This article centers on foreground-background image segmentation techniques, with a particular focus on applications using two shifted or highly similar images. Our work critically examines prevalent imageprocessing methodologies, leading to the development and implementation of an innovative algorithm for more precise segmentation. We initially unpack the concepts of image segmentation and stereo vision, setting the stage for an in-depth exploration of various segmentation methods and techniques. Our investigation reveals that synergistic combinations of methods frequently produce more refined outcomes. We subsequently propose a practical solution, juxtaposing our approach with pre-existing alternatives to delineate its comparative strengths, weaknesses, and unique attributes.
It is essential to find creative solutions to the growing urban problems of traffic congestion and parking issues. By using real-time traffic camera photos with imageprocessing and deep learning algorithms to compute...
详细信息
This paper presents a secure reconfigurable hierarchical hardware architecture at the pixel and region level for smart image sensors to accelerate machinevisionapplications. The design maintains hierarchical process...
详细信息
This paper presents a secure reconfigurable hierarchical hardware architecture at the pixel and region level for smart image sensors to accelerate machinevisionapplications. The design maintains hierarchical processing that begins at the pixel level. It aims to reduce the computational burden on the sequential processor and increases the confidentiality of the sensor. We achieve this goal by preprocessing the data in parallel with event-based processing within the sensor and extract the local features, which are then forwarded to an encryption module. After that, an external processor can obtain the encrypted features to complete the vision application. This approach significantly accelerates the vision application by executing the low-level and mid-level imageprocessingapplications and simultaneously by reducing the data volume at the sensor level. The secure hardware architecture enables the vision application to perform in real-time with reliability. This hierarchical processing breaks the traditional sequential imageprocessing and introduces parallelism for machinevisionapplications. We evaluate the design in FPGA and achieve the GDSII file in the ASIC platform at 800MHz. Simulation results show that the area overhead and power penalty for adding reconfiguration features stay in an acceptable range. Besides, removing redundant information, 84.01%, and 94.31% dynamic power can be saved at each pixel-level and region-level, respectively.
A key element of quality control in manufacturing is Product Inspection, which is a process that allows for verifying a product's quality enabled by activities such as measuring, examining, and testing one or more...
详细信息
In a complex semiconductor manufacturing environment, critical dimension scanning electron microscope (CD-SEM) images are captured at metrology step to monitor structural measurements and detect anomaly to meet the st...
详细信息
ISBN:
(数字)9798331531850
ISBN:
(纸本)9798331531867
In a complex semiconductor manufacturing environment, critical dimension scanning electron microscope (CD-SEM) images are captured at metrology step to monitor structural measurements and detect anomaly to meet the stringent process control requirements. This paper focuses on advanced CD-SEM imageprocessing and anomaly detection using machine learning and generative AI models, which include computer vision (CV) imageprocessing, Residual Neural Network (ResNet) deep learning and Generative Adversarial Network (GAN) model for various use cases. The applications of these models during in-line monitoring are crucial to identify potential process issues for yield and quality improvement.
Object classification and detection involve numerous applications like imageprocessing, picture retrieval, security and surveillance, video communication, robot vision and observation. They are often classified based...
详细信息
暂无评论