Potato is a widely consumed food worldwide, and its productivity has increased due to new varieties and the use of technologies related to irrigation, nutrition, and soil preparation, among others. However, diseases s...
详细信息
ISBN:
(纸本)9783031530357;9783031530364
Potato is a widely consumed food worldwide, and its productivity has increased due to new varieties and the use of technologies related to irrigation, nutrition, and soil preparation, among others. However, diseases such as late blight disease can often affect the crop, impacting many farmers around the world. As a way to help production, technology in agriculture is increasing. Among the various computational techniques that can be applied, those based on digital imageprocessing associated with machine learning algorithms stand out, producing excellent results. This work aimed to develop a methodology for recognizing late blight disease in potato leaves using digital imageprocessing techniques and machine learning algorithms. It was possible to obtain promising results. The experiments were carried out in a set of images from a public database containing images of healthy and unhealthy leaves (with late blight). We compare the performance of machine learning algorithms using feature vectors obtained with SIFT algorithm and RGB descriptors. The best performance was using the Decision Tree algorithm and SIFT vectors, with 99.24% of accuracy.
This study presents a novel approach for pH estimation in buffer solutions using images of solutions prepared with Hibiscus sabdariffa L. as a natural pH indicator. The images of the solutions, each displaying distinc...
详细信息
This study presents a novel approach for pH estimation in buffer solutions using images of solutions prepared with Hibiscus sabdariffa L. as a natural pH indicator. The images of the solutions, each displaying distinctive colours indicative of their pH levels, were transformed into standardized 200x200-pixel images through the application of imageprocessing techniques. Following this, a pH prediction model was constructed using the Adaptive Boosting regressor algorithm. The pH values of the training data used when training the model were distributed irregularly between 0-14. The models were trained with 94 pictures and 1880 experimental values. In addition, a reliable pre-processing part has been placed into the model using imageprocessing techniques, allowing test data to be obtained in any desired environment. The obtained training and test data were separated from noise parameters, affecting the prediction results negatively. A smartphone application based on the model has been developed and made available to everyone. This innovative methodology bridges the gap between traditional pH measurement techniques and computer vision, offering amore accessible and eco-friendly means of pH assessment. The practical applications of this research extend to various fields, including environmental monitoring, agriculture, and educational settings.
In this study, an ad-hoc imageprocessing pipeline has been developed and proposed for the purpose of semantically segmenting wheat kernel data acquired through near-infrared hyperspectral imaging (HSI). The Gaussian ...
详细信息
In this study, an ad-hoc imageprocessing pipeline has been developed and proposed for the purpose of semantically segmenting wheat kernel data acquired through near-infrared hyperspectral imaging (HSI). The Gaussian Mixture Model (GMM), characterized as a soft clustering method, has been employed for this task, yielding noteworthy results in both kernel and germ segmentation. A comparative analysis was conducted, wherein GMM was compared with two hard clustering methods, hierarchical clustering and k-means, as well as other common clustering algorithms prevalent in food HSI applications. Notably, GMM exhibited the highest accuracy, with a Jaccard index of 0.745, surpassing hierarchical clustering at 0.698 and k-means at 0.652. Furthermore, the spectral variations observed in wheat kernel topology can be used for semantic image segmentation, especially in the context of selecting the germ portion within the wheat kernels. These findings carry practical significance for professionals in the fields of hyperspectral imaging (HSI) and machinevision, particularly for food product quality assessment and real-time inspection.
The task of image style transfer is to automatically redraw an input image in the style of another image, such as an artist's painting. The disadvantage of conventional stylization algorithms is the uniqueness of ...
详细信息
The task of image style transfer is to automatically redraw an input image in the style of another image, such as an artist's painting. The disadvantage of conventional stylization algorithms is the uniqueness of result. If the user is not satisfied with the way the style was transferred, he has no option to remake the stylization. The paper provides an overview of existing style transfer methods that generate diverse results after each run and proposes two new methods. The first method enables diversity by concatenating a random vector into inner image representation inside the neural network and by reweighting image features accordingly in the loss function. The second method allows diverse stylizations by passing the stylized image through orthogonal transformations, which impact the way the target style is transferred. These blocks are trained to replicate patterns from additional pattern images, which serve as additional input and provide an interpretable way to control stylization variability for the end user. Qualitative and quantitative comparisons demonstrate that both methods are capable to generate different stylizations with higher variability achieved by the second method. The code of both methods is available on github.
India holds the title of being the top banana producer globally, contributing approximately 25% of the total banana production. However, exporting it can be a challenge because of its shelf-life. To propose the best p...
详细信息
ISBN:
(纸本)9783031581731;9783031581748
India holds the title of being the top banana producer globally, contributing approximately 25% of the total banana production. However, exporting it can be a challenge because of its shelf-life. To propose the best possible shelf-life extension methodology, it is important to classify based on the banana varieties and ripening stages to ensure sustainable growth and nutritional value. There are still not enough data sets with different varieties of bananas and their respective ripening stages. A review of research publications from the last five years has been conducted using electronic databases like Scopus, Google Scholar, and Research-Gate, as well as the details of publicly accessible dataset repository sites. The dataset captures images of different varieties of banana fruit as well as its respective different stages of ripening. Banana varieties considered include Robusta (MusaAA), Dwarf Cavendish (Musaacuminata), Nanjangud bananas, and Red bananas (Musa acuminata). The dataset contains over 41,900 processed images. In this paper, the authors provide researchers with an opportunity to develop and investigate machine learning and deep learning algorithms that are used to predict and extend the shelf life of banana fruits.
The absence of standardized evaluation methodologies for single-layer dimensional accuracy significantly hinders the broader implementation of direct ink writing (DIW) technology. Addressing the critical need for prec...
详细信息
The absence of standardized evaluation methodologies for single-layer dimensional accuracy significantly hinders the broader implementation of direct ink writing (DIW) technology. Addressing the critical need for precision non-contact assessment in DIW fabrication, this study develops a novel machinevision-based framework for dimensional accuracy evaluation. The methodology encompasses three key phases: (1) establishment of an optimized hardware configuration with integrated imageprocessing algorithms;(2) comprehensive investigation of camera calibration protocols, advanced image preprocessing techniques, and high-precision contour extraction methods;and (3) development of an iterative closest point (ICP) algorithm-enhanced evaluation system. The experimental results demonstrate that our machinevision system achieves 0.04 mm x 0.04 mm spatial resolution with the ICP convergence threshold optimized to 0.001 mm. The proposed method shows an 80% improvement in measurement accuracy (0.001 mm) compared to conventional approaches. Process parameter optimization experiments validated the system's effectiveness, showing at least 76.3% enhancement in printed layer dimensional accuracy. This non-contact evaluation solution establishes a robust framework for quantitative quality control in DIW applications, providing critical insights for process optimization and standardization efforts in additive manufacturing.
Industry 4.0, the digitalization of manufacturing promises to lead to lowered cost, efficient processes and even discovery of new business models. However, many of the enterprises have huge investments in legacy machi...
详细信息
ISBN:
(纸本)9783031702587;9783031702594
Industry 4.0, the digitalization of manufacturing promises to lead to lowered cost, efficient processes and even discovery of new business models. However, many of the enterprises have huge investments in legacy machines which are not 'smart'. In this study, we thus designed a cost-efficient solution to retrofit a legacy conveyor belt-based cutlery washing machine with a commodity web camera. We then applied computer vision (using both traditional imageprocessing and deep learning techniques) to infer the speed and utilization of the machine. We detailed the algorithms that we designed for computing both speed and utilization. With the existing operational constraints of our client, frequent re-training of the deep learning model for object detection is not feasible. Thus, we compared the generalizability of the two techniques across 'unseen' cutleries and found traditional imageprocessing to be generalizable across 'unseen' images. Our proposed final solution uses traditional imageprocessing for computation of utilization but a hybrid of traditional imageprocessing and deep learning model for speed computation as it is more reliable. Our client has implemented our proposed solution for one conveyor belt-based cutlery washing machine and will be planning to scale this to multiple conveyor belt-based cutlery washing machines.
Visual question answering (VQA) is a problem that researchers in both computer vision and natural language processing are interested in studying. In VQA, a system is given an image and a question in natural language a...
详细信息
Visual question answering (VQA) is a problem that researchers in both computer vision and natural language processing are interested in studying. In VQA, a system is given an image and a question in natural language about that image. The VQA system is then expected to answer in natural language. To find the right answer, a VQA algorithm may need to use common sense to make sense of the information in the image and external knowledge. In this paper, we discuss some of the main ideas behind VQA systems and provide a comprehensive literature survey of the current state of the art in VQA and visual reasoning from four perspectives: problem definition and challenges, approaches, existing datasets, and evaluation matrices. We conclude our survey with a discussion and some potential future research directions in this area to generate new ideas and creative approaches to solving current problems and developing new applications.
In aerial image classification, integrating advanced vision transformers with optimal preprocessing techniques is pivotal for enhancing model performance. This study presents SwinSight, a novel hierarchical vision tra...
详细信息
Pill image recognition by machinevision can reduce the risk of taking the wrong medications, a severe healthcare problem. Automated dispensing machines or home applications both need reliable imageprocessing techniq...
详细信息
Pill image recognition by machinevision can reduce the risk of taking the wrong medications, a severe healthcare problem. Automated dispensing machines or home applications both need reliable imageprocessing techniques to compete with the problem of changing viewing conditions, large number of classes, and the similarity in pill appearance. The problem is attacked with a multi-stream, two-phase metric embedding neural model. To enhance the metric learning procedure, dynamic margin setting is introduced into the loss function. Moreover, it is shown that besides the visual features of drug samples, even free text of drug leaflets (processed with a natural language model) can be used to set the value of the margin in the triplet loss and thus increase the recognition accuracy of testing. Thus, besides using the conventional metric learning approach, the given discriminating features can be explicitly injected into the metric model using the NLP of the free text of pill leaflets or descriptors of images of selected pills. The performance on two datasets is analysed and a 1.6% (two-sided) and 2.89% (one-sided) increase in Top-1 accuracy on the CURE dataset is reported compared to existing best results. The inference time on CPU and GPU makes the proposed model suitable for different kinds of applications in medical pill verification;moreover, the approach applies to other areas of object recognition where few-shot problems arise. The proposed high-level feature injection method (into a low-level metric learning model) can also be exploited in other cases, where class features can be well described with textual or visual cues.
暂无评论