Manual visual assessment of mangoes has been problematic for the agriculture sector because of its time-consuming nature and inconsistent evaluation and sorting methods. The advent of automated flaw identification usi...
详细信息
ISBN:
(纸本)9798350357974
Manual visual assessment of mangoes has been problematic for the agriculture sector because of its time-consuming nature and inconsistent evaluation and sorting methods. The advent of automated flaw identification using computer vision and machine learning offers a notable shift and improvement in the visual inspection process. A common issue with mangoes is the presence of dark patches, indicative of disease or rot, which negatively affect the appearance and quality of the fruit. This paper introduces a framework using computer vision which utilizes image analysis and machine learning methods to identify these dark spots, taking into account the mangoes' texture. The proposed framework has a simplified configuration and tuning process, enhancing its ease of deployment in real-world applications. This innovation aligns with the advancements in integrating cutting-edge technologies to optimize efficiency and consistency in agricultural practices, thereby contributing to the evolution of smart agriculture and addressing the challenges and opportunities presented by the next wave of industrial revolution.
Conventional imaging and data processing devices are not ideal for mobile artificial visionapplications, such as vision systems for drones and robots, because of the heavy and bulky multilens optics in the camera mod...
详细信息
Diabetic Retinopathy is an eye disorder that affects people suffering from diabetes. Higher sugar levels in blood leads to damage of blood vessels in eyes and may even cause blindness. Diabetic retinopathy is identifi...
详细信息
Diabetic Retinopathy is an eye disorder that affects people suffering from diabetes. Higher sugar levels in blood leads to damage of blood vessels in eyes and may even cause blindness. Diabetic retinopathy is identified by red spots known as microanuerysms and bright yellow lesions called exudates. It has been observed that early detection of exudates and microaneurysms may save the patient's vision and this paper proposes a simple and effective technique for diabetic retinopathy. Both publicly available and real time datasets of colored images captured by fundus camera have been used for the empirical analysis. In the proposed work, grading has been done to know the severity of diabetic retinopathy i.e. whether it is mild, moderate or severe using exudates and micro aneurysms in the fundus images. An automated approach that uses imageprocessing, features extraction and machine learning models to predict accurately the presence of the exudates and micro aneurysms which can be used for grading has been proposed. The research is carried out in two segments;one for exudates and another for micro aneurysms. The grading via exudates is done based upon their distance from macula whereas grading via micro aneurysms is done by calculating their count. For grading using exudates, support vector machine and K-Nearest neighbor show the highest accuracy of 92.1% and for grading using micro aneurysms, decision tree shows the highest accuracy of 99.9% in prediction of severity levels of the disease.
Application of artificial intelligence methods in agriculture is gaining research attention with focus on improving planting, harvesting, post-harvesting, etc. Fruit quality recognition is crucial for farmers during h...
详细信息
Application of artificial intelligence methods in agriculture is gaining research attention with focus on improving planting, harvesting, post-harvesting, etc. Fruit quality recognition is crucial for farmers during harvesting and sorting, for food retailers for quality monitoring, and for consumers for freshness evaluation, etc. However, there is a lack of multi-fruit datasets to support real-time fruit quality evaluation. To address this gap, we present a new dataset of fruit images aimed at evaluating fruit freshness, which addresses the lack of multi-fruit datasets for real-time fruit quality evaluation. The dataset contains images of 11 fruits categorized into three freshness classes, and five well-known deep learning models (ShuffleNet, SqueezeNet, EfficientNet, ResNet18, and MobileNet-V2) were adopted as baseline models for fruit quality recognition using the dataset. The study provides a benchmark dataset for the classification task, which could improve research endeavors in the field of fruit quality recognition. The dataset is systematically organized and annotated, making it suitable for testing the performance of state-of-the-art methods and new learning classifiers. The research community in the fields of computer vision, machine learning, and pattern recognition could benefit from this dataset by applying it to various research tasks such as fruit classification and fruit quality recognition. The study achieved impressive results with the best classifier being ResNet-18 with an overall best performance of 99.8% for accuracy. The study also identified limitations, such as the small size of the dataset, and proposed future work to improve deep learning techniques for fruit quality classification tasks.
Transformer with self-attention has led to the revolutionizing of natural language processing field, and recently inspires the emergence of Transformer-style architecture design with competitive results in numerous co...
详细信息
Transformer with self-attention has led to the revolutionizing of natural language processing field, and recently inspires the emergence of Transformer-style architecture design with competitive results in numerous computer vision tasks. Nevertheless, most of existing designs directly employ self-attention over a 2D feature map to obtain the attention matrix based on pairs of isolated queries and keys at each spatial location, but leave the rich contexts among neighbor keys under-exploited. In this work, we design a novel Transformer-style module, i.e., Contextual Transformer (CoT) block, for visual recognition. Such design fully capitalizes on the contextual information among input keys to guide the learning of dynamic attention matrix and thus strengthens the capacity of visual representation. Technically, CoT block first contextually encodes input keys via a 3 x 3 convolution, leading to a static contextual representation of inputs. We further concatenate the encoded keys with input queries to learn the dynamic multi-head attention matrix through two consecutive 1 x 1 convolutions. The learnt attention matrix is multiplied by input values to achieve the dynamic contextual representation of inputs. The fusion of the static and dynamic contextual representations are finally taken as outputs. Our CoT block is appealing in the view that it can readily replace each 3 x 3 convolution in ResNet architectures, yielding a Transformer-style backbone named as Contextual Transformer Networks (CoTNet). Through extensive experiments over a wide range of applications (e.g., image recognition, object detection, instance segmentation, and semantic segmentation), we validate the superiority of CoTNet as a stronger backbone.
Automatic vision-based inspection systems have played a key role in product quality assessment for decades through the segmentation, detection, and classification of defects. Historically, machine learning frameworks,...
详细信息
Automatic vision-based inspection systems have played a key role in product quality assessment for decades through the segmentation, detection, and classification of defects. Historically, machine learning frameworks, based on hand-crafted feature extraction, selection, and validation, counted on a combined approach of parameterized imageprocessing algorithms and explicated human knowledge. The outstanding performance of deep learning (DL) for vision systems, in automatically discovering a feature representation suitable for the corresponding task, has exponentially increased the number of scientific articles and commercial products aiming at industrial quality assessment. In such a context, this article reviews more than 220 relevant articles from the related literature published until February 2023, covering the recent consolidation and advances in the field of fully-automatic DL-based surface defects inspection systems, deployed in various industrial applications. The analyzed papers have been classified according to a bi-dimensional taxonomy, that considers both the specific defect recognition task and the employed learning paradigm. The dependency on large and high-quality labeled datasets and the different neural architectures employed to achieve an overall perception of both well-visible and subtle defects, through the supervision of fine or/and coarse data annotations have been assessed. The results of our analysis highlight a growing research interest in defect representation power enrichment, especially by transferring pre-trained layers to an optimized network and by explaining the network decisions to suggest trustworthy retention or rejection of the products being evaluated.
This paper presents a novel approach for enhancing vehicle safety and navigation through an integrated system for lane detection, vehicle alignment, and automatic braking using visual feedback. Our proposed system emp...
详细信息
L Color cast, an aberration common in digital images, poses challenges in various imageprocessingapplications, affecting image quality and visual perception. This research investigates diverse methodologies for colo...
详细信息
A group of eye conditions known as glaucoma impair the optic nerve, which is in charge of sending visual data from the eye to the brain. Glaucoma impacts 3.54% of adults aged 40 to 80 around the world. Early detection...
详细信息
A group of eye conditions known as glaucoma impair the optic nerve, which is in charge of sending visual data from the eye to the brain. Glaucoma impacts 3.54% of adults aged 40 to 80 around the world. Early detection of glaucoma is crucial as it can prevent total optic nerve damage, which would cause irreversible vision loss. It is possible for specialists to diagnose glaucoma medically, but treatment options are either expensive or time-consuming and requires ongoing care from medical professionals. There have been numerous initiatives at streamlining all components of the glaucoma categorization process, however these models are challenging for users to comprehend the key predictors, resulting in them being unreliable for use by medical experts. The study uses eye fundus images to classify glaucoma patients using three distinct Deep Learning techniques: Convolutional neural network, Visual Geometry Group 16 (VGG16), and Global Context Network (GC-Net). In addition, several data pre-processing techniques are used to avoid overfitting and achieve high accuracy. This research compares and analyses the performance of various architectures using the aforementioned techniques. The CNN model had the best accuracy of 83% when in contrast to the other deep learning models.
Electron microscopy (EM) enables capturing high resolution images of very small structures in biological and non-biological specimens such as membrane proteins, viruses, subcellular structures, nanoparticles, or mater...
Electron microscopy (EM) enables capturing high resolution images of very small structures in biological and non-biological specimens such as membrane proteins, viruses, subcellular structures, nanoparticles, or material surfaces. Electron microscopy plays a critical role in research, development, and diagnosis in many applications of biological, physical, chemical and material sciences. Thanks to advances in instrumentation, electron microscopy generates large amounts of complex data that is no longer feasible to analyze manually. There is a growing need for development of computational methods and tools for automated analysis of electron microscopy data generated for variety of research fields. Recent advances in artificial intelligence and machine learning, particularly in deep learning have revolutionized imageprocessing and computer vision. In this work, we explored deep learning guided imageprocessing and computer vision solutions to address the growing high-performance processing needs of image data acquired using electron microscopy. The proposed solutions involved novel multi-step, 2D/3D fusion approaches to address the unique challenges of complex, low-contrast, noisy electron microscopy imagery; and selfsupervised, semi-supervised, or meta-learning schemes to address the challenges caused by lack of or limited amounts of labeled training data. These image analysis solutions were used for detection, segmentation, and quantification of various biological structures of interest such as proteins, viruses, mitochondrial or neural structures; and non-biological structures of interest such as carbon nanotube forests. Experiments conducted on the proposed methods showed robust and promising results towards automated, objective, and quantitative analysis of electron microscopy image data, that is of great value for biology, medicine, and material science applications.
暂无评论