Object detection in computer vision has been a significant research area for the past decade. Identifying objects with multiple classes from an image has attracted great attention because it can effectively classify a...
详细信息
Object detection in computer vision has been a significant research area for the past decade. Identifying objects with multiple classes from an image has attracted great attention because it can effectively classify and detect the image. A multi-class object detection system from a video or image is quite challenging because of the errors obtained by the location classification process. Our proposed system generalized a hybrid convolutional neural network (H-CNN) model is used to realize the user object from an image. The proposed work integrates pre-processing, object localization, feature extraction and classification. First, the input image is pre-processed with Gaussian filtering to remove noise and improve the image quality. After completing the pre-processing procedure, it is subjected to object localization. Here the object in the image is localized using Grid Guided Localization (GGL). In the feature extraction phase, the model would be pre-trained with AlexNet. Here the AlexNet are generalized as fully connected (FC) layers. Finally, the Softmax layer in the AlexNet architecture is replaced by SvR (Support vector Regression), which acts as a classifier for identifying the object class. The classification loss is minimized using the Improved Grey Wolf (IGW) optimization algorithm. Thus, the H-CNN model can quickly classify and label the objects from images. It also offers improved classification performance in managing effective training time. The proposed work will be implemented in PYTHON. Therefore, the model would be built using various datasets such as MIT-67, PASCAL vOC2010, MS (Microsoft)-COCO, and MSRC to effectively train and classify the object. The proposed H-CNN achieved improved results with MIT-67 (96.02%), PASCAL vOC2010 (95.04%), MSRC (97.37%), and MS COCO (94.53%). The results obtained by H-CNN proved that the excluded result of Mean Average Precision (mAP), Precision, Accuracy, Recall values and F1-Score achieved better results than with re
Nutrition is an important aspect of public health, and in recent years, there has been increasing interest in the nutritional information of food. However, processing this information can be a challenging task due to ...
详细信息
ISBN:
(纸本)9798350336672
Nutrition is an important aspect of public health, and in recent years, there has been increasing interest in the nutritional information of food. However, processing this information can be a challenging task due to the large amounts of data involved. machine learning (ML) has emerged as a useful tool to address this challenge. In this paper, we present a data resource that uses the FoodData Central (FDC) nutrient database to explore the combination of food images, nutritional information, and text with ML. We begin by providing an overview of machine learning and its applications in nutrition research, including the use of ML algorithms to identify food intake patterns, predict nutrient intakes, and evaluate dietary guidelines. We then describe the features and applications of Inception-v3, Inception-v4, and MobileNetv2 in ML, highlighting how these models can be used to extract nutritional information from food images. To further explore the potential of ML in nutrition research, we developed a quick search app that integrates images, text, and nutritional information. This app uses image recognition algorithms to identify food items in pictures, and text processing techniques to extract food information from text data. Users can simply take a picture of a food item and the app will provide the details of its nutritional content. This app can be used to facilitate the study of food and nutrition information and help promote healthier eating habits. In conclusion, the development of data resources and apps that use ML algorithms can be particularly helpful in processing large amounts of nutrition data and making it more accessible to the public. By harnessing the power of ML, we can advance our understanding of the relationship between diet and health, and ultimately work towards improving public health outcomes.
The precise detection of plant centres is important for growth monitoring, enabling the continuous tracking of plant development to discern the influence of diverse factors. It holds significance for automated systems...
详细信息
ISBN:
(纸本)9798350372977;9798350372984
The precise detection of plant centres is important for growth monitoring, enabling the continuous tracking of plant development to discern the influence of diverse factors. It holds significance for automated systems like robotic harvesting, facilitating machines in locating and engaging with plants. In this paper, we explore the YOLOv4 (You Only Look Once) real-time neural network detector for plant centre detection. Our dataset, comprising over 12,000 images from 151 Arabidopsis thaliana accessions, is used to fine-tune the model. Evaluation of the dataset reveals the model's proficiency in centre detection across various accessions, boasting an mAP of 99.79% at a 50% IoU threshold. The model demonstrates real-time processing capabilities, achieving a frame rate of approximately 50 FPS. This outcome underscores its rapid and efficient analysis of video or image data, showcasing practical utility in time-sensitive applications.
The verification of IP core with imageprocessing algorithm is important for SoC and FPGA application in the field of machinevision. This paper proposes a verification framework with general purpose, real-time perfor...
详细信息
The face is a critical perspective in predicting human feelings and moods. More frequently than not human senti-ments are extricated with the utilization of the camera. various applications are being made based on the...
详细信息
In the era of digitization and big data, the world is inundated with an ever-growing volume of visual content, be it images or videos. As organizations strive to harness the potential of these multimedia data sources,...
详细信息
In recent years there has been an increased interest towards edge computing, i.e., computing performed on distributed devices as opposed to centralized high-power hubs. Examples of edge computing would be the local im...
详细信息
In recent years there has been an increased interest towards edge computing, i.e., computing performed on distributed devices as opposed to centralized high-power hubs. Examples of edge computing would be the local imageprocessing performed on Unmanned Autonomous vehicles (UAv's) or the specialized machinevision systems on drones. These edge computing applications require schemes that are efficient with power and memory and typically must operate real-time. Many state-of-the-art imageprocessing solutions that employ advanced optimization and deep neural networks (NNs) achieve impressive benchmark results, but are computationally demanding and thus on many occasions, impractical. The additional requirement for a range of applications is noise robustness or the ability to work in (extreme) low-light conditions; reasonable quality image or accurate object classification may be critical when there is low light flux or when the environment is over-saturated with other signals. Here, we approach edge computing with a combination of optical preprocessing and shallow NN and we show that this hybrid approach greatly reduces the computational requirements. For low-SNR imaging, we develop a technique that reconstructs objects and scenes from their Fourier-plane images. The optical preprocessing is performed via encoded diffraction with optical vortex singularities. The optical vortex encoder achieves differentiation of the already-compressed Fourier-plane patterns and enables facile inverse inference of the original object scene. We demonstrate that our method is robust to noise. And for a simple NN architecture (one or two layers), leads to generalization, i.e., reconstruction of objects from classes that are greatly different from the ones the NN was trained on. Our research identifies strong potential for swift hybrid imaging systems with edge computing applications and highlights the valuable function of the vortex encoder for spectral differentiation.
The fashion industry’s traditional price-setting methods, based on historical sales and Fashion Week trends, are inadequate in the digital era. Rapid changes in collections and consumer preferences necessitate advanc...
详细信息
Hyperspectral images include information from a wide range of spectral bands deemed valuable for computer visionapplications in various domains such as agriculture, surveillance, and reconnaissance. Anomaly detection...
详细信息
Hyperspectral images include information from a wide range of spectral bands deemed valuable for computer visionapplications in various domains such as agriculture, surveillance, and reconnaissance. Anomaly detection in hyperspectral images has proven to be a crucial component of change and abnormality identification, enabling improved decision-making across various applications. These abnormalities/anomalies can be detected using background estimation techniques that do not require the prior knowledge of outliers. However, each hyperspectral anomaly detection (HS-AD) algorithm models the background differently. These different assumptions may fail to consider all the background constraints in various scenarios. We have developed a new approach called Greedy Ensemble Anomaly Detection (GE-AD) to address this shortcoming. It includes a greedy search algorithm to systematically determine the suitable base models from HS-AD algorithms and hyperspectral unmixing for the first stage of a stacking ensemble and employs a supervised classifier in the second stage of a stacking ensemble. It helps researchers with limited knowledge of the suitability of the HS-AD algorithms for the application scenarios to select the best methods automatically. Our evaluation shows that the proposed method achieves a higher average F1-macro score with statistical significance compared to the other individual methods used in the ensemble. This is validated on multiple datasets, including the Airport-Beach-Urban (ABU) dataset, the San Diego dataset, the Salinas dataset, the Hydice Urban dataset, and the Arizona dataset. The evaluation using the airport scenes from the ABU dataset shows that GE-AD achieves a 14.97% higher average F1-macro score than our previous method (HUE-AD), at least 17.19% higher than the individual methods used in the ensemble, and at least 28.53% higher than the other state-of-the-art ensemble anomaly detection algorithms. As using the combination of greedy algorithm and
This research work suggests developing a diagnostic tool by using the techniques of machine learning and computer vision for the identification of plant diseases based on leaf images. It incorporates various features ...
详细信息
暂无评论