Search results - Inner Mongolia University Library

Multi-class object detection system using hybrid convolutional neural network architecture

Multimedia Tools and Applications, 2022, vol. 81, no. 22, pp. 31727-31751

Authors: Borade, Jay Laxman; Lakshmi, Muddana A. Affiliations: GITAM Deemed Univ, Hyderabad, India; GITAM Deemed Univ, CSE Dept, Hyderabad, India

Object detection in computer vision has been a significant research area for the past decade. Identifying objects of multiple classes in an image has attracted great attention because it allows the image content to be classified and detected effectively. Multi-class object detection from a video or image is quite challenging because of the errors introduced by the location classification process. Our proposed system uses a generalized hybrid convolutional neural network (H-CNN) model to recognize the user object in an image. The proposed work integrates pre-processing, object localization, feature extraction and classification. First, the input image is pre-processed with Gaussian filtering to remove noise and improve the image quality. After pre-processing, the image is subjected to object localization, where the object is localized using Grid Guided Localization (GGL). In the feature extraction phase, the model is pre-trained with AlexNet. Here, AlexNet is generalized as fully connected (FC) layers. Finally, the Softmax layer in the AlexNet architecture is replaced by SVR (Support Vector Regression), which acts as a classifier for identifying the object class. The classification loss is minimized using the Improved Grey Wolf (IGW) optimization algorithm. Thus, the H-CNN model can quickly classify and label the objects in images. It also offers improved classification performance with an effective training time. The proposed work will be implemented in Python. The model is built using various datasets, namely MIT-67, PASCAL VOC2010, MS (Microsoft) COCO, and MSRC, to effectively train and classify the objects. The proposed H-CNN achieved improved results with MIT-67 (96.02%), PASCAL VOC2010 (95.04%), MSRC (97.37%), and MS COCO (94.53%). The results obtained by H-CNN proved that the excluded result of Mean Average Precision (mAP), Precision, Accuracy, Recall values and F1-Score achieved better results than with re
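The Gaussian-filtering pre-processing step mentioned in this abstract can be illustrated with a minimal pure-Python sketch. The kernel size, sigma, border handling (index clamping), and the function names `gaussian_kernel` and `gaussian_filter` are all assumptions made for illustration; the abstract does not state the authors' actual settings or implementation.

```python
import math


def gaussian_kernel(size=5, sigma=1.0):
    """Return a normalized size x size Gaussian kernel (size should be odd)."""
    half = size // 2
    kernel = [
        [math.exp(-(x * x + y * y) / (2.0 * sigma * sigma))
         for x in range(-half, half + 1)]
        for y in range(-half, half + 1)
    ]
    total = sum(sum(row) for row in kernel)
    return [[value / total for value in row] for row in kernel]


def gaussian_filter(image, size=5, sigma=1.0):
    """Smooth a 2-D list of grey levels; borders are handled by clamping indices."""
    kernel = gaussian_kernel(size, sigma)
    half = size // 2
    height, width = len(image), len(image[0])
    out = [[0.0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            acc = 0.0
            for dy in range(-half, half + 1):
                for dx in range(-half, half + 1):
                    rr = min(max(r + dy, 0), height - 1)  # clamp row index
                    cc = min(max(c + dx, 0), width - 1)   # clamp column index
                    acc += kernel[dy + half][dx + half] * image[rr][cc]
            out[r][c] = acc
    return out


# Hypothetical test image: flat background with one noisy spike in the centre.
noisy = [[10.0] * 7 for _ in range(7)]
noisy[3][3] = 200.0
smoothed = gaussian_filter(noisy, size=5, sigma=1.0)
```

Because the kernel is normalized, a flat region keeps its grey level while the isolated spike is spread over its neighbours and attenuated.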

Keywords: image processing; Object localization; Deep learning; Object recognition; machine learning

Rosette Plant Centre Detection and Tracking using YOLO: An Efficient Deep Learning Approach

3rd International Conference on Computing and Machine Intelligence (ICMI)

Authors: Akagic, Amila; Saric, Rijad; Buza, Emir; Kecman, Stefani; Lewsey, Mathew G.; Custovic, Edhem; Whelan, James. Affiliations: Univ Sarajevo UNSA, Fac Elect Engn, Sarajevo 71000, Bosnia & Herceg; La Trobe Inst Sustainable Agr & Food LISAF, Dept Anim Plant & Soil Sci, Melbourne, Vic 3086, Australia; La Trobe Univ, Australian Res Council Res Hub Med Agr, Melbourne, Vic 3086, Australia; Sci Instruments Australia SIA, 2 Res Ave, Melbourne, Vic 3086, Australia; Zhejiang Univ, Coll Life Sci, State Key Lab Plant Environm Resilience, Hangzhou 310058, Peoples R China; Zhejiang Univ, Prov Int Sci & Technol Cooperat Base Engn Biol, Haining 314400, Peoples R China

ISBN: (print) 9798350372977; 9798350372984

The precise detection of plant centres is important for growth monitoring, enabling the continuous tracking of plant development to discern the influence of diverse factors. It holds significance for automated systems like robotic harvesting, facilitating machines in locating and engaging with plants. In this paper, we explore the YOLOv4 (You Only Look Once) real-time neural network detector for plant centre detection. Our dataset, comprising over 12,000 images from 151 Arabidopsis thaliana accessions, is used to fine-tune the model. Evaluation of the dataset reveals the model's proficiency in centre detection across various accessions, boasting an mAP of 99.79% at a 50% IoU threshold. The model demonstrates real-time processing capabilities, achieving a frame rate of approximately 50 FPS. This outcome underscores its rapid and efficient analysis of video or image data, showcasing practical utility in time-sensitive applications.
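The "mAP at a 50% IoU threshold" figure in this abstract rests on the intersection-over-union (IoU) test between a predicted box and a ground-truth box. The sketch below shows that test only, not the full mAP computation; the `(x1, y1, x2, y2)` box format and the function names `iou` and `is_true_positive` are assumptions for illustration.

```python
def iou(box_a, box_b):
    """Intersection over union of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def is_true_positive(pred_box, truth_box, threshold=0.5):
    """A detection counts as correct when its IoU reaches the threshold."""
    return iou(pred_box, truth_box) >= threshold


# Hypothetical boxes: the first prediction overlaps well, the second does not.
truth = (0.0, 0.0, 10.0, 10.0)
good_pred = (2.0, 0.0, 12.0, 10.0)  # IoU = 80 / 120
poor_pred = (5.0, 0.0, 15.0, 10.0)  # IoU = 50 / 150
```

With the 0.5 threshold, `good_pred` is accepted and `poor_pred` is rejected.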

Keywords: Plant Phenotyping; Arabidopsis thaliana; Computer vision; image processing; Deep Learning; Neural Networks

Automated Detection of Diabetic Retinopathy Segmented Images using ResNet50 and VGG16 Deep Learning Algorithms

2nd International Conference on Inventive Computing and Informatics (ICICI)

Authors: Betha, Sashi Kanth; Seventline, J. B. Affiliations: GITAM Deemed Be Univ, Visakhapatnam, Andhra Pradesh, India; Vignans Inst Engn Women, Dept ECE, Visakhapatnam, Andhra Pradesh, India; GITAM Deemed Be Univ, Dept EECE, Visakhapatnam, Andhra Pradesh, India

ISBN: (print) 9798350373301; 9798350373295

Diabetic retinopathy (DR), a severe complication arising from diabetes, poses a significant threat to vision due to the deterioration of retinal blood vessels. This research work proposes a comprehensive methodology for the automated detection, grading, and segmentation of DR, leveraging advanced image processing, deep learning techniques and machine learning. The study utilizes the Indian Diabetic Retinopathy Image Dataset (IDRID), comprising 81 fundus images and labels, to rigorously evaluate the proposed methodology. Key steps include detailed image preprocessing, VGG16-based feature extraction, Random Forest classifier-based grading, and innovative segmentation techniques for lesion localization. The evaluation demonstrates exceptional performance, with both VGG16 and ResNet50 architectures achieving over 99% accuracy. The process of semantic segmentation enhances interpretability, supporting clinical decision-making in retinopathy diagnosis. While the results are promising, future validation on diverse datasets and careful consideration of ethical implications are essential for responsible deployment in clinical settings. The proposed methodology signifies a significant step toward precise diagnostics and improved patient outcomes in diabetic retinopathy and holds potential for broader applications in retinal disease diagnosis.

Keywords: Diabetic retinopathy; VGG16; Feature extraction; Image Preprocessing; Segmentation

Research on verification framework of image processing IP core based on real-time reconfiguration

Colloidal Nanoparticles for Biomedical Applications XVIII 2023

Authors: Mo, Wei; Zhao, Lu; Wen, Jianping. Affiliations: Xi’an Xwzn Technology Co. Ltd., Shaanxi Science and Technology Holding Group Co. Ltd., Xi’an, China; Xi’an University of Science and Technology, Xi’an, China

ISBN: (print) 9781510658950

The verification of IP core with image processing algorithm is important for SoC and FPGA application in the field of machine vision. This paper proposes a verification framework with general purpose, real-time performance and agility for IP core with image processing algorithm by using heterogeneous platform composed of ARM and FPGA. In the verification framework, the Gigabit Ethernet communication between PC and ARM is established. The FPGA is used to build the data bus to be compatible with multiple types of images, and combine with a partial reconfiguration to achieve fast iteration of IP cores of the algorithm to be verified. The validation framework is reusable for the algorithm IP core, and the deployment speed of the IP cores to be verified is 25 times faster than global reconfiguration. Compared with the existing FPGA verification technology, it has better reusability, shorter verification cycle, more targeted test stimulus, and faster deployment of IP cores to be verified. © 2023 SPIE.

Keywords: image processing

Image Processing using Cloud for Surveillance

5th International Conference on Information Management and Machine Intelligence, ICIMMI 2023

Authors: Tiwari, Shivam; Phukan, Aarhee; Vedhavathy, T.R. Affiliation: Department of Networking and Communications, School of Computing, SRM Institute of Science and Technology, Chengalpattu, Tamil Nadu, Kattankulathur 603203, India

ISBN: (print) 9798400709418

In the era of digitization and big data, the world is inundated with an ever-growing volume of visual content, be it images or videos. As organizations strive to harness the potential of these multimedia data sources, there is an increasing need for advanced image processing techniques that can automate the analysis and extraction of valuable information. Amazon Web Services (AWS) Rekognition emerges as a powerful solution in this landscape, offering a comprehensive system for image and video analysis through the lens of machine learning and computer vision. This paper delves into the realm of image processing using AWS Rekognition, unveiling the transformative capabilities of this cloud-based service and its applications in various domains. As we embark on this journey, we will explore the principles, methodologies, and real-world implications of leveraging AWS Rekognition for image analysis. © 2023 ACM.

Keywords: Computer vision

Optical Preprocessing for Low-Latency Machine Vision

Author: Muminov, Baurzhan. Affiliation: University of California, Riverside

Degree level: Ph.D., Doctor of Philosophy

In recent years there has been an increased interest in edge computing, i.e., computing performed on distributed devices as opposed to centralized high-power hubs. Examples of edge computing would be the local image processing performed on Unmanned Autonomous Vehicles (UAVs) or the specialized machine vision systems on drones. These edge computing applications require schemes that are efficient with power and memory and typically must operate in real time. Many state-of-the-art image processing solutions that employ advanced optimization and deep neural networks (NNs) achieve impressive benchmark results, but are computationally demanding and thus, on many occasions, impractical. An additional requirement for a range of applications is noise robustness or the ability to work in (extreme) low-light conditions; a reasonable-quality image or accurate object classification may be critical when there is low light flux or when the environment is over-saturated with other signals. Here, we approach edge computing with a combination of optical preprocessing and a shallow NN, and we show that this hybrid approach greatly reduces the computational requirements. For low-SNR imaging, we develop a technique that reconstructs objects and scenes from their Fourier-plane images. The optical preprocessing is performed via encoded diffraction with optical vortex singularities. The optical vortex encoder achieves differentiation of the already-compressed Fourier-plane patterns and enables facile inverse inference of the original object scene. We demonstrate that our method is robust to noise and that, for a simple NN architecture (one or two layers), it leads to generalization, i.e., reconstruction of objects from classes that are greatly different from the ones the NN was trained on. Our research identifies strong potential for swift hybrid imaging systems with edge computing applications and highlights the valuable function of the vortex encoder for spectral differentiation.

Keywords: Low-latency; machine learning; machine vision; Noise robustness; Topological optics; vortices

An AI pipeline for garment price projection using computer vision

Neural Computing and Applications, 2024, vol. 36, no. 25, pp. 15631-15651

Authors: Rico Gómez, Rodrigo; Lorentz, Joe; Hartmann, Thomas; Goknil, Arda; Pal Singh, Inder; Halaç, Tayfun Gökmen; Boruzanlı Ekinci, Gülnaz. Affiliations: DataThings, 5 rue de l’industrie, Luxembourg 1811, Luxembourg; SnT, University of Luxembourg, Campus Kirchberg, Luxembourg 1359, Luxembourg; SINTEF Digital, Oslo, Norway; Galaksiya Information Technologies, Izmir, Turkey; Department of Mathematics, Ege University, Izmir, Turkey

The fashion industry’s traditional price-setting methods, based on historical sales and Fashion Week trends, are inadequate in the digital era. Rapid changes in collections and consumer preferences necessitate advanced Artificial Intelligence (AI) techniques. These AI methods should analyze data from various sources, including social media and e-commerce, to predict future fashion trends and prices. In this paper, we propose, apply, and assess a data analytics approach, i.e., FashionXpert, employing several image processing and machine learning techniques in an AI pipeline for garment price prediction. It integrates various heterogeneous data sources (e.g., textual and image data from e-stores, brand websites, and social media) to obtain more consistent, accurate, and beneficial information. We evaluated its effectiveness with an industrial data set obtained by a fashion search tool from the electronic commerce sites of clothing brands. FashionXpert predicted garment prices with an average Mean Absolute Error (MAE) of 15.31 EUR on a data set that has a standard deviation of 72.99 EUR. © The Author(s) 2024.
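To put the reported error in context, the Mean Absolute Error (MAE) of 15.31 EUR can be compared with the data set's standard deviation of 72.99 EUR; the ratio works out to roughly 0.21. The sketch below shows how MAE is computed and reproduces that ratio. The sample prices and the function name `mean_absolute_error` are hypothetical and are not taken from the FashionXpert data set.

```python
def mean_absolute_error(actual, predicted):
    """Average of the absolute differences between actual and predicted values."""
    if not actual or len(actual) != len(predicted):
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


# Hypothetical garment prices in EUR, for illustration only.
actual_prices = [50.0, 120.0, 80.0]
predicted_prices = [60.0, 100.0, 85.0]
sample_mae = mean_absolute_error(actual_prices, predicted_prices)

# Figures quoted in the abstract.
reported_mae = 15.31
reported_std = 72.99
mae_to_std_ratio = reported_mae / reported_std  # roughly 0.21
```

A ratio well below 1 indicates that the typical prediction error is small relative to the spread of prices in the data.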

Keywords: image segmentation

Machine Learning Based Leaf Disease Diagnosis System

2025 International Conference on Multi-Agent Systems for Collaborative Intelligence, ICMSCI 2025

Authors: Shalini, V. Baby; Kumar, Bittu; Varma, P. Varun; Reddy, P. Vinay Kumar; Kumar, T. Tarun. Affiliation: Kalasalingam Academy of Research and Education, Department of Information Technology, Tamil Nadu, Krishnankoil, India

ISBN: (print) 9798331509828

This research work proposes a diagnostic tool that uses machine learning and computer vision techniques to identify plant diseases from leaf images. It incorporates various features such as spots, lesions, abnormal shapes, and signs of insect damage to identify potential health problems in the plants. The tool can use image-processing methods like contour analysis, colour space transformation, and morphological operations to detect and classify the diseases based on risk factors like the shape and area of spots, along with the extent of damage on the leaf. The tool is designed to work with leaf images uploaded from Google Drive and then provide an extensive report on plant health indicators such as spots, shape, and insect damage. Detecting disease early enough can consequently reduce crop losses. The detection system also supports more targeted application of pesticides. © 2025 IEEE.

Keywords: Diagnosis

Padding-Free Convolution Based on Preservation of Differential Characteristics of Kernels

22nd IEEE International Conference on Machine Learning and Applications, ICMLA 2023

Authors: Leng, Kuangdai; Thiyagalingam, Jeyan. Affiliation: Science and Technology Facilities Council, Scientific Computing Department, Didcot, United Kingdom

ISBN: (print) 9798350345346

Convolution is a fundamental operation in image processing and machine learning. Aimed primarily at maintaining image size, padding is a key ingredient of convolution, which, however, can introduce undesirable boundary effects. We present a non-padding-based method for size-keeping convolution based on the preservation of differential characteristics of kernels. The main idea is to make convolution over an incomplete sliding window 'collapse' to a linear differential operator evaluated locally at its central pixel, which no longer requires information from the neighbouring missing pixels. While the underlying theory is rigorous, our final formula turns out to be simple: the convolution over an incomplete window is achieved by convolving its nearest complete window with a transformed kernel. This formula is computationally lightweight, involving neither interpolation or extrapolation nor restrictions on image and kernel sizes. Our method favours data with smooth boundaries, such as high-resolution images and fields from physics. Our experiments include: i) filtering analytical and non-analytical fields from computational physics and, ii) training convolutional neural networks (CNNs) for the tasks of image classification, semantic segmentation and super-resolution reconstruction. In all these experiments, our method has exhibited visible superiority over the compared ones. © 2023 IEEE.
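The boundary effect that motivates this work can be seen in a small sketch of conventional size-keeping convolution with zero padding. This is not the authors' padding-free method, which the abstract describes only at a high level; it only illustrates the problem that padding creates. The function name `conv2d_zero_pad` and the test image are assumptions for illustration.

```python
def conv2d_zero_pad(image, kernel):
    """Size-keeping 2-D filtering with implicit zero padding.

    The kernel is applied without flipping (cross-correlation), which equals
    convolution for the symmetric kernel used below.
    """
    height, width = len(image), len(image[0])
    k_h, k_w = len(kernel), len(kernel[0])
    pad_h, pad_w = k_h // 2, k_w // 2
    out = [[0.0] * width for _ in range(height)]
    for r in range(height):
        for c in range(width):
            acc = 0.0
            for i in range(k_h):
                for j in range(k_w):
                    rr, cc = r + i - pad_h, c + j - pad_w
                    # Pixels outside the image contribute zero (the padding).
                    if 0 <= rr < height and 0 <= cc < width:
                        acc += kernel[i][j] * image[rr][cc]
            out[r][c] = acc
    return out


# A constant image filtered with a 3 x 3 mean kernel should stay constant.
ones = [[1.0] * 5 for _ in range(5)]
mean_kernel = [[1.0 / 9.0] * 3 for _ in range(3)]
filtered = conv2d_zero_pad(ones, mean_kernel)
```

The interior stays at 1, but edge pixels drop to 6/9 and corner pixels to 4/9, because the incomplete windows are filled with zeros. This darkening at the border is the kind of artefact a padding-free scheme aims to avoid.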

Keywords: computer vision; convolutional neural network; differential operator; machine learning; padding

Efficient Algorithm Analysis Of Kidney Images On FPGA

2025 IEEE International Students' Conference on Electrical, Electronics and Computer Science, SCEECS 2025

Authors: Thanuja, G.V.; Ramya, S. Affiliation: Manipal Institute of Technology, Manipal Academy of Higher Education, Dept. Electronics and Communication Engineering, Karnataka, Manipal 576104, India

ISBN: (print) 9798331529833

The field-programmable gate array (FPGA) offers an effective solution to meet the high-performance requirements of real-time digital signal processors. IP cores developed on FPGAs benefit from the programmable logic's flexibility, efficient timing, and adaptability in algorithm modification, coupled with the processing power provided by the embedded processor. Integrating image processing into this sector is an ideal addition, especially in the growing field of edge detection, which is crucial in areas like image pattern recognition, machine learning, and data processing. This paper presents a case study focused on kidney CT scan images for edge detection, utilizing the Sobel and Canny edge detection techniques on the target FPGA device, xc7z020clg484-1. An IP core was designed and generated using Vivado software, converting image pixels into binary form for pre-processing through these edge detection algorithms. The algorithms were simulated and synthesized on the xc7z020clg484-1 FPGA, with the entire design implemented in Verilog code to create the IP, which connects to the DMA controller throughout the system. This IP core can process multiple CT scan images simultaneously, making it valuable for various biomedical applications. The primary aim of this research is to compare the Sobel and Canny edge detection algorithms to identify the best approach for developing an IP core, evaluated by metrics such as total power consumption, CPU time, execution time, LUT count, and FF usage. The resulting IP core is efficient and conserves resources, making it suitable for other embedded applications. © 2025 IEEE.
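Of the two edge detectors compared in this abstract, the Sobel operator is the simpler one and can be sketched in a few lines of software. The sketch below is a pure-Python reference illustration, not the authors' Verilog IP core, and it does not cover Canny. The function name `sobel_magnitude`, the choice to leave border pixels at zero, and the test image are assumptions for illustration.

```python
import math

# Standard 3 x 3 Sobel kernels for horizontal and vertical gradients.
SOBEL_X = [[-1, 0, 1], [-2, 0, 2], [-1, 0, 1]]
SOBEL_Y = [[-1, -2, -1], [0, 0, 0], [1, 2, 1]]


def sobel_magnitude(image):
    """Gradient magnitude of a 2-D list of grey levels; border pixels are left at 0."""
    height, width = len(image), len(image[0])
    out = [[0.0] * width for _ in range(height)]
    for r in range(1, height - 1):
        for c in range(1, width - 1):
            gx = 0.0
            gy = 0.0
            for i in range(3):
                for j in range(3):
                    pixel = image[r + i - 1][c + j - 1]
                    gx += SOBEL_X[i][j] * pixel
                    gy += SOBEL_Y[i][j] * pixel
            out[r][c] = math.hypot(gx, gy)
    return out


# Hypothetical 5 x 5 image with a vertical step edge between columns 1 and 2.
step_image = [[0.0, 0.0, 255.0, 255.0, 255.0] for _ in range(5)]
edges = sobel_magnitude(step_image)
```

The response is large on the two columns that straddle the step and zero in the flat region to the right of it. A hardware implementation would typically threshold this magnitude to obtain a binary edge map.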

Keywords: image analysis
