检索结果-内蒙古大学图书馆

3rd International Conference on Computing and machine Intelligence (ICMI)

作者： Akagic, Amila Saric, Rijad Buza, Emir Kecman, Stefani Lewsey, Mathew G. Custovic, Edhem Whelan, James Univ Sarajevo UNSA Fac Elect Engn Sarajevo 71000 Bosnia & Herceg La Trobe Inst Sustainable Agr & Food LISAF Dept Anim Plant & Soil Sci Melbourne Vic 3086 Australia La Trobe Univ Australian Res Council Res Hub Med Agr Melbourne Vic 3086 Australia Sci Instruments Australia SIA 2 Res Ave Melbourne Vic 3086 Australia Zhejiang Univ Coll Life Sci State Key Lab Plant Environm Resilience Hangzhou 310058 Peoples R China Zhejiang Univ Prov Int Sci & Technol Cooperat Base Engn Biol Haining 314400 Peoples R China

ISBN: (纸本)9798350372977;9798350372984

The precise detection of plant centres is important for growth monitoring, enabling the continuous tracking of plant development to discern the influence of diverse factors. It holds significance for automated systems like robotic harvesting, facilitating machines in locating and engaging with plants. In this paper, we explore the YOLOv4 (You Only Look Once) real-time neural network detector for plant centre detection. Our dataset, comprising over 12,000 images from 151 Arabidopsis thaliana accessions, is used to fine-tune the model. Evaluation of the dataset reveals the model's proficiency in centre detection across various accessions, boasting an mAP of 99.79% at a 50% IoU threshold. The model demonstrates real-time processing capabilities, achieving a frame rate of approximately 50 FPS. This outcome underscores its rapid and efficient analysis of video or image data, showcasing practical utility in time-sensitive applications.

关键词： Plant Phenotyping Arabidopsis thaliana Computer vision image processing Deep Learning Neural Networks

来源：评论

学校读者我要写书评

暂无评论

image Enhancement Via Multi-Scale-Exposure image Fusion 11

Image Enhancement Via Multi-Scale-Exposure Image Fusion

引用

Optoelectronic Imaging and Multimedia Technology XI 2024

作者： Zelensky, A. Gapon, N. Zhdanova, M. Voronin, V. Ilukhin, Y. Gribkov, A. Scientific-Manufacturing Complex «Technological Centre» Zelenograd Russia Don State Technical University Rostov-on-Don Russia Center for Cognitive Technology and Machine Vision Moscow State University of Technology «STANKIN» Moscow Russia

ISBN: (纸本)9781510682061

The goal of image enhancement is to improve specific features or details of an image and enhance its overall visual quality. We introduce a novel image enhancement algorithm based on block-rooting processing combined with multi-scale exposure image fusion. The proposed method integrates both local and global transform domain-based feedback mechanisms for imaging applications. The core concept of the local alpha-rooting method involves applying it to disjoint blocks of varying sizes, followed by the decomposition of the weight map and multi-scale enhanced images into Gaussian and Laplacian pyramids. Fusion is achieved by multiplying the multi-scale images and their corresponding weights. A new stage is introduced to obtain a local-global estimate of high-contrast images, which is also employed in the general artificial fusion model. Computer simulations conducted on image datasets demonstrate that the new enhancement algorithm outperforms state-of-the-art techniques. © 2024 SPIE.

关键词： image fusion

来源：评论

学校读者我要写书评

暂无评论

Unsupervised Pose Estimation by Means of an Innovative vision Transformer 21st

Unsupervised Pose Estimation by Means of an Innovative Visio...

引用

21st International Conference on Artificial Intelligence and Soft Computing (ICAISC)

作者： Brandizzi, Nicolo' Fanti, Andrea Gallotta, Roberto Russo, Samuele Iocchi, Luca Nardi, Daniele Napoli, Christian Sapienza Univ Rome Dept Comp Automat & Management Engn Via Ariosto 25 I-00185 Rome Italy Sapienza Univ Rome Dept Psychol Via Marsi 78 I-00185 Rome Italy

ISBN: (纸本)9783031234798;9783031234804

Attention-only Transformers [34] have been applied to solve Natural Language processing (NLP) tasks and Computer vision (CV) tasks. One particular Transformer architecture developed for CV is the vision Transformer (ViT) [15]. ViT models have been used to solve numerous tasks in the CV area. One interesting task is the pose estimation of a human subject. We present our modified ViT model, Un-TraPEs (UNsupervised TRAnsformer for Pose Estimation), that can reconstruct a subject's pose from its monocular image and estimated depth. We compare the results obtained with such a model against a ResNet [17] trained from scratch and a ViT finetuned to the task and show promising results.

关键词： Computer vision image understanding Pose estimation Visual transformers Artificial intelligence and applications

来源：评论

学校读者我要写书评

暂无评论

Automated Detection of Diabetic Retinopathy Segmented images using ResNet50 and VGG16 Deep Learning Algorithms 2

Automated Detection of Diabetic Retinopathy Segmented Images...

引用

2nd International Conference on Inventive Computing and Informatics (ICICI)

作者： Betha, Sashi Kanth Seventline, J. B. GITAM Deemed Be Univ Visakhapatnam Andhra Pradesh India Vignans Inst Engn Women Dept ECE Visakhapatnam Andhra Pradesh India GITAM Deemed Be Univ Dept EECE Visakhapatnam Andhra Pradesh India

ISBN: (纸本)9798350373301;9798350373295

Diabetic retinopathy (DR), a severe complication arising from diabetes, make a significant threat to vision due to the deterioration of retinal blood vessels. This research work proposes a comprehensive methodology for the automated detection, grading, and segmentation of DR, leveraging advanced image processing, deep learning techniques and machine learning. The study utilizes the Indian Diabetic Retinopathy image dataset (IDRID), comprising 81 fundus images and labels, to rigorously evaluates the proposed methodology. Key steps include detailed image preprocessing, VGG16-based feature extraction, Random Forest classifier-based grading, and innovative segmentation techniques for lesion localization. The evaluation demonstrates exceptional performance, with both VGG16 and ResNet50 architectures achieving over 99% accuracy. The process of semantic segmentation enhances interpretability, supporting clinical decision-making in retinopathy diagnosis. While the results are promising, future validation on diverse datasets and careful consideration of ethical implications are essential for responsible deployment in clinical settings. The proposed methodology signifies a significant step toward precise diagnostics and improved patient outcomes in diabetic retinopathy and holds potential for broader applications in retinal disease diagnosis.

关键词： Diabetic retinopathy VGG16 Feature extraction image Preprocessing Segmentation

来源：评论

学校读者我要写书评

暂无评论

Research on verification framework of image processing IP core based on real-time reconfiguration 18

Research on verification framework of image processing IP co...

引用

Colloidal Nanoparticles for Biomedical applications XViiI 2023

作者： Mo, Wei Zhao, Lu Wen, Jianping Xi’an Xwzn Technology Co. Ltd. Shaanxi Science and Technology Holding Group Co. Ltd. Xi’an China Xi’an University of Science and Technology Xi’an China

ISBN: (纸本)9781510658950

The verification of IP core with image processing algorithm is important for SoC and FPGA application in the field of machine vision. This paper proposes a verification framework with general purpose, real-time performance and agility for IP core with image processing algorithm by using heterogeneous platform composed of ARM and FPGA. In the verification framework, the Gigabit Ethernet communication between PC and ARM is established. The FPGA is used to build the data bus to be compatible with multiple types of images, and combine with a partial reconfiguration to achieve fast iteration of IP cores of the algorithm to be verified. The validation framework is reusable for the algorithm IP core, and the deployment speed of the IP cores to be verified is 25 times faster than global reconfiguration. Compared with the existing FPGA verification technology, it has better reusability, shorter verification cycle, more targeted test stimulus, and faster deployment of IP cores to be verified. © 2023 SPIE.

关键词： image processing

来源：评论

学校读者我要写书评

暂无评论

GAN-Based Facial Attribute Manipulation

引用

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND machine INTELLIGENCE 2023年第12期45卷 14590-14610页

作者： Liu, Yunfan Li, Qi Deng, Qiyao Sun, Zhenan Yang, Ming-Hsuan Univ Chinese Acad Sci Sch Elect Elect & Commun Engn Beijing 101408 Peoples R China Chinese Acad Sci Inst Automat Ctr Res Intelligent Percept & Comp State Key Lab Multimodal Artificial Intelligence S Beijing 100190 Peoples R China Univ Chinese Acad Sci Sch Artificial Intelligence Beijing 100049 Peoples R China Peoples Publ Secur Univ China Beijing 100038 Peoples R China Univ Calif Merced Merced CA 95343 USA Yonsei Univ Seoul 03722 South Korea

Facial Attribute Manipulation (FAM) aims to aesthetically modify a given face image to render desired attributes, which has received significant attention due to its broad practical applications ranging from digital entertainment to biometric forensics. In the last decade, with the remarkable success of Generative Adversarial Networks (GANs) in synthesizing realistic images, numerous GAN-based models have been proposed to solve FAM with various problem formulation approaches and guiding information representations. This paper presents a comprehensive survey of GAN-based FAM methods with a focus on summarizing their principal motivations and technical details. The main contents of this survey include: (i) an introduction to the research background and basic concepts related to FAM, (ii) a systematic review of GAN-based FAM methods in three main categories, and (iii) an in-depth discussion of important properties of FAM methods, open issues, and future research directions. This survey not only builds a good starting point for researchers new to this field but also serves as a reference for the vision community.

关键词： Generative adversarial networks image translation facial attribute manipulation

来源：评论

学校读者我要写书评

暂无评论

Recognition and evaluation of cutaneous condition through assorted artificial intelligence reliant algorithms

引用

International Journal of Information Technology (Singapore) 2025年 1-13页

作者： Mishra, Manmohan Yadav, Ajay Kumar Mazumdar, Bireshwar Dass Gupta, Prashant K. Panwar, Arvind Bharadwaj, Shivam Department of Computer Application United Institute of Management Prayagraj India School of Computer Science & Engineering Technology Bennett University Plot Nos 8-11 TechZone II Uttar Pradesh Greater Noida 201310 India Galgotias University Plot No. 2 Yamuna Expy opposite Buddha International Circuit Sector 17A Uttar Pradesh Prayagraj 203201 India

Our skin is the hefty organ that envelops and shields body. It prevents us from numerous fatal and non fatal diseases. It is observed that due to bacteria or other causes of infection, skin faces certain minor or life threatening diseases. The most prioritized step toward restoring health is early illness signs identification. Identifying Cutaneous Condition from clinical images is one of the foremost challenges in medical image investigation. In the presented study we will enlighten the various Artificial Intelligence techniques falling under the categories of supervised machine learning including Probabilistic classifier (Naïve Bayes), Statistical algorithm (Logistic Regression), Ensemble learning (Random Decision Trees), Data analysis technique (Convolutional Neural Network) and Kernel approach (Support Vector machine) to identify and classify the cutaneous condition appropriately so that corrective measures of skin treatment can be endow with. The proposed approach entails collecting images as input, preprocessing, segmenting, feature extraction and lastly applying the classification algorithms to derive the Cutaneous Condition categories. Additional trials are conducted using the different approaches as indicated and it was discovered from the suggested tests that the Convolutional Neural Network strategy yields the best results overall. The proposed model is trained, tested, and evaluated using the International Skin Imaging Collaboration (ISIC) 2019 challenge dataset and Human Against machine with 10,000 training images (HAM10000) for the detection of manifold Cutaneous Condition. © Bharati Vidyapeeth's Institute of Computer applications and Management 2025.

关键词： Artificial intelligence Confusion matrix Convolutional neural networks Cutaneous condition image processing machine learning

来源：评论

学校读者我要写书评

暂无评论

image processing using Cloud for Surveillance 5

Image Processing using Cloud for Surveillance

引用

5th International Conference on Information Management and machine Intelligence, ICIMMI 2023

作者： Tiwari, Shivam Phukan, Aarhee Vedhavathy, T.R. Department of Networking and Communications School of Computing SRM Institute of Science and Technology Chengalpattu Tamil Nadu Kattankulathur603203 India

ISBN: (纸本)9798400709418

In the era of digitization and big data, the world is inundated with an ever-growing volume of visual content, be it images or videos. As organizations strive to harness the potential of these multimedia data sources, there is an increasing need for advanced image processing techniques that can automate the analysis and extraction of valuable information. Amazon Web Services (AWS) Rekognition emerges as a powerful solution in this landscape, offering a comprehensive system for image and video analysis through the lens of machine learning and computer vision. This paper delves into the realm of image processing using AWS Rekognition, unveiling the transformative capabilities of this cloud-based service and its applications in various domains. As we embark on this journey, we will explore the principles, methodologies, and real-world implications of leveraging AWS Rekognition for image analysis. © 2023 ACM.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

Optical Preprocessing for Low-Latency machine vision

Optical Preprocessing for Low-Latency Machine Vision

引用

作者： Muminov, Baurzhan University of California Riverside

学位级别：Ph.D., Doctor of Philosophy

In recent years there has been an increased interest towards edge computing, i.e., computing performed on distributed devices as opposed to centralized high-power hubs. Examples of edge computing would be the local image processing performed on Unmanned Autonomous Vehicles (UAV's) or the specialized machine vision systems on drones. These edge computing applications require schemes that are efficient with power and memory and typically must operate real-time. Many state-of-the-art image processing solutions that employ advanced optimization and deep neural networks (NNs) achieve impressive benchmark results, but are computationally demanding and thus on many occasions, impractical. The additional requirement for a range of applications is noise robustness or the ability to work in (extreme) low-light conditions; reasonable quality image or accurate object classification may be critical when there is low light flux or when the environment is over-saturated with other signals. Here, we approach edge computing with a combination of optical preprocessing and shallow NN and we show that this hybrid approach greatly reduces the computational requirements. For low-SNR imaging, we develop a technique that reconstructs objects and scenes from their Fourier-plane images. The optical preprocessing is performed via encoded diffraction with optical vortex singularities. The optical vortex encoder achieves differentiation of the already-compressed Fourier-plane patterns and enables facile inverse inference of the original object scene. We demonstrate that our method is robust to noise. And for a simple NN architecture (one or two layers), leads to generalization, i.e., reconstruction of objects from classes that are greatly different from the ones the NN was trained on. Our research identifies strong potential for swift hybrid imaging systems with edge computing applications and highlights the valuable function of the vortex encoder for spectral differentiation.

关键词： Low-latency machine learning machine vision Noise robustness Topological optics Vortices

来源：评论

学校读者我要写书评

暂无评论

An AI pipeline for garment price projection using computer vision

引用

Neural Computing and applications 2024年第25期36卷 15631-15651页

作者： Rico Gómez, Rodrigo Lorentz, Joe Hartmann, Thomas Goknil, Arda Pal Singh, Inder Halaç, Tayfun Gökmen Boruzanlı Ekinci, Gülnaz DataThings 5 rue de l’industrie Luxembourg1811 Luxembourg SnT University of Luxembourg Campus Kirchberg Luxembourg1359 Luxembourg SINTEF Digital Oslo Norway Galaksiya Information Technologies Izmir Turkey Department of Mathematics Ege University Izmir Turkey

The fashion industry’s traditional price-setting methods, based on historical sales and Fashion Week trends, are inadequate in the digital era. Rapid changes in collections and consumer preferences necessitate advanced Artificial Intelligence (AI) techniques. These AI methods should analyze data from various sources, including social media and e-commerce, to predict future fashion trends and prices. In this paper, we propose, apply, and assess a data analytics approach, i.e., FashionXpert, employing several image processing and machine learning techniques in an AI pipeline for garment price prediction. It integrates various heterogeneous data sources (e.g., textual and image data from e-stores, brand websites, and social media) to obtain more consistent, accurate, and beneficial information. We evaluated its effectiveness with an industrial data set obtained by a fashion search tool from the electronic commerce sites of clothing brands. FashionXpert predicted garment prices with an average Mean Absolute Error (MAE) of 15.31 EUR on a data set that has a standard deviation of 72.99 EUR. © The Author(s) 2024.

关键词： image segmentation

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：