检索结果-内蒙古大学图书馆

Special Section Guest Editorial: machine vision-Systems, Methods, and applications

JOURNAL OF ELECTRONIC IMAGING 2022年第5期31卷 051601-051601页

作者： Debayle, Johan Osten, Wolfgang Nikolaev, Dmitry MINES St Etienne St Etienne France Univ Stuttgart Stuttgart Germany Inst Informat Transmiss Problems Moscow Russia

Special section editors Johan Debayle, Wolfgang Osten, and Dmitry Nikolaev introduce the Special Section on machine vision: Systems, Methods, and applications.

关键词： machine vision image processing RGB color model image analysis image segmentation Digital image processing Inspection Tumors Optical metrology Physics

来源：评论

学校读者我要写书评

暂无评论

An automated retinal hemorrhages and exudates detection in a Mexican image set

An automated retinal hemorrhages and exudates detection in a...

引用

Conference on applications of machine Learning

作者： Echeveste-Vazquez, J. A. Armas-Perez, J. C. Gonzalez-Vega, A. Soto-Cruz, G. Villasenor-Mora, C. Univ Guanajuato Div Ciencias Ingn Guanajuato Mexico Univ Nacl Autonoma Mexico ENES Leon Mexico City DF Mexico

ISBN: (纸本)9781510665651;9781510665644

Retinopathy is a common complication of diabetes that can cause severe vision loss if not detected and managed promptly. In this study, we propose a comprehensive approach that leverages image processing techniques to analyze fundus images of patients with diabetic retinopathy. Our primary focus is on vein extraction and hemorrhage detection, with exudate detection being performed only on specific images to showcase advancements in the current prototype algorithm. The dataset used in this project consists of images obtained from Mexican ophthalmology institutes, ensuring its relevance and applicability to the local population. By extracting veins and hemorrhages, we aim to capture crucial features indicative of the severity of retinopathy. These generated images, along with the original dataset, are utilized to train convolutional neural network (CNN) models, enabling accurate classification of the disease's degree into three categories. The significance of this project lies in its potential to serve as an auxiliary tool in diagnosing diabetic retinopathy. By automating the analysis of fundus images and providing objective classification results, our algorithm aims to assist healthcare professionals in making informed decisions regarding treatment and management options. The proposed method can potentially enhance the efficiency and precision of diabetic retinopathy (DR) diagnosis, improving Mexican health outcomes.

关键词： Diabetic retinopathy fundus exudates eye veins eye hemorrhages CNN automatic classification.

来源：评论

学校读者我要写书评

暂无评论

DFTNet: Dual Flow Transformer Network for Conveyor Belt Edge Detection

引用

UNMANNED SYSTEMS 2024年第5期12卷 877-885页

作者： Yang, Zhifang Zhang, Liya Hao, Bonan Li, Biao Zhang, Tianxiang China Coal Res Inst China Coal Technol Engn Grp Beijing 100013 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China

In traditional conveyor belt edge detection methods, contact detection methods have a high cost. At the same time noncontact detection methods have low precision, and the methods based on the convolutional neural network are limited by the local operation features of the convolution operation itself, causing problems such as insufficient perception of long-distance and global information. In order to solve the above problems, a dual flow transformer network (DFTNet) integrating global and local information is proposed for belt edge detection. DFTNet could improve belt edge detection accuracy and suppress the interference of belt image noise. In this paper, the authors have merged the advantages of the traditional convolutional neural network's ability to extract local features and the transformer structure's ability to perceive global and long-distance information. Here, the fusion block is designed as a dual flow encoder-decoder structure, which could better integrate global context information and avoid the disadvantages of a transformer structure pretrained on large datasets. Besides, the structure of the fusion block is designed to be flexible and adjustable. After sufficient experiments on the conveyor belt dataset, the comparative results show that DFTNet can effectively balance accuracy and efficiency and has the best overall performance on belt edge detection tasks, outperforming full convolution methods. The processing image frame rate reaches 53.07 fps, which can meet the real-time requirements of the industry. At the same time, DFTNet can deal with belt edge detection problems in various scenarios, which gives it great practical value.

关键词： Edge detection belt deviation machine vision deep learning encoder-decoder

来源：评论

学校读者我要写书评

暂无评论

A Review on Resource-Constrained Embedded vision Systems-Based Tiny machine Learning for Robotic applications

引用

ALGORITHMS 2024年第11期17卷 476-476页

作者： Beltran-Escobar, Miguel Alarcon, Teresa E. Rumbo-Morales, Jesse Y. Lopez, Sonia Ortiz-Torres, Gerardo Sorcia-Vazquez, Felipe D. J. Emiliano Zapata Technol Univ State Morelos Acad Div Ind Mech Emiliano Zapata 62760 Mexico Univ Guadalajara Comp Sci & Engn Dept Ameca 46600 Mexico

The evolution of low-cost embedded systems is growing exponentially;likewise, their use in robotics applications aims to achieve critical task execution by implementing sophisticated control and computer vision algorithms. We review the state-of-the-art strategies available for Tiny machine Learning (TinyML) implementation to provide a complete overview using various existing embedded vision and control systems. Our discussion divides the article into four critical aspects that high-cost and low-cost embedded systems must include to execute real-time control and image processing tasks, applying TinyML techniques: Hardware Architecture, vision System, Power Consumption, and Embedded Software Platform development environment. The advantages and disadvantages of the reviewed systems are presented. Subsequently, the perspectives of them for the next ten years are present. A basic TinyML implementation for embedded vision application using three low-cost embedded systems, Raspberry Pi Pico, ESP32, and Arduino Nano 33 BLE Sense, is presented for performance analysis.

关键词： embedded system image processing mobile robotic TinyML

来源：评论

学校读者我要写书评

暂无评论

Measuring System for Elongation at Break of Cable Insulation Sheath Based on machine vision 14

Measuring System for Elongation at Break of Cable Insulation...

引用

14th International Conference on Digital image processing, ICDIP 2022

作者： Su, Xu Wang, Gangwei Zhang, Zhiqiang Yang, Jiale Zhang, Zhijia School of Artificial Intelligence Shenyang University of Technology China Land and Resources Exploration Center of Hebei Geological and Mineral Resources Exploration and Development Bureau China

ISBN: (纸本)9781510657564

In the production of power cables, the performance test of the cable insulation sheath is an important part. Compared with traditional testing methods, machine vision has the advantages of stable operation, high precision, and high efficiency. Because of this situation, firstly, based on machine vision theory, the structure of the old-fashioned tensile machine was reconstructed, and the whole tensile test process of the cable insulation sheath test was imaged by a CMOS camera, and the color recognition algorithm, effective area segmentation algorithm, and workpiece were proposed. The fracture judgment detection algorithm and the corrosion difference algorithm are used to calculate the distance between the marked lines and then calculate the elongation at the break of the cable material. Through systematic experiments on the same batch of cable jackets, the deviation of the elongation at break measured by visual inspection is the largest, no more than 1%. The experimental results and practical applications show that the machine vision-based visual inspection system has higher accuracy, faster efficiency, and more stable and reliable operation than the traditional inspection system. © 2022 SPIE.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

Research on the Application of machine Learning in Visual image Design

Research on the Application of Machine Learning in Visual Im...

引用

image processing, Computer vision and machine Learning (ICICML), International Conference on

作者： Minnan Cang Meng Zhang Wei Zhou Xi’an University of Technology Xi’an China

ISBN: (数字)9798350355413

ISBN: (纸本)9798350355420

With the rapid development of artificial intelligence, machine learning applications in visual design have become increasingly widespread, particularly in the field of image processing. This study presents an intelligent image generation platform based on Generative Adversarial Networks (GANs) aimed at improving design efficiency and creative expression. The platform integrates Deep Convolutional GANs (DCGANs) with style transfer techniques to efficiently generate high-quality images with artistic and visual appeal. Experimental results demonstrate that the model outperforms traditional methods in terms of clarity (PSNR of 28.4 dB), realism (Inception Score of 7.5), and style consistency (Style Loss of 0.0025). User experience evaluations indicate that designers rate the platform highly for ease of use and image generation quality. This model not only enhances design efficiency but also serves as a powerful tool for creative visual tasks, advancing the automation and intelligence of the design process.

关键词： Visualization Computer vision Automation image synthesis Computational modeling machine learning Generative adversarial networks User experience

来源：评论

学校读者我要写书评

暂无评论

Picture processing Optimization Technology Based on Mask R-CNN Algorithm 3

Picture Processing Optimization Technology Based on Mask R-C...

引用

3rd International Conference on machine Learning and Big Data Analytics for IoT Security and Privacy, SPIoT2023

作者： Tan, Guihua Liu, Yiran College of Information Engineering Loudi Xiaoxiang Vocational College Hunan Loudi417000 China

Picture processing is applied in all kind of fields, such as space science research, medical imaging, photography art. Because the human vision system is a complex nonlinear dynamic system, the traditional image enhancement methods often can not meet the requirements of real-time in practical applications. In the face of a large number of images, it is of great practical significance to obtain practical information from these data. With the rapid development of computer CnTech, image compression coding CnTech has also made great progress. And through the Mask R-CNN algorithm for effective segmentation and image recognition, can help people face massive image information, to their own needs as the goal, to achieve efficient retrieval, analysis, induction. The traditional methods of image classification by manually selecting features or manually extracting template matching have some defects. In this paper, a Mask R-CNN image optimization processing CnTech is proposed, which can accurately identify images and has higher accuracy for image feature extraction. It is meaningful to achieve automatic and accurate segmentation of microscopic images. Through training on COCO data set, it is verified through experiments that there are problems in image segmentation and recognition. In Mask R-CNN model, resource consumption is reduced and the efficiency of image segmentation and recognition is improved. Compared with the current cutting-edge algorithms, the image object detection on the strength of Mask R-CNN model has significant advantages. © 2023 The Authors. Published by Elsevier B.V.

关键词： image retrieval

来源：评论

学校读者我要写书评

暂无评论

Violet: A vision-Language Model for Arabic image Captioning with Gemini Decoder 1

Violet: A Vision-Language Model for Arabic Image Captioning ...

引用

1st Arabic Natural Language processing Conference, ArabicNLP 2023

作者： Mohamed, Abdelrahman Alwajih, Fakhraddin Nagoudi, El Moatez Billah Inciarte, Alcides Alcoba Abdul-Mageed, Muhammad Deep Learning & Natural Language Processing Group The University of British Columbia Canada Department of Natural Language Processing Department of Machine Learning MBZUAI United States

ISBN: (纸本)9781959429272

Although image captioning has a vast array of applications, it has not reached its full potential in languages other than English. Arabic, for instance, although the native language of more than 400 million people, remains largely underrepresented in this area. This is due to the lack of labeled data and powerful Arabic generative models. We alleviate this issue by presenting a novel vision-language model dedicated to Arabic, dubbed Violet. Our model is based on a vision encoder and a Gemini text decoder that maintains generation fluency while allowing fusion between the vision and language components. To train our model, we introduce a new method for automatically acquiring data from available English datasets. We also manually prepare a new dataset for evaluation. Violet performs sizeably better than our baselines on all of our evaluation datasets. For example, it reaches a CIDEr score of 61.2 on our manually annotated dataset and achieves an improvement of 13 points on Flickr8k. © 2023 Association for Computational Linguistics.

关键词： Decoding

来源：评论

学校读者我要写书评

暂无评论

Graph Moving Object Segmentation

引用

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND machine INTELLIGENCE 2022年第5期44卷 2485-2503页

作者： Giraldozuluaga, Jhony H. Javed, Sajid Bouwmans, Thierry La Rochelle Univ Lab MIA Math Image & Applicat F-17000 La Rochelle France Khalifa Univ Ctr Autonomous Robot Syst Abu Dhabi 127788 U Arab Emirates

Moving Object Segmentation (MOS) is a fundamental task in computer vision. Due to undesirable variations in the background scene, MOS becomes very challenging for static and moving camera sequences. Several deep learning methods have been proposed for MOS with impressive performance. However, these methods show performance degradation in the presence of unseen videos;and usually, deep learning models require large amounts of data to avoid overfitting. Recently, graph learning has attracted significant attention in many computer vision applications since they provide tools to exploit the geometrical structure of data. In this work, concepts of graph signal processing are introduced for MOS. First, we propose a new algorithm that is composed of segmentation, background initialization, graph construction, unseen sampling, and a semi-supervised learning method inspired by the theory of recovery of graph signals. Second, theoretical developments are introduced, showing one bound for the sample complexity in semi-supervised learning, and two bounds for the condition number of the Sobolev norm. Our algorithm has the advantage of requiring less labeled data than deep learning methods while having competitive results on both static and moving camera videos. Our algorithm is also adapted for Video Object Segmentation (VOS) tasks and is evaluated on six publicly available datasets outperforming several state-of-the-art methods in challenging conditions.

关键词： Videos Task analysis Signal processing algorithms Object segmentation Semisupervised learning Deep learning Complexity theory Moving object segmentation graph signal processing semi-supervised learning unseen videos video object segmentation

来源：评论

学校读者我要写书评

暂无评论

Automatic Sown Field Detection Using machine vision and Contour Analysis 21st

Automatic Sown Field Detection Using Machine Vision and Cont...

引用

21st International Conference on Computational Science and Its applications (ICCSA)

作者： Shirobokov, Mikhail Grishkin, Valeriy Kayumova, Diana St Petersburg State Univ Univ Embankment 7-9 St Petersburg 199034 Russia

ISBN: (纸本)9783030869601;9783030869595

The paper proposes a prototype of an algorithm based on the use of machine vision methods, which allows automatic identification and selection of fields sown with agricultural crops on images. The algorithm works with satellite images and consists of two stages. At the first stage, the image undergoes initial processing, after which edge detection and contour finding algorithms are applied to it. At the second stage, the obtained image areas enclosed within the contours are represented as a set of numerical and logical parameters which are used for filtering and classification of the areas.

关键词： image processing machine vision Contour analysis

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：