检索结果-内蒙古大学图书馆

21st International Conference on image Analysis and processing (ICIAP)

作者： San-Emeterio, Miguel G. Atos Res & Innovat Madrid 28037 Spain

ISBN: (纸本)9783031133244;9783031133237

This review article about Few-Shot Learning techniques is focused on Computer vision applications based on Deep Convolutional Neural Networks. A general discussion about Few-Shot Learning is given, featuring a context-constrained description, a short list of applications, a description of a couple of commonly used techniques and a discussion of the most used benchmarks for FSL computer vision applications. In addition, the paper features a few examples of recent publications in which FSL techniques are used for training models in the context of Human Behaviour Analysis and Smart City Environment Safety. These examples give some insight about the performance of state-of-the-art FSL algorithms, what metrics do they achieve, and how many samples are needed for accomplishing that.

关键词： Few-Shot Learning Deep Learning Computer vision Human Behaviour Analysis Smart City Environment Safety

来源：评论

学校读者我要写书评

暂无评论

Epidemiological Mucormycosis treatment and diagnosis challenges using the adaptive properties of computer vision techniques based approach: a review

引用

MULTIMEDIA TOOLS AND applications 2022年第10期81卷 14217-14245页

作者： Nira Kumar, Harekrishna GLA Univ Dept Elect & Commun Mathura 281406 India

As everyone knows that in today's time Artificial Intelligence, machine Learning and Deep Learning are being used extensively and generally researchers are thinking of using them everywhere. At the same time, we are also seeing that the second wave of corona has wreaked havoc in India. More than 4 lakh cases are coming in 24 h. In the meantime, news came that a new deadly fungus has come, which doctors have named Mucormycosis (Black fungus). This fungus also spread rapidly in many states, due to which states have declared this disease as an epidemic. It has become very important to find a cure for this life-threatening fungus by taking the help of our today's devices and technology such as artificial intelligence, data learning. It was found that the CT-Scan has much more adequate information and delivers greater evaluation validity than the chest X-Ray. After that the steps of image processing such as pre-processing, segmentation, all these were surveyed in which it was found that accuracy score for the deep features retrieved from the ResNet50 model and SvM classifier using the Linear kernel function was 94.7%, which was the highest of all the findings. Also studied about Deep Belief Network (DBN) that how easy it can be to diagnose a life-threatening infection like fungus. Then a survey explained how computer vision helped in the corona era, in the same way it would help in epidemics like Mucormycosis.

关键词： Mucormycosis Computer vision Black fungus Artificial intelligence Deep learning

来源：评论

学校读者我要写书评

暂无评论

image Understanding Through visual Question Answering: A Review from Past Research 23th

Image Understanding Through Visual Question Answering: A Rev...

引用

23rd International Conference on Intelligent Systems Design and applications, ISDA 2023

作者： Yanda, Nagamani Tagore Babu, J. Aswin Kumar, K. Taraka Rama Rao, M. Ranjith varma, K.v. Rahul Babu, N. GMR Institute of Technology Rajam532127 India

ISBN: (纸本)9783031648465

visual Question Answering (vQA) lies at the crossroads of computer vision, natural language processing, and deep learning, captivating researchers across various AI domains. This dynamic field involves processing an image alongside a corresponding textual question, generating, or selecting an answer from provided options. The past five years have witnessed substantial advancements in vQA and visual reasoning, fueled by deep learning and extensive annotated datasets. This study presents a comprehensive literature review, delving into the current state-of-the-art from four perspectives: problem definition, existing datasets, literature review, and evaluation metrics. Through a critical analysis, we address dataset limitations and scrutinize contemporary algorithms. Here we use multimodal Fusion, which achieves the state of the art compared to existing methodologies. Moreover, we explore potential future research directions to inspire innovative solutions and applications in this evolving domain, aiming to propel vQA into new realms of exploration and practical utility. This project will allow users to input an image and image-related text so that it will aid the question-answering system. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2024.

关键词： Deep learning

来源：评论

学校读者我要写书评

暂无评论

visually-Situated Natural Language Understanding with Contrastive Reading Model and Frozen Large Language Models

Visually-Situated Natural Language Understanding with Contra...

引用

Conference on Empirical Methods in Natural Language processing (EMNLP)

作者： Ki, Geewook Lee, Hodong Kim, Daehee Jung, Haeji Park, Sanghee Kim, Yoonsik Yun, Sangdoo Kim, Taeho Lee, Bado Park, Seunghyun NAVER Cloud AI Seoul South Korea KAIST Ai Daejeon South Korea Korea Univ Seoul South Korea NAVER AI Lab Seoul South Korea

ISBN: (纸本)9798891760608

Recent advances in Large Language Models (LLMs) have stimulated a surge of research aimed at extending their applications to the visual domain. While these models exhibit promise in generating abstract image captions and facilitating natural conversations, their performance on text-rich images still requires improvement. In this paper, we introduce Contrastive Reading Model (Cream), a novel neural architecture designed to enhance the language-image understanding capability of LLMs by capturing intricate details that are often overlooked in existing methods. Cream combines vision and auxiliary encoders, fortified by a contrastive feature alignment technique, to achieve a more effective comprehension of language information in visually situated contexts within the images. Our approach bridges the gap between vision and language understanding, paving the way for the development of more sophisticated Document Intelligence Assistants. Through rigorous evaluations across diverse visually-situated language understanding tasks that demand reasoning capabilities, we demonstrate the compelling performance of Cream, positioning it as a prominent model in the field of visual document understanding. We provide our codebase and newly-generated datasets at https://***/naver-ai/cream.

关键词： image enhancement

来源：评论

学校读者我要写书评

暂无评论

SHAPE DETECTION IN AN image USING PARALLELIZED TRADITIONAL image ANALYSIS TECHNIQUES 58

SHAPE DETECTION IN AN IMAGE USING PARALLELIZED TRADITIONAL I...

引用

58th Annual International Telemetering Conference, ITC 2023

作者： Loreto Cornídez, Alan Manuel Fuentes Gutiérrez, Rubén Diego College of Electrical and Computer Engineering The University of Arizona TucsonAZ85721 United States

Modern day computer vision applications are frequently implemented using machine learning approaches. While these implementations can perform very well, the performance is heavily dependent on sufficient and accurate training data. Due to a lack of adequate training data, the Arizona Autonomous vehicles Club (AZA) decided to implement the generalized hough transform to detect shapes in a live video feed from an unmanned aerial system (UAS). The hough transform is computationally intensive and since real-time performance is required, a serial approach may not have the execution speed necessary for the application. image processing techniques include matrix multiplication and convolution operations which are highly parallelizable. Therefore, the algorithm was parallelized and implemented on a graphics processing unit (GPU). Performance profiling was done on both machine learning and traditional approaches where execution time and accuracy were compared. © 2023 International Foundation for Telemetering. All rights reserved.

关键词： Unmanned aerial vehicles (UAv)

来源：评论

学校读者我要写书评

暂无评论

Guest Editorial: Smart Measurement in machine vision for Challenging applications

引用

IEEE INSTRUMENTATION & MEASUREMENT MAGAZINE 2023年第8期26卷 3-3页

作者： venkatesan, C. AI-Turjman, Fadi Pelusi, Danilo HKBK Coll Engn Dept ECE Bengaluru India Near East Univ Dept Artificial Engn Istanbul Turkiye Univ Teramo Dept CSE Teramo Italy

Smart measurements are widely deployed in many applications due to the technology advancement. For various industrial applications, automated inspection and analysis based on the image is provided by machine vision. For the measurements in these applications, sensors must be connected. machine vision tries to creatively combine already existing technology and use them to address current issues. The term "measurement" is frequently used to refer to many tasks and is the cornerstone of industrial automation and security deployment. This Special Issue of Instrumentation & Measurement Magazine addresses some novel achievements in the measurement and instrumentation science and technology fields. It advances machine vision concerning production, application of smart materials, measurement and estimation techniques, etc. The variety of selected papers reflects the efforts made by the authors to focus either on methodological aspects or technical issues. In particular, three papers have been accepted for publication, reflecting several aspects of the abovementioned fields by covering machine vision and image processing technology.

关键词： Special issues and sections Smart devices Measurement techniques machine vision

来源：评论

学校读者我要写书评

暂无评论

Research on Defect Detection Algorithm for Cigarette Case Appearance Based on machine vision

Research on Defect Detection Algorithm for Cigarette Case Ap...

引用

International Conference on Wireless Communications, Networking and applications (WCNA 2022)

作者： Huang, Chunhui Lu, Haihua Huang, Xiaoping Chen, Sixiao Shen, Miaojie Xiamen Tobacco Industry Co. Ltd. FJ Xiamen361022 China Ningbo Cigarette Factory China Tobacco Zhejiang Industrial Co. Ltd. Ningbo315504 China

ISBN: (纸本)9789819939503

In view of the demand for cigarette case appearance quality detection in the production process of cigarette enterprises, a machine vision-based method for detecting cigarette case appearance defects is proposed, and an appearance packaging quality line detection platform applicable to ultra-high-speed cigarette equipment is established. Haar wavelet de-noising is used for image pre-processing, and the decomposed low-frequency signal is reconstructed to obtain a compressed image. The double thresholding calculation in Canny operator is improved based on the Otsu algorithm, and the defect segmentation of the image is better achieved. Moreover, using Hu moments for defect feature extraction and training SvM classifier by sample features to realize the recognition and classification of cigarette case appearance defects. The ZB48 ultra-high-speed packaging machine was used as the test object to build a detection system, and the experimental results show that the algorithm can effectively achieve the online detection of cigarette case appearance defects. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

关键词： Defects

来源：评论

学校读者我要写书评

暂无评论

Cotton Leaf Disease Based on image processing Using Deep Learning

Cotton Leaf Disease Based on Image Processing Using Deep Lea...

引用

2024 International Conference on Innovations and Challenges in Emerging Technologies, ICICET 2024

作者： Menaga, D. Shri Roshan, B.T. Shri vikaas, S.R. St. Joseph's Institute of Technology Department of Computer Science and Engineering Chennai India

ISBN: (纸本)9798350319019

One of the most important occupations in India is agriculture. Out of all the crops, cotton is the best and is crucial to the agricultural economy of the country. In India, 40-50 million people work in the cotton trade and processing, while six million farmers directly depend on the crop. The cotton leaf disease has grown in importance over the last few decades, resulting in losses to crops, farming operations, and financial resources. To achieve this aim, we first need to acquire different images of cotton plants. We can use image processing techniques to analyze dead leaf images and extract features like color, texture, and other characteristics with the Deep CNN model's assistance. In addition to being less expensive and more straightforward, automatic disease detection supports machine vision, which offers image-based automated process control and inspection. To properly train the algorithm, we will be using a dataset of approximately 1752(approximately 440 images in each class) images classified into different categories according to the diseases. This model will be developed using tools present in Anaconda such as Jupyter Notebook, Spyder etc. The results of this project will demonstrate whether using it in real-time applications is feasible and whether traditional or manual disease and pest identification could benefit from the use of IT- based solutions. © 2024 IEEE.

关键词： Cotton

来源：评论

学校读者我要写书评

暂无评论

A comprehensive study of feature extraction techniques for plant leaf disease detection

引用

MULTIMEDIA TOOLS AND applications 2022年第1期81卷 367-419页

作者： vishnoi, vibhor Kumar Kumar, Krishan Kumar, Brajesh Gurukula Kangri Vishwavidyalaya Dept Comp Sci Haridwar 249404 Uttarakhand India MJP Rohilkhand Univ Dept Comp Sci & IT Bareilly 243006 Uttar Pradesh India

Agriculture has been the most primary source of the livelihood of man for thousands of years. Even today, it provides subsistence to about 50% of the world population. Plant diseases are the serious cause of big losses to crop production every year worldwide. It is necessary to keep the plants healthy at various stages of their growth/development to deal with the financial losses from plant diseases. Symptoms of infections are visible mainly at plant leaves;thus leaves are commonly used to detect and identify the diseases. Detecting the disease through visual observation is itself a challenging task and requires a lot of human expertise. image processing techniques along with computational intelligence or soft computing techniques can be used to provide a better assistance for disease detection to the farmers. A disease in plants can be detected based on its symptoms extracted in the form of features. Feature extraction techniques thus play a vital role in such systems. The paper emphasizes on the review of hand-crafted and deep learning based feature extraction with their merits and demerits. It provides a comprehensive discussion on a variety of image features such as color, texture, and shape for various disorders in different cultures.

关键词： Computer vision image processing Plant leaf diseases Feature extraction Classification machine learning

来源：评论

学校读者我要写书评

暂无评论

vashaNet: An automated system for recognizing handwritten Bangla basic characters using deep convolutional neural network

引用

machine LEARNING WITH applications 2024年 17卷

作者： Raquib, Mirza Hossain, Mohammad Amzad Islam, Md Khairul Miah, Md Sipon Noakhali Sci & Technol Univ Dept Informat & Commun Engn Noakhali 3814 Bangladesh Islamic Univ Dept Biomed Engn Kushtia 7003 Bangladesh Islamic Univ Dept Informat & Commun Technol Kushtia 7003 Bangladesh Univ Carlos III Madrid Dept Signal Theory & Commun Leganes 28911 Madrid Spain Univ Galway Sch Comp Sci Galway H91 TK33 Ireland

Automated character recognition is currently highly popular due to its wide range of applications. Bengali handwritten character recognition (BHCR) is an extremely difficult issue because of the nature of the script. very few handwritten character recognition (HCR) models are capable of accurately classifying all different sorts of Bangla characters. Recently, image recognition, video analytics, and natural language processing have all found great success using convolutional neural network (CNN) due to its ability to extract and classify features in novel ways. In this paper, we introduce a vashaNet model for recognizing Bangla handwritten basic characters. The suggested vashaNet model employs a 26 -layer deep convolutional neural network (DCNN) architecture consisting of nine convolutional layers, six max pooling layers, two dropout layers, five batch normalization layers, one flattening layer, two dense layers, and one output layer. The experiment was performed over 2 datasets consisting of a primary dataset of 5750 images, CMATERdb 3.1.2 for the purpose of training and evaluating the model. The suggested character recognition model worked very well, with test accuracy rates of 94.60% for the primary dataset, 94.43% for CMATERdb 3.1.2 dataset. These remarkable outcomes demonstrate that the proposed vashaNet outperforms other existing methods and offers improved suitability in different character recognition tasks. The proposed approach is a viable candidate for the high efficient practical automatic BHCR system. The proposed approach is a more powerful candidate for the development of an automatic BHCR system for use in practical settings.

关键词： Artificial intelligence Character recognition Computer vision Deep convolutional neural network image processing

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：