Search Results - Inner Mongolia University Library

4th International Conference on Data, Engineering, and Applications, IDEA 2022

Authors: Sonawane, Sandip; Patil, Nitin N. Affiliation: R. C. Patel Institute of Technology, Shirpur, India

ISBN: (print) 9789819700363

Precision agriculture has recently gained significant importance in computer vision technologies. Various processes in the agricultural production cycle, from planting to harvesting, can be carried out automatically and effectively by using computer vision. The lack of publicly available image datasets is a major obstacle to the rapid design and assessment of computer vision based applications and of the machine learning algorithms that support them. To reduce this bottleneck, numerous image dataset collections have been discovered and made publicly available since 2015. In spite of this development, there is still a need for a survey of these datasets. The two most important concerns, choosing the right dataset and knowing how to pre-process and prepare its images, are considerably challenging in every application. This review paper gives a thorough analysis of the public image datasets and numerous pre-processing techniques used in the field of precision agriculture. This study can lead to the development of suitable methods for improved crop quality and productivity along with proper weed management. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

Keywords: Adversarial machine learning

Are Natural Domain Foundation Models Useful for Medical Image Classification?

IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Authors: Huix, Joana Pales; Soderberg, Magnus; Ganeshan, Adithya Raju; Matsoukas, Christos; Haslum, Johan Fredin; Smith, Kevin. Affiliations: KTH Royal Inst Technol, Stockholm, Sweden; Sci Life Lab, Stockholm, Sweden; AstraZeneca, Gothenburg, Sweden

ISBN: (print) 9798350318920; 9798350318937

The deep learning field is converging towards the use of general foundation models that can be easily adapted for diverse tasks. While this paradigm shift has become common practice within the field of natural language processing, progress has been slower in computer vision. In this paper we attempt to address this issue by investigating the transferability of various state-of-the-art foundation models to medical image classification tasks. Specifically, we evaluate the performance of five foundation models, namely SAM, SEEM, DINOv2, BLIP, and OpenCLIP, across four well-established medical imaging datasets. We explore different training settings to fully harness the potential of these models. Our study shows mixed results. DINOv2 consistently outperforms the standard practice of ImageNet pretraining. However, other foundation models failed to consistently beat this established baseline, indicating limitations in their transferability to medical image classification tasks.

Keywords: Algorithms Algorithms and algorithms applications Biomedical / healthcare / medicine Datasets and evaluations formulations Machine learning architectures

When dual contrastive learning meets disentangled features for unpaired image deraining

Machine Vision and Applications, 2023, vol. 34, no. 5, pp. 1-12

Authors: Wang, Tianming; Wang, Kaige; Li, Qing. Affiliations: Chinese Acad Sci, Intelligent Mfg Elect Res Ctr, Inst Microelect, Beijing 100029, Peoples R China; Univ Chinese Acad Sci, Sch Integrated Circuits, Beijing 100029, Peoples R China; China Acad Aerosp Sci & Innovat, Lab 2050, Beijing 100086, Peoples R China

As the basis work of image processing, rain removal from a single image has always been an important and challenging problem. Due to the lack of real rain images and corresponding clean images, most rain removal networks are trained by synthetic datasets, which makes the output images unsatisfactory in practical applications. In this work, we propose a new feature decoupling network for unsupervised image rain removal. Its purpose is to decompose the rain image into two distinguishable layers: clean image layer and rain layer. In order to fully decouple the features of different attributes, we use contrastive learning to constrain this process. Specifically, the image patch with similarity is pulled together as a positive sample, while the rain layer patch is pushed away as a negative sample. We not only make use of the inherent self-similarity within the sample, but also make use of the mutual exclusion between the two layers, so as to better distinguish the rain layer from the clean image. We implicitly constrain the embedding of different samples in the depth feature space to better promote rainline removal and image restoration. Our method achieves a PSNR of 25.80 on Test100, surpassing other unsupervised methods.

Keywords: Image processing; Deraining; Contrastive learning; Unsupervised learning
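The abstract above describes pulling similar image patches together as positives while pushing rain-layer patches away as negatives. A minimal sketch of that idea is an InfoNCE-style loss on patch embeddings. This is an illustration only, not the authors' implementation: the function name `info_nce`, the temperature value, and the use of cosine similarity are all assumptions.

```python
import numpy as np


def info_nce(anchor, positive, negatives, temperature=0.07):
    """InfoNCE-style loss for one anchor embedding (hypothetical helper).

    anchor, positive: arrays of shape (d,)
    negatives: array of shape (n, d), e.g. embeddings of rain-layer patches
    """
    def normalize(x):
        # Unit-normalize so dot products become cosine similarities.
        return x / np.linalg.norm(x, axis=-1, keepdims=True)

    a, p, n = normalize(anchor), normalize(positive), normalize(negatives)
    pos_logit = np.dot(a, p) / temperature
    neg_logits = (n @ a) / temperature
    logits = np.concatenate([[pos_logit], neg_logits])
    logits = logits - logits.max()  # subtract the max for numerical stability
    # Negative log-probability that the positive is picked among all candidates.
    return -(logits[0] - np.log(np.exp(logits).sum()))
```

The loss is small when the anchor is close to its positive and far from every negative, and grows when a negative patch is more similar to the anchor than the positive is.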

An Overview on Image Segmentation Techniques for Reversible Data Hiding

INTERNATIONAL JOURNAL OF MATHEMATICAL ENGINEERING AND MANAGEMENT SCIENCES, 2024, vol. 9, no. 5, pp. 1163-1184

Author: Gupta, Rasika. Affiliation: Delhi Technol Univ, Dept Comp Sci & Engn, Delhi, India

The fields of image processing and computer vision have witnessed significant growth due to the proliferation of digital images across diverse domains. Image segmentation is a fundamental task in digital image processing, finding applications in pivotal areas such as medical imaging, covert communication, autonomous driving, and satellite imaging, among others. One particularly intriguing application of image segmentation lies in Reversible Data Hiding (RDH), where the delineation of the main Region of Interest (ROI) and Non-Region of Interest (NROI) using segmentation plays a crucial role in effective data encryption in images. Over the last two decades, various studies have focused on developing an efficient data hiding approach that can embed secret data within the ROI and NROI parts of an image while preserving its quality. A comprehensive survey has been conducted that meticulously examines different segmentation techniques, along with their usage in reversible data hiding. The main objective of this survey is to compare the performance metrics of reversible data hiding after applying different image segmentation techniques. The image segmentation techniques have been categorized systematically into three main classes: i) traditional segmentation techniques, encompassing a spectrum of approaches such as thresholding, region-based, and edge detection based techniques; ii) Machine Learning (ML) based approaches, consisting of clustering and Support Vector Machine (SVM); and iii) Deep Learning (DL) based techniques, propelled by Convolutional Neural Networks (CNNs) that have emerged as a transformative paradigm, revolutionizing segmentation tasks with their ability to learn complex images. The survey finds that the PSNR value of data-embedded images is high after applying deep learning based segmentation techniques.

Keywords: Encoder-decoder model; Dilated convolution model; ROI segmentation; Reversible data hiding

Advanced thermal vision techniques for enhanced fault diagnosis in electrical equipment: a review

INTERNATIONAL JOURNAL OF SYSTEM ASSURANCE ENGINEERING AND MANAGEMENT, 2025, vol. 16, no. 5, pp. 1914-1932

Authors: Sasithradevi, A.; Persiya, J.; Roomi, S. Mohamed Mansoor; Perumal, D. Arumuga; Prakash, P.; Vijayalakshmi, M.; Ebenezer, L. Brighty. Affiliations: Vellore Inst Technol, Ctr Adv Data Sci, Chennai, Tamil Nadu, India; Vellore Inst Technol, Sch Elect Engn, Chennai, Tamil Nadu, India; Thiagarajar Coll Engn, Dept Elect & Commun Engn, Madurai, Tamilnadu, India; Natl Inst Technol Karnataka, Dept Mech Engn, Surathkal, Mangalore, India; Anna Univ, Dept Elect Engn, MIT Campus, Chennai, India

Ensuring the reliability and safety of electrical equipment is essential for industrial and residential applications. Traditional fault diagnosis methods involving physical inspections are time-consuming and ineffective for early fault detection. Infrared (IR) thermography offers a non-invasive and efficient solution by identifying anomalies in temperature profiles. This review explores thermal vision-based fault diagnosis techniques, including region of interest (ROI) segmentation, image pre-processing, and fault diagnosis algorithms, with a focus on deep learning approaches. The study highlights the effectiveness of machine learning models in enhancing fault detection accuracy while identifying challenges such as environmental variations, data inconsistencies, and system integration issues. The review discusses the role of real-time applications, wireless technologies, and AI-based automation in improving fault detection. Research gaps are identified, and future directions are proposed to enhance efficiency, reliability, and industrial adoption.

Keywords: Fault diagnosis; Infrared thermography; Electrical equipment; Machine learning; Deep learning; Segmentation

Quantum Machine Learning for Computer Vision: A Survey

23rd IEEE International Conference on Machine Learning and Applications, ICMLA 2024

Authors: Islam, Md Majedul; He, Jing Selena. Affiliation: Kennesaw State University, Department of Computer Science, Marietta, United States

ISBN: (print) 9798350374889

This research delves into quantum machine learning (QML) in the context of computer vision analysis by exploring the progress made in quantum computing and its impact on machine learning applications such as managing datasets and improving large-scale data processing efficiency through QML techniques specialised for tasks like image segmentation and classification in computer vision projects, along with findings, from trials conducted using the EMNIST benchmark *** tests reached an accuracy level above 90% successfully categorising tasks, with precision. This study explores the uses of quantum machine learning (QML) in areas like identification medical scans and distant monitoring. It also delves into the existing constraints and hurdles linked to quantum computer technologies. © 2024 IEEE.

Keywords: Quantum computers

Recent advances of deep learning algorithms for aquacultural machine vision systems with emphasis on fish

ARTIFICIAL INTELLIGENCE REVIEW, 2022, vol. 55, no. 5, pp. 4077-4116

Authors: Li, Daoliang; Du, Ling. Affiliations: China Agr Univ, Natl Innovat Ctr Digital Fishery, Beijing, Peoples R China; China Agr Univ, Beijing Engn & Technol Res Ctr Internet Things Ag, Beijing 100083, Peoples R China; China Agr Univ, China EU Ctr Informat & Commun Technol Agr, Beijing 100083, Peoples R China; China Agr Univ, Key Lab Agr Informat Acquisit Technol, Minist Agr, Beijing 100083, Peoples R China; China Agr Univ, Coll Informat & Elect Engn, Beijing 100083, Peoples R China

Monitoring the growth conditions and behavior of fish enables scientific management and reduces the threat of losses caused by disease and stress. Traditional monitoring methods are time-consuming and laborious, and untimely monitoring readily leads to aquaculture accidents. As a non-invasive, objective, and repeatable tool, machine vision systems have been widely used in various aspects of aquaculture monitoring. Nevertheless, the complex underwater environment makes it difficult to obtain ideal data processing results using only traditional image processing methods. Due to their powerful feature extraction capabilities, deep learning (DL) algorithms have been widely used in underwater image processing. Hence, the combination of DL algorithms and machine vision for the automated monitoring of aquaculture is of great importance. As evidence for the multidisciplinary aspects of DL applications, attention is focused on the latest DL methods applied to five fields of research: classification, detection, counting, behavior recognition, and biomass estimation. Meanwhile, because insufficient datasets lower the training efficiency of DL models, transfer learning and GANs have also been put into the spotlight of this field to pursue high performance of DL models. We also present the challenges and benchmarks in terms of the advantages and disadvantages of the selected method in each field. In addition, we review the sources of image acquisition and pre-processing methods in aquaculture. Finally, the challenges and prospects of DL in aquaculture machine vision systems are discussed. The literature review shows that deep neural networks such as AlexNet, LSTM, VGG, and GoogLeNet have been used for aquaculture machine vision systems.

Keywords: Deep learning; Machine vision; Aquaculture; Image acquisition; Image preprocessing

Analysis of Different Object Detection Techniques Used in Image Processing

2024 OPJU International Technology Conference on Smart Computing for Innovation and Advancement in Industry 4.0, OTCON 2024

Authors: Gour, Sonam; Dubey, Shirish Mohan; Sharma, Gaurav; Joshi, Neetu. Affiliation: Poornima College of Engineering, Department of Computer Engineering, Jaipur, India

ISBN: (print) 9798350373783

Computer vision research uses self-driving systems, robot surveillance, and science interpretation. A plethora of applications, including robotics, self-driving systems, video surveillance, and scene interpretation, have spurred intensive research in computer vision throughout the past ten years. Visual recognition systems have attracted a lot of research attention and include localization, detection, picture categorization, and detection. Given the significant developments in neural networks, especially deep learning, it performs remarkably well. One area where a lot of success with computer vision is needed is object recognition. The purpose of this work is to conduct a methodical investigation into the relevance of object detection and its applications in the field of computer vision. This work gives a thorough introduction to object detection, as well as various approaches, computer vision basics, and applications, which will be useful to the image processing and computer vision research communities. © 2024 IEEE.

Keywords: Machine vision

Image enhancement algorithm combining histogram equalization and bilateral filtering

SYSTEMS AND SOFT COMPUTING, 2024, vol. 6

Authors: Wu, Mingzhu; Zhong, Qiuyan. Affiliations: Guangzhou Inst Technol, Dept Informat Engn, Guangzhou 510075, Peoples R China; Guangzhou Railway Polytech, Asset Management Div, Guangzhou 511308, Peoples R China

In the process of image acquisition, transmission, and storage, the image quality is often degraded due to a variety of unfavorable factors, resulting in information loss, which poses certain difficulties for subsequent image processing and analysis. How to enhance the visibility of image details and maintain the naturalness of the image is one of the important challenges in image processing. In response to this challenge, an image enhancement algorithm is proposed based on the advantages of histogram equalization and bilateral filtering. This algorithm organically integrates histogram equalization and bilateral filtering, aiming to improve image quality while reducing noise in the image. Specifically, the study first utilizes an improved histogram equalization strategy to preprocess the image and then applies a bilateral filter for further optimization. The experimental results showed that the optimized histogram equalization could effectively improve the global contrast of the image and avoid excessive enhancement and gray phenomenon of the image. Moreover, its peak signal-to-noise ratio could reach 0.71. However, bilateral filters showed significant advantages in processing complex data sets, and the peak signal-to-noise ratio could reach 0.95. It illustrated that the optimal research method has obvious advantages in improving image quality and reducing noise. The new enhancement strategy not only significantly improves the global contrast of the image but also preserves the naturalness of the image, providing important technical support for image analysis, machine vision, and artificial intelligence applications.

Keywords: Image enhancement; Bilateral filtering; Histogram equalization; Global histogram equalization; Robustness
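The two-stage pipeline summarized in the abstract above (histogram equalization first, then a bilateral filter) can be sketched in plain NumPy. This is a rough illustration under stated assumptions, not the paper's method: it uses ordinary global histogram equalization in place of the authors' improved strategy, and the function names, window radius, and sigma values are hypothetical choices.

```python
import numpy as np


def equalize_histogram(img):
    """Plain global histogram equalization for a uint8 grayscale image."""
    hist = np.bincount(img.ravel(), minlength=256)
    cdf = hist.cumsum()
    cdf_min = cdf[cdf > 0][0]  # CDF value at the darkest intensity present
    total = img.size
    if total == cdf_min:  # constant image: nothing to stretch
        return img.copy()
    lut = np.round((cdf - cdf_min) / (total - cdf_min) * 255.0)
    lut = np.clip(lut, 0, 255).astype(np.uint8)
    return lut[img]  # map every pixel through the lookup table


def bilateral_filter(img, radius=2, sigma_space=2.0, sigma_range=25.0):
    """Brute-force bilateral filter: smooths noise while preserving edges."""
    src = img.astype(np.float64)
    padded = np.pad(src, radius, mode="edge")
    h, w = src.shape
    weighted_sum = np.zeros_like(src)
    weight_total = np.zeros_like(src)
    for dy in range(-radius, radius + 1):
        for dx in range(-radius, radius + 1):
            shifted = padded[radius + dy:radius + dy + h,
                             radius + dx:radius + dx + w]
            # Spatial weight depends on distance, range weight on intensity gap.
            spatial = np.exp(-(dx * dx + dy * dy) / (2.0 * sigma_space ** 2))
            tonal = np.exp(-((shifted - src) ** 2) / (2.0 * sigma_range ** 2))
            weight = spatial * tonal
            weighted_sum += weight * shifted
            weight_total += weight
    out = np.round(weighted_sum / weight_total)
    return np.clip(out, 0, 255).astype(np.uint8)


def enhance(img):
    """Equalize contrast first, then denoise with the bilateral filter."""
    return bilateral_filter(equalize_histogram(img))
```

Running the filter after equalization is one plausible reading of the order given in the abstract: equalization stretches contrast but can amplify noise, and the edge-preserving filter then smooths that noise without blurring strong edges.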

Automatic captioning for medical imaging (MIC): a rapid review of literature

ARTIFICIAL INTELLIGENCE REVIEW, 2023, vol. 56, no. 5, pp. 4019-4076

Authors: Beddiar, Djamila-Romaissa; Oussalah, Mourad; Seppanen, Tapio. Affiliations: Univ Oulu, Ctr Machine Vis & Signal Anal, Oulu 90014, Finland; Univ Oulu, Fac Med, Oulu 90014, Finland

Automatically understanding the content of medical images and delivering accurate descriptions is an emerging field of artificial intelligence that combines skills in both computer vision and natural language processing fields. Medical image captioning is involved in various applications related to diagnosis, treatment, report generation and computer-aided diagnosis to facilitate the decision making and clinical workflows. Unlike generic image captioning, medical image captioning highlights the relationships between image objects and clinical findings, which makes it a very challenging task. Although few review papers have already been published in this field, their coverage is still quite limited and only particular problems are addressed. This motivates the current paper where a rapid review protocol was adopted to review the latest achievements in automatic medical image captioning from the medical domain perspective. We aim through this review to provide the reader with an up-to-date literature in this field by summarizing the key findings and approaches in this field, including the related datasets, applications and limitations as well as highlighting the main competitions, challenges and future directions.

Keywords: Automatic image captioning Caption Diagnosis generation Medical images Rapid review Report generation PRISMA
