Search Results - Inner Mongolia University Library

IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)

Authors: Arriaga, Octavio; Palacio, Sebastian; Valdenegro-Toro, Matias

Affiliations: Univ Bremen, Bremen, Germany; German Res Ctr Artificial Intelligence, Kaiserslautern, Germany; Univ Groningen, Groningen, Netherlands

ISBN: (print) 9798350302493

As more machine learning models are now being applied in real-world scenarios, it has become crucial to evaluate their difficulties and biases. In this paper we present an unsupervised method for calculating a difficulty score based on the accumulated loss per epoch. Our proposed method requires neither modification of the model nor external supervision, and it can be easily applied to a wide range of machine learning tasks. We provide results for the tasks of image classification, image segmentation, and object detection. We compare our score against similar metrics and provide theoretical and empirical evidence of their difference. Furthermore, we show applications of our proposed score for detecting incorrect labels and testing for possible biases.

Keywords: Object detection
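The abstract above describes the difficulty score only as being based on the accumulated loss per epoch. A minimal sketch of that idea, assuming per-sample losses have already been logged at every epoch, might look like the following. The function names, the plain summation without normalization, and the example loss values are illustrative assumptions, not the authors' implementation.

```python
from typing import List, Sequence


def accumulated_loss_scores(loss_per_epoch: Sequence[Sequence[float]]) -> List[float]:
    """Sum each sample's loss over all epochs.

    loss_per_epoch[e][i] is the loss of sample i recorded at epoch e.
    Reading a larger total as "more difficult" is an assumption of this sketch.
    """
    if not loss_per_epoch:
        return []
    num_samples = len(loss_per_epoch[0])
    scores = [0.0] * num_samples
    for epoch_losses in loss_per_epoch:
        if len(epoch_losses) != num_samples:
            raise ValueError("every epoch must report one loss per sample")
        for i, loss in enumerate(epoch_losses):
            scores[i] += loss
    return scores


def rank_hardest(scores: Sequence[float]) -> List[int]:
    """Return sample indices sorted from highest to lowest accumulated loss."""
    return sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)


# Hypothetical per-sample losses for 3 epochs and 4 samples.
losses = [
    [2.0, 1.5, 2.5, 0.5],
    [1.0, 1.5, 2.0, 0.25],
    [0.5, 1.0, 2.0, 0.25],
]
scores = accumulated_loss_scores(losses)
hardest_first = rank_hardest(scores)
```

Samples whose loss stays high across epochs end up at the front of `hardest_first`, which is where one might start looking for incorrect labels.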


Performance Improvement in Welding Operations Through Image Processing

2nd International Conference on Artificial Intelligence and Machine Learning Applications, AIMLA 2024

Authors: Sharma, Mohit Kumar; Menon, Soumya V; Tripathy, Padmaja

Affiliations: Vivekananda Global University, Department of Electrical Engineering, Jaipur, India; School of Sciences, Department of Chemistry and Biochemistry, Karnataka, Bangalore, India; ARKA JAIN University, Department of Mechanical Engineering, Jharkhand, Jamshedpur, India

ISBN: (print) 9798350349221

This study aims to improve the performance of welding operations through image processing. It will use a combination of computer vision and machine learning to better identify and track welds, improve the accuracy of the system, and reduce the potential for errors. In particular, the study will use a deep learning method to classify welds into specific classes, allowing the welding operations to be monitored and operated more effectively. Additionally, a convolutional neural network will be used to identify the welds and estimate the key parameters from the image data. Finally, a robot arm equipped with a digital camera and a torch will be used to validate the welding process in a real-world scenario. The results of this study will be used to improve the efficiency and quality of welding operations through better visibility into the process. © 2024 IEEE.

Keywords: Welds


Image Feature Extraction and Tracking for Robot Vision Servo Control System Based on the Transformer Model

2024 International Conference on Telecommunications and Power Electronics, TELEPE 2024

Authors: He, Yanqiu; Zhu, Yanyan; Liu, Haisheng

Affiliations: Harbin Institute of Petroleum, Heilongjiang, Harbin 150028, China; Harbin Huade University, Heilongjiang, Harbin 150025, China; The Seventh School of Harbin New District, Heilongjiang, Harbin 150080, China

ISBN: (print) 9798350369212

Robot vision servo control systems play an important role in modern automation systems, and image feature extraction and tracking, as their key components, have a direct impact on their performance and application scope. In this paper, we explore a novel approach based on the Transformer model, aiming to improve the image feature extraction and tracking function in robot vision servo control systems. First, we briefly introduce the basic principles of the Transformer model and its successful applications in the field of natural language processing. Then, we discuss in detail how to apply the Transformer model to image feature extraction and evaluate its performance with experimental results. Subsequently, we discuss how to realize the image tracking function by using the Transformer model and propose a new framework of visual servo control system for robots. Finally, we summarize the research results and look forward to possible future research directions. This research provides new ideas and methods to improve the performance and application range of robot visual servo control systems. © 2024 IEEE.

Keywords: Machine vision


Guided Distillation for Semi-Supervised Instance Segmentation


IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

Authors: Berrada, Tariq; Couprie, Camille; Alahari, Karteek; Verbeek, Jakob

Affiliations: Meta FAIR, Menlo Pk, CA 94025, USA; Univ Grenoble Alpes, CNRS, Inria, Grenoble INP, LJK, Grenoble, France

ISBN: (print) 9798350318920; 9798350318937

Although instance segmentation methods have improved considerably, the dominant paradigm is to rely on fully-annotated training images, which are tedious to obtain. To alleviate this reliance, and boost results, semi-supervised approaches leverage unlabeled data as an additional training signal that limits overfitting to the labeled samples. In this context, we present novel design choices to significantly improve teacher-student distillation models. In particular, we (i) improve the distillation approach by introducing a novel "guided burn-in" stage, and (ii) evaluate different instance segmentation architectures, as well as backbone networks and pre-training strategies. Contrary to previous work which uses only supervised data for the burn-in period of the student model, we also use guidance of the teacher model to exploit unlabeled data in the burn-in period. Our improved distillation approach leads to substantial improvements over previous state-of-the-art results. For example, on the Cityscapes dataset we improve mask-AP from 23.7 to 33.9 when using labels for 10% of images, and on the COCO dataset we improve mask-AP from 18.3 to 34.1 when using labels for only 1% of the training data.

Keywords: Algorithms: Machine learning architectures, formulations, and algorithms; Algorithms: Image recognition and understanding


All-in-One Image Coding for Joint Human-Machine Vision with Multi-Path Aggregation

38th Conference on Neural Information Processing Systems, NeurIPS 2024

Authors: Zhang, Xu; Guo, Peiyao; Lu, Ming; Ma, Zhan

Affiliations: School of Electronic Science and Engineering, Nanjing University, China

Image coding for multi-task applications, catering to both human perception and machine vision, has been extensively investigated. Existing methods often rely on multiple task-specific encoder-decoder pairs, leading to high overhead of parameter and bitrate usage, or face challenges in multi-objective optimization under a unified representation, failing to achieve both performance and efficiency. To this end, we propose Multi-Path Aggregation (MPA) integrated into existing coding models for joint human-machine vision, unifying the feature representation with an all-in-one architecture. MPA employs a predictor to allocate latent features among task-specific paths based on feature importance varied across tasks, maximizing the utility of shared features while preserving task-specific features for subsequent refinement. Leveraging feature correlations, we develop a two-stage optimization strategy to alleviate multi-task performance degradation. Upon the reuse of shared features, as little as 1.89% of the parameters are further augmented and fine-tuned for a specific task, which completely avoids extensive optimization of the entire model. Experimental results show that MPA achieves performance comparable to state-of-the-art methods in both task-specific and multi-objective optimization across human viewing and machine analysis tasks. Moreover, our all-in-one design supports seamless transitions between human- and machine-oriented reconstruction, enabling task-controllable interpretation without altering the unified model. Code is available at https://***/NJUvision/MPA. © 2024 Neural Information Processing Systems Foundation. All rights reserved.


Identification of triangular single crystals of transition metal dichalcogenides based on the detection algorithm

Optics Letters, 2024, vol. 49, no. 2, pp. 298-301

Authors: Mao, Yu; Wang, Zixin; Xu, Chang; Wang, Yan; Dong, Ningning; Wang, Jun

Affiliations: Chinese Acad Sci, Shanghai Inst Opt & Fine Mech, Photon Integrated Circuits Ctr, Shanghai 201800, Peoples R China; Univ Chinese Acad Sci, Ctr Mat Sci & Optoelect Engn, Beijing 100049, Peoples R China; Chinese Acad Sci, State Key Lab High Field Laser Phys, Shanghai Inst Opt & Fine Mech, Shanghai 201800, Peoples R China; Chinese Acad Sci, Ctr Excellence Ultraintense Laser Sci CEULS, Shanghai 201800, Peoples R China

The distinctive properties and facile integration of 2D materials hold the potential to offer promising avenues for on-chip photonic devices, and the expeditious and nondestructive identification and localization of diverse fundamental building blocks become key prerequisites. Here, we present a methodology grounded in digital image processing and deep learning, which effectively achieves the detection and precise localization of four monolayer-thick triangular single crystals of transition metal dichalcogenides with a mean average precision above 90%, and the approach demonstrates robust recognition capabilities across varied imaging conditions encompassing both white light and monochromatic light. This stands poised to serve as a potent data-driven tool enhancing the characterizing efficiency and holds the potential to expedite research initiatives and applications founded on the utilization of 2D materials. © 2024 Optica Publishing Group

Keywords: Deep learning; Digital image processing; Image metrics; Imaging systems; Machine vision; Photonic devices


A Comparative Study on Pruning Deep Convolutional Neural Networks Using Clustering Methods: K-Means, CLIQUE, DENCLUE, and OptiGrid

9th International Conference on Multimedia and Image Processing (ICMIP)

Authors: Alqemlas, Danah Saud; Jeragh, Mohammad Esmaeel

Affiliations: Kuwait Univ, Kuwait, Kuwait; Kuwait Oil Co, Kuwait, Kuwait

ISBN: (print) 9798400716164

In the past years, machine learning (ML) and deep learning (DL) have led to the advancement of several applications, including computer vision, natural language processing, and audio processing. These complex tasks require large models, which are challenging to deploy on devices with limited resources. These resource-constrained devices have limited computation power and memory. Hence, the neural networks must be optimized through network acceleration and compression techniques. This paper proposes a novel method to compress and accelerate neural networks from a small set of spatial convolution kernels. Firstly, a novel pruning algorithm is proposed based on the density-based clustering method that identifies and removes redundancy in CNNs while maintaining the accuracy and throughput tradeoff. Secondly, a novel pruning algorithm based on the grid-based clustering method is proposed to identify and remove redundancy in CNNs. The three pruning algorithms (density-based, grid-based, and partitional-based clustering algorithms) are evaluated against each other. The experiments were conducted using the deep CNN compression technique on the VGG-16 and ResNet models to achieve higher accuracy on image classification than the original model at a higher compression ratio and speedup.

Keywords: Neural Network Pruning; Clustering Methods; Image Processing


Evolving Processing Pipelines for Industrial Imaging with Cartesian Genetic Programming

4th IEEE International Conference on Autonomic Computing and Self-Organizing Systems (ACSOS)

Authors: Margraf, Andreas; Cui, Henning; Stein, Anthony; Haehner, Jorg

Affiliations: Fraunhofer IGCV, Digital Prod & AI, Augsburg, Germany; Univ Augsburg, Organ Comp Grp, Augsburg, Germany; Univ Hohenheim, AI Agr Engn, Stuttgart, Germany

ISBN: (print) 9798350337440

The reconfiguration of machine vision systems heavily depends on the collection and availability of large datasets, rendering them inflexible and vulnerable to even minor changes in the data. This paper proposes a refinement of Miller's Cartesian Genetic Programming methodology, aimed at generating filter pipelines for image processing tasks. The approach is based on CGP-IP, but specifically adapted for image processing in industrial monitoring applications. The suggested method allows for retraining of filter pipelines using small datasets; this concept of self-adaptivity renders high-precision machine vision more resilient to faulty machine settings or changes in the environment and provides compact programs. A dependency graph is introduced to rule out invalid pipeline solutions. Furthermore, we suggest not only generating pipelines from scratch, but also storing and reapplying previous solutions and re-adjusting filter parameters. Our modifications are designed to increase the likelihood of early convergence and improvement in the fitness indicators. This form of self-adaptivity allows for a more resource-efficient configuration of image filter pipelines with small datasets.

Keywords: CGP; image filters; monitoring; segmentation


Classification of Diseased Cotton Leaves and Plants Using Improved Deep Convolutional Neural Network

Multimedia Tools and Applications, 2023, vol. 82, no. 16, pp. 25307-25325

Authors: Rai, Chitranjan Kumar; Pahuja, Roop

Affiliations: Dr B R Ambedkar Natl Inst Technol, Dept Instrumentat & Control Engn, Jalandhar 144011, Punjab, India

The automated detection and classification of plant diseases based on images of leaves is a significant milestone in agriculture. Due to the increasing popularity of digital image processing, machine learning, and computer vision techniques, it has been proposed that these could be used for the early detection of diseases. However, the accuracy of these techniques is still considered to be a challenge. In this paper, deep learning was used to identify and predict cotton plant disease status using images of leaves and plants collected in an uncontrolled environment. This paper focuses on solving the problem of cotton plant disease detection and classification using an improved Deep Convolution Neural Network based model. Three different experimental configurations were investigated to study the impacts of different data split ratios, different choices of pooling layer (max-pooling vs. average-pooling), and epoch sizes. The models were trained using a database of 2293 images of cotton leaves and plants. The data included four distinct classes of leaves, plant disease combinations, and their respective categories. For classifying leaves and plant diseases in cotton plants, our model attained an accuracy of 97.98%. The proposed technique outperformed the recent approaches indicated in earlier literature for relevant parameters. As a result, the technique is intended to reduce human error, the time spent identifying cotton leaf disease in major production regions, and the time spent determining its severity.

Keywords: Convolution neural network; Image classification; Deep learning; Image processing; Plant leaf disease
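The abstract above compares max-pooling with average-pooling layers. As a small illustration of how the two choices differ, the sketch below applies both to the same feature map using non-overlapping 2x2 windows. The window size, the helper names, and the example values are assumptions made for illustration; the record does not state the paper's actual layer configuration.

```python
from typing import Callable, List, Sequence

Matrix = List[List[float]]


def pool2x2(feature_map: Sequence[Sequence[float]],
            reduce: Callable[[List[float]], float]) -> Matrix:
    """Apply `reduce` to each non-overlapping 2x2 window of the feature map."""
    rows, cols = len(feature_map), len(feature_map[0])
    if rows % 2 or cols % 2:
        raise ValueError("feature map height and width must be even")
    pooled: Matrix = []
    for r in range(0, rows, 2):
        pooled_row = []
        for c in range(0, cols, 2):
            window = [
                feature_map[r][c], feature_map[r][c + 1],
                feature_map[r + 1][c], feature_map[r + 1][c + 1],
            ]
            pooled_row.append(reduce(window))
        pooled.append(pooled_row)
    return pooled


def max_pool(feature_map: Sequence[Sequence[float]]) -> Matrix:
    """Keep the strongest activation in each window."""
    return pool2x2(feature_map, max)


def avg_pool(feature_map: Sequence[Sequence[float]]) -> Matrix:
    """Keep the mean activation in each window."""
    return pool2x2(feature_map, lambda w: sum(w) / len(w))


# Hypothetical 4x4 feature map.
feature_map = [
    [1.0, 3.0, 0.0, 0.0],
    [5.0, 7.0, 0.0, 8.0],
    [2.0, 2.0, 4.0, 4.0],
    [2.0, 2.0, 0.0, 0.0],
]
max_result = max_pool(feature_map)
avg_result = avg_pool(feature_map)
```

In the top-right window a single strong activation (8.0) survives max-pooling unchanged but is diluted to 2.0 by average-pooling, which is the kind of difference such a comparison would probe.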


A Micro-Topography Enhancement Method for DEMs: Advancing Geological Hazard Identification

Remote Sensing, 2025, vol. 17, no. 5, article 920

Authors: He, Qiulin; Dong, Xiujun; Li, Haoliang; Deng, Bo; Sima, Jingsong

Affiliations: Chengdu Univ Technol, State Key Lab Geohazard Prevent & Geoenvironm Prot, Chengdu 610059, Peoples R China

Geological hazards in densely vegetated mountainous regions are challenging to detect due to terrain concealment and the limitations of traditional visualization methods. This study introduces the LiDAR image highlighting algorithm (LIHA), a novel approach for enhancing micro-topographical features in digital elevation models (DEMs) derived from airborne LiDAR data. By analogizing terrain profiles to non-stationary spectral signals, LIHA applies locally estimated scatterplot smoothing (Loess smoothing), wavelet decomposition, and high-frequency component amplification to emphasize subtle features such as landslide boundaries, cracks, and gullies. The algorithm was validated using the Mengu landslide case study, where edge detection analysis revealed a 20-fold increase in identified micro-topographical features (from 1907 to 37,452) after enhancement. Quantitative evaluation demonstrated LIHA's effectiveness in improving both human interpretation and automated detection accuracy. The results highlight LIHA's potential to advance early geological hazard identification and mitigation, particularly when integrated with machine learning for future applications. This work bridges signal processing and geospatial analysis, offering a reproducible framework for high-precision terrain feature extraction in complex environments.

Keywords: digital elevation model; LiDAR; micro-topographical features; vision enhancement; wavelet decomposition; edge detection; geological hazard identification
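The abstract above lists wavelet decomposition and high-frequency component amplification among the steps of LIHA. The sketch below illustrates only that general idea on a one-dimensional elevation profile: a single-level Haar-style split into pairwise averages and half-differences, a gain applied to the differences, and reconstruction. The choice of wavelet, the single decomposition level, the gain value, and the omission of the Loess smoothing step are all assumptions of this sketch and should not be read as the published algorithm.

```python
from typing import List, Sequence, Tuple


def haar_decompose(profile: Sequence[float]) -> Tuple[List[float], List[float]]:
    """Split a profile into pairwise averages (low frequency)
    and pairwise half-differences (high frequency)."""
    if len(profile) % 2 != 0:
        raise ValueError("profile length must be even")
    approx = [(profile[i] + profile[i + 1]) / 2 for i in range(0, len(profile), 2)]
    detail = [(profile[i] - profile[i + 1]) / 2 for i in range(0, len(profile), 2)]
    return approx, detail


def haar_reconstruct(approx: Sequence[float], detail: Sequence[float]) -> List[float]:
    """Invert haar_decompose: each (average, half-difference) pair yields two samples."""
    profile: List[float] = []
    for a, d in zip(approx, detail):
        profile.extend([a + d, a - d])
    return profile


def enhance_profile(profile: Sequence[float], gain: float = 4.0) -> List[float]:
    """Amplify the high-frequency part of the profile by `gain` (hypothetical value)."""
    approx, detail = haar_decompose(profile)
    boosted = [gain * d for d in detail]
    return haar_reconstruct(approx, boosted)


# Hypothetical elevation profile (metres) with one subtle 1 m step.
profile = [100.0, 100.0, 101.0, 100.0, 100.0, 100.0]
enhanced = enhance_profile(profile, gain=4.0)
```

With a gain of 1.0 the profile is returned unchanged, so the gain alone controls how strongly small terrain steps are exaggerated.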

