检索结果-内蒙古大学图书馆

37th Conference on Neural Information processing Systems (NeurIPS)

作者： Mayo, David Cummings, Jesse Lin, Xinyu Gutfreund, Dan Katz, Boris Barbu, Andrei MIT CSAIL Cambridge MA 02139 USA MIT CBMM Cambridge MA 02139 USA IBM Corp MIT IBM Watson AI Lab Cambridge MA USA

ISBN: (纸本)9781713899921

Humans outperform object recognizers despite the fact that models perform well on current datasets, including those explicitly designed to challenge machines with debiased images or distribution shift. This problem persists, in part, because we have no guidance on the absolute difficulty of an image or dataset making it hard to objectively assess progress toward human-level performance, to cover the range of human abilities, and to increase the challenge posed by a dataset. We develop a dataset difficulty metric MVT, Minimum Viewing Time, that addresses these three problems. Subjects view an image that flashes on screen and then classify the object in the image. images that require brief flashes to recognize are easy, those which require seconds of viewing are hard. We compute the imageNet and ObjectNet image difficulty distribution, which we find significantly undersamples hard images. Nearly 90% of current benchmark performance is derived from images that are easy for humans. Rather than hoping that we will make harder datasets, we can for the first time objectively guide dataset difficulty during development. We can also subset recognition performance as a function of difficulty: model performance drops precipitously while human performance remains stable. Difficulty provides a new lens through which to view model performance, one which uncovers new scaling laws: vision-language models stand out as being the most robust and human-like while all other techniques scale poorly. We release tools to automatically compute MVT, along with image sets which are tagged by difficulty. Objective image difficulty has practical applications - one can measure how hard a test set is before deploying a real-world system - and scientific applications such as discovering the neural correlates of image difficulty and enabling new object recognition techniques that eliminate the benchmark-vsreal-world performance gap.

关键词： Object recognition

来源：评论

学校读者我要写书评

暂无评论

Vitamin Deficiency Detection using image processing through Dermatological Symptoms

Vitamin Deficiency Detection using Image Processing through ...

引用

2024 IEEE International Conference on Intelligent Systems and Advanced applications, ICISAA 2024

作者： Janokar, Sagar Somani, Nimish Solanki, Purvi Solanke, Samiksha Solunke, Aditya Soman, Ishan Vishwakarma Institute of Technology Department of Electronics and Telecommunications Pune India

ISBN: (纸本)9798331539948

This study proposes a way to detect vitamin deficiency by combining machine learning and image processing. Computer vision enables the system to recognise visual symptoms of specific vitamin deficiencies. The recommended approach is that the entire procedure can be subdivided into specific key steps, which are initiated from image acquisition, followed by the image preprocessing steps used to enhance their quality. It catches the confusing patterns that are directed toward various abnormalities through a pretrained Convolutional Neural Network (CNN) model. Finally, with such patterns at hand, the categorisation takes place, which in turn helps to identify specific *** extensive experimentation across a diverse dataset, the system demonstrates remarkable accuracy in detecting deficiency. Its non-invasive nature permits early screening. This proves its potential for widespread implementation and directions for future enhancement, such as dataset expansion and exploration of other advanced architectures apart from CNN. With its promising capabilities, this approach represents a significant stride towards enhancing healthcare diagnostics and preventive measures related to vitamin deficiencies. © 2024 IEEE.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

Identification Method of Dress Pattern Drawing based on machine vision Algorithm 3

Identification Method of Dress Pattern Drawing based on Mach...

引用

3rd International Conference on Computer vision, image and Deep Learning and International Conference on Computer Engineering and applications, CVIDL and ICCEA 2022

作者： Lyu, Ke Yan, Haizhang Zhejiang Sci-Tech University School of International Education Zhejiang310000 China Xidian University School of Computer Science and Technology Xi'an71000 China

ISBN: (纸本)9781665459112

This paper uses the machine vision method to identify the skirt module. We have constructed three kinds of machine recognition models of skirt profile processing, structure analysis of style drawing, and size estimation. The author constructs a relatively complete image recognition system for dress pattern drawing. In addition, we conducted an effect evaluation with a certain number of samples at the later stage of the experiment. This study has A good effect in distinguishing an A-type skirt from an H-type skirt, identifying the reasonable degree and length of the skirt, and determining the quantity statistics of each component element in the skirt pattern diagram. © 2022 IEEE.

关键词： image recognition

来源：评论

学校读者我要写书评

暂无评论

image Segmentation Using Deep Learning: A Survey

引用

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND machine INTELLIGENCE 2022年第7期44卷 3523-3542页

作者： Minaee, Shervin Boykov, Yuri Y. Porikli, Fatih Plaza, Antonio J. Kehtarnavaz, Nasser Terzopoulos, Demetri Snapchat Machine Learning Res Venice CA 90405 USA Univ Waterloo Waterloo ON N21 3G1 Canada Australian Natl Univ Canberra ACT 0200 Australia Huawei San Diego CA 92121 USA Univ Extremadura Badajoz 06006 Spain Univ Texas Dallas Richardson TX 75080 USA Univ Calif Los Angeles Los Angeles CA 90095 USA

image segmentation is a key task in computer vision and image processing with important applications such as scene understanding, medical image analysis, robotic perception, video surveillance, augmented reality, and image compression, among others, and numerous segmentation algorithms are found in the literature. Against this backdrop, the broad success of deep learning (DL) has prompted the development of new image segmentation approaches leveraging DL models. We provide a comprehensive review of this recent literature, covering the spectrum of pioneering efforts in semantic and instance segmentation, including convolutional pixel-labeling networks, encoder-decoder architectures, multiscale and pyramid-based approaches, recurrent networks, visual attention models, and generative models in adversarial settings. We investigate the relationships, strengths, and challenges of these DL-based segmentation models, examine the widely used datasets, compare performances, and discuss promising research directions.

关键词： image segmentation Computer architecture Semantics Deep learning Computational modeling Generative adversarial networks Logic gates image segmentation deep learning convolutional neural networks encoder-decoder models recurrent models generative models semantic segmentation instance segmentation panoptic segmentation medical image segmentation

来源：评论

学校读者我要写书评

暂无评论

New Indicators and Standards for Measuring of the End Mill's Helical Groove by image processing 10

New Indicators and Standards for Measuring of the End Mill's...

引用

Conference on Optical Metrology and Inspection for Industrial applications X

作者： Pivkin, Petr M. Ershov, Artem A. Grechishnikov, Vladimir A. Kuznetsov, Vladimir A. Nazarenko, Ekaterina S. Malysheva, Elena Yu. Nadykto, Alexey B. Moscow State Univ Technol STANKIN Lab Micromachining Technol Moscow 127055 Russia Moscow State Univ Technol STANKIN Dept Cutting Tools & Machining Technol Moscow 127055 Russia

ISBN: (纸本)9781510667877;9781510667884

The studies will be carried out using optical metrology methods on a Walter Helicheck inspection machine in reflected light and a number of images were stored to form a statistical sample. Established new indicators and criteria for grinding efficiency based on image processing of the helical groove of the end mill. As a result, recommendations for the selection of optical control techniques were made for the first time at the intermediate stage of technological preparation for production, in real time, and after processing. In this work, for the first time, we prove the possibility of determining the camera displacement pith distance during continuous scanning of the profile of a helical surface in a radial section, the measurement accuracy and recreating a three-dimensional model of the object. As a result of the work of the new algorithm using the Haar-wavelet with new indicators, it was established that the actual one is located inside the focal zone, which proves the possibility of applied application of the method of monitoring the shape of helical flute of end mills using computer vision. The measurement accuracy of the helical flute increased from 4 to 12% along its profile.

关键词： image processing standards for measuring cutting tools optical metrology new indicators helical groove Haar-like feature

来源：评论

学校读者我要写书评

暂无评论

Crop Disease and Pest Detection using Convolutional Neural Networks (CNN) 5

Crop Disease and Pest Detection using Convolutional Neural N...

引用

5th International Conference on image processing and Capsule Networks, ICIPCN 2024

作者： Kalaimanivel, S. France, K. Hindustan institute of technology and science Department of Computer Applications Chennai22295014 India Hindustan institute of technology and science Department of Computer Applications Chennai India

ISBN: (纸本)9798350367171

Agriculture is often known as the art and science of nurturing soil. It involves preparing plants and animals for use in products. Agriculture is the process of growing crops and rearing animals for human consumption, fiber production, and other reasons. It is one of the oldest and most important human activities, laying the groundwork for food production and billions of people's lives throughout the world. As technology advances, additional capabilities for crop protection and disease prevention become accessible. Artificial Intelligence (AI) and machine Learning (ML) algorithms capture features such as crop and soil monitoring, crop maturity detection, autonomous weeding, intelligent crop spraying, pest and disease detection, and more. This study suggests a novel technique for automated crop disease identification by utilizing Convolutional Neural Networks (CNNs) from the field of computer vision. By performing a thorough testing and validation on separate test sets, the proposed methodology outperforms other existing methods in terms of accuracy. The Plant Village collection, maintained by the Centers for Disease Control and Prevention (CDC), includes damaged plant leaf photos and labels. The proposed method has achieved an accuracy of about 99.6%. © 2024 IEEE.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

An Optimized Pipeline for image-Based Localization in Museums from Egocentric images 22nd

An Optimized Pipeline for Image-Based Localization in Museum...

引用

22nd International Conference on image Analysis and processing (ICIAP)

作者： Messina, Nicola Falchi, Fabrizio Furnari, Antonino Gennaro, Claudio Farinella, Giovanni Maria CNR ISTI Via G Moruzzi 1 I-56017 Pisa Italy Univ Catania Viale A Doria 6 I-95125 Catania Italy

ISBN: (纸本)9783031431470;9783031431487

With the increasing interest in augmented and virtual reality, visual localization is acquiring a key role in many downstream applications requiring a real-time estimate of the user location only from visual streams. In this paper, we propose an optimized hierarchical localization pipeline by specifically tackling cultural heritage sites with specific applications in museums. Specifically, we propose to enhance the Structure from Motion (SfM) pipeline for constructing the sparse 3D point cloud by a-priori filtering blurred and near-duplicated images. We also study an improved inference pipeline that merges similarity-based localization with geometric pose estimation to effectively mitigate the effect of strong outliers. We show that the proposed optimized pipeline obtains the lowest localization error on the challenging Bellomo dataset [11]. Our proposed approach keeps both build and inference times bounded, in turn enabling the deployment of this pipeline in real-world scenarios.

关键词： Localization Camera Pose Estimation Structure From Motion Egocentric vision

来源：评论

学校读者我要写书评

暂无评论

Computer vision and machine Learning based approaches for Food Security: A Review

引用

MULTIMEDIA TOOLS AND applications 2021年第18期80卷 27973-27999页

作者： Sood, Shivani Singh, Harjeet Chitkara Univ Inst Engn & Technol Chandigarh Punjab India

With the rapidly increase of population every day, it has become a major issue to fulfill everyone's need for food products (i.e., vegetables, fruits, milk, wheat, etc.) due to limited production of food products. Moreover, healthy food utilization among people is the foremost requirement. The major factors that affect the food system includes increasing food shortage, decreasing quality, wastage, and loss of food products, limited natural resources, etc. This article addresses the various computer vision and machine learning based techniques, used to minimize the aforementioned issues. image processing has become an effective technique for the analysis of many research applications. This study intends to focus on analysis of image processing based applications in food products and agriculture field. Such applications help in decision making , disease prediction, classification, fruit sorting, soil quality measurement, etc. Moreover, a comprehensive review has been accomplished for various computer vision and statistical approaches used in food production and agricultural field and concludes that Deep Learning (DL) based approaches produce better results, specifically for image processing applications. Additionally, an effort has been made to provide a list of publicly available datasets for the related study.

关键词： Food Ssecurity Deep learning Convolutional neural network Smart farming

来源：评论

学校读者我要写书评

暂无评论

LoG-VMamba: Local-Global vision Mamba for Medical image Segmentation 17th

LoG-VMamba: Local-Global Vision Mamba for Medical Image Seg...

引用

17th Asian Conference on Computer vision, ACCV 2024

作者： Dang, Trung DQ. Nguyen, Huy Hoang Tiulpin, Aleksei University of Oulu Oulu Finland

ISBN: (纸本)9789819609000

Mamba, a State Space Model (SSM), has recently shown competitive performance to Convolutional Neural Networks (CNNs) and Transformers in Natural Language processing and general sequence modeling. Various attempts have been made to adapt Mamba to Computer vision tasks, including medical image segmentation (MIS). vision Mamba (VM)-based networks are particularly attractive due to their ability to achieve global receptive fields, similar to vision Transformers, while also maintaining linear complexity in the number of tokens. However, the existing VM models still struggle to maintain both spatially local and global dependencies of tokens in high dimensional arrays due to their sequential nature. Employing multiple and/or complicated scanning strategies is computationally costly, which hinders applications of SSMs to high-dimensional 2D and 3D images that are common in MIS problems. In this work, we propose Local-Global vision Mamba, LoG-VMamba, that explicitly enforces spatially adjacent tokens to remain nearby on the channel axis, and retains the global context in a compressed form. Our method allows the SSMs to access the local and global contexts even before reaching the last token while requiring only a simple scanning strategy. Our segmentation models are computationally efficient and substantially outperform both CNN and Transformers-based baselines on a diverse set of 2D and 3D MIS tasks. The implementation of LoG-VMamba is available at https://***/Oulu-IMEDS/LoG-VMamba. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Method for robot recognition and localization based on third-person perspective 4

Method for robot recognition and localization based on third...

引用

4th International Conference on machine Learning and Computer Application, ICMLCA 2023

作者： Luo, Zongheng Lu, Jun Chengdu University of Information Technology Chengdu China

ISBN: (数字)9781510680265

ISBN: (纸本)9781510680258

To achieve the recognition and positioning functions of indoor mobile robots under limited computing power conditions, a method based on color recognition for robot recognition and positioning is proposed. The global image of the robot working in the field is collected by a camera outside the field, and the position of the robot is obtained through computer vision processing. Then, the robot is controlled to move according to this information. Experiments conducted within a 2m × 1m area have shown that the maximum error during robot operation is 7.8cm/m, with an average error of 7.0cm/m. The maximum error of the steering angle is 16.6°, with an average error of 7.7°. © 2024 SPIE.

关键词： Robot applications

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：