检索结果-内蒙古大学图书馆

2024 International Conference on Advancements in Power, Communication and Intelligent Systems, APCI 2024

作者： Chittibomma, Sukith Sai Kishan Surapaneni, Ravi Maruboina, Afraim V.R.Siddhartha Engineering College Department of Cse Andhra Pradesh Vijayawada India V.R.Siddhartha Engineering College Department of Cse Andhra Pradesh Vijayawada India

ISBN: (纸本)9798350363289

Face recognition is an area of computer vision and image processing that is quickly expanding, with many uses in security, surveillance, and biometric identity. The proposed model is to develop a criminal identification system based on face recognition using OpenCv, Haar cascade classifier, LBPH, and AdaBoost algorithm. The problem that this project aims to address is the identification and tracking of criminals, which is a crucial duty for law enforcement organisations. The suggested technology is capable of real-time facial recognition and detection of criminals, which is achieved through the use of face recognition and facial detection methods based on machine learning. The system can register criminals and manage their data through a dataset, enabling the tracking and identification of criminals through CCTv footage or manually provided images. Compared to existing technologies, the proposed system is faster, more accurate, robust, reliable, and easy to use. The usage of machine learning-based methods for identifying and recognising faces enables a more accurate and efficient criminal identification system, which is critical in today's world. © 2024 IEEE.

关键词： Face recognition

来源：评论

学校读者我要写书评

暂无评论

Light-weight Fine-tuning Method for Defending Adversarial Noise in Pre-trained Medical vision-Language Models

Light-weight Fine-tuning Method for Defending Adversarial No...

引用

2024 Conference on Empirical Methods in Natural Language processing, EMNLP 2024

作者： Han, Xu Jin, Linghao Ma, Xuezhe Liu, Xiaofeng Yale University United States Information Sciences Institute University of Southern California United States

ISBN: (纸本)9798891761681

Fine-tuning pre-trained vision-Language Models (vLMs) has shown remarkable capabilities in medical image and textual depiction synergy. Nevertheless, many pre-training datasets are restricted by patient privacy concerns, potentially containing noise that can adversely affect downstream performance. Moreover, the growing reliance on multi-modal generation exacerbates this issue because of its susceptibility to adversarial attacks. To investigate how vLMs trained on adversarial noisy data perform on downstream medical tasks, we first craft noisy upstream datasets using multi-modal adversarial attacks. Through our comprehensive analysis, we unveil that moderate noise enhances model robustness and transferability, but increasing noise levels negatively impact downstream task performance. To mitigate this issue, we propose rectify adversarial noise (RAN) framework, a recipe designed to effectively defend adversarial attacks and rectify the influence of upstream noise during fine-tuning. © 2024 Association for Computational Linguistics.

关键词： Adversarial machine learning

来源：评论

学校读者我要写书评

暂无评论

STAIR: Learning Sparse Text and image Representation in Grounded Tokens

STAIR: Learning Sparse Text and Image Representation in Grou...

引用

Conference on Empirical Methods in Natural Language processing (EMNLP)

作者： Chen, Chen Zhang, Bowen Cao, Liangliang Shen, Jiguang Gunter, Tom Jose, Albin Madappally Toshev, Alexander Zheng, Yantao Shlenst, Jonathon Pang, Ruoming Yang, Yinfei Apple AI ML Beijing Peoples R China

ISBN: (纸本)9798891760608

image and text retrieval is one of the foundational tasks in the vision and language domain with multiple real-world applications. State-of-the-art contrastive approaches, e.g. CLIP (Radford et al., 2021), ALIGN (Jia et al., 2021), represent images and texts as dense embeddings and calculate the similarity in the dense embedding space as the matching score. On the other hand, sparse semantic features like bag-of-words models are inherently more interpretable, but believed to suffer from inferior accuracy than dense representations. In this work, we show that it is possible to build a sparse semantic representation that is as powerful as, or even better than, dense presentations. We extend the CLIP model and build a sparse text and image representation (STAIR), where the image and text are mapped to a sparse token space. Each token in the space is a (sub-)word in the vocabulary, which is not only interpretable but also easy to integrate with existing information retrieval systems. STAIR model significantly outperforms a CLIP model with +4.9% and +4.3% absolute Recall@1 improvement on COCO-5k text -> image and image -> text retrieval respectively. It also achieved better performance on both of imageNet zero-shot and linear probing compared to CLIP. (1)

关键词： Semantics

来源：评论

学校读者我要写书评

暂无评论

Evaluating the Impact of Lossy Compression on ADAS Deep Learning Models using Fisheye Cameras 26

Evaluating the Impact of Lossy Compression on ADAS Deep Lear...

引用

26th Irish machine vision and image processing Conference, IMvIP 2024

作者： Simha, Srinidhi Mukanahallipatna Molloy, Dara Fahy, Darren Valeo Vision Systems Ireland University of Galway Ireland

ISBN: (纸本)9781837242672

The increasing deployment of Advanced Driver Assistance Systems (ADAS) alongside the continual rise in camera sensor resolution has led to high bandwidth, and generally high cost, computation, and intra-vehicle communication. While the sensor bandwidth impacts the vehicle architecture, it also affects the data collection, storage, deep learning model training, and validation infrastructures. However, if the bandwidth was low, while still achieving the goal of high accuracy ADAS perception, the time and cost associated with creating and deploying the system would be greatly reduced. This study investigates the influence of lossy compression on multi-task deep learning models for real-time perception in ADAS employing fisheye cameras. We leverage a large-scale dataset and train a representative multi-task ADAS perception model for pedestrian, kerb, line, and soiling classification. The testing dataset is subjected to compression using the popular H.264 video codec at varying compression ratios. Through rigorous evaluation, we analyse the effects of compression on model performance, providing insights into the feasibility of employing lossy compression techniques in ADAS applications. Our results reveal that lossy compression could be deployed in automotive perception applications and that a compression ratio of up to 98% (720Mb/s to 12Mb/s), could be utilised with negligible performance degradation. © This is an open access article published by the IET under the Creative Commons Attribution License (http://***/licenses/by/3.0/)

关键词： Advanced driver assistance systems

来源：评论

学校读者我要写书评

暂无评论

Selection of Distance Measure for visual and Long Wave Infrared image Region Similarity using CNN Features 2

Selection of Distance Measure for Visual and Long Wave Infra...

引用

2nd International Conference on machine Learning and Data Engineering, ICMLDE 2023

作者： Kuppala, Kavitha Banda, Sandhya Imambi, S Sagar CSE Department KL University Guntur India CSE Department Maturi Venkata Subba Rao Engineering College Hyderabad India

Similarity computation between images or image regions is a necessary precursor for several vision-based applications, such as retrieval, registration, change detection etc. A two-channel convolutional neural network architecture is designed to retrieve an appropriate visual (vS) image representative from the repository, given as query a long-wave infrared (LWIR) image patch of the same region. Both the vS and LWIR image regions are described using pretrained convolution neural network models and images are ranked by computing the dis/similarity between the feature vectors. It is essential to evaluate and identify the suitable combination of feature descriptor and distance measure, when applied to LWIR and visual image similarity, as pre-trained CNNs such as vGG16, vGG19, MobileNet are not trained for LWIR images. RoadScene dataset which contains a pair of aligned long-wave infrared and visible images is used and performance of CNN features and distance measures is objectively evaluated using computation time, number of patches in top 5 and also in computing how close the retrieved LWIR patch is to the visual patch. Results demonstrate that Cosine similarity measure is relatively better when compared to all the other distance measures in addressing the spectral variations. © 2024 Elsevier B.v.. All rights reserved.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

Intensity and phase stacked analysis of a 40-OTDR system using deep transfer learning and recurrent neural networks

引用

APPLIED OPTICS 2023年第7期62卷 1753-1764页

作者： Kayan, Ceyhun Efe Aldogan, Kivilcim Yuksel Gumus, Abdurrahman Izmir Inst Technol Elect & Elect Engn Izmir Turkiye

Distributed acoustic sensors (DAS) are effective apparatuses that are widely used in many application areas for recording signals of various events with very high spatial resolution along optical fibers. To properly detect and recognize the recorded events, advanced signal processing algorithms with high computational demands are crucial. Convolutional neural networks (CNNs) are highly capable tools to extract spatial information and are suitable for event recognition applications in DAS. Long short-term memory (LSTM) is an effective instrument to process sequential data. In this study, a two-stage feature extraction methodology that combines the capabilities of these neural network architectures with transfer learning is proposed to classify vibrations applied to an optical fiber by a piezoelectric transducer. First, the differential amplitude and phase information is extracted from the phasesensitive optical time domain reflectometer (40-OTDR) recordings and stored in a spatiotemporal data matrix. Then, a state-of-the-art pre-trained CNN without dense layers is used as a feature extractor in the first stage. In the second stage, LSTMs are used to further analyze the features extracted by the CNN. Finally, a dense layer is used to classify the extracted features. To observe the effect of different CNN architectures, the proposed model is tested with five state-of-the-art pre-trained models (vGG-16, ResNet-50, DenseNet-121, MobileNet, and Inception-v3). The results show that using the vGG-16 architecture in the proposed framework manages to obtain a 100% classification accuracy in 50 trainings and got the best results on the 40-OTDR dataset. The results of this study indicate that pre-trained CNNs combined with LSTM are very suitable to analyze differential amplitude and phase information represented in a spatiotemporal data matrix, which is promising for event recognition operations in DAS applications. (c) 2023 Optica Publishing Group

关键词： Feature extraction Fiber optic cables machine vision Neural networks Signal processing Spatial resolution

来源：评论

学校读者我要写书评

暂无评论

Automatic Data processing for Space Robotics machine Learning 74

Automatic Data Processing for Space Robotics Machine Learnin...

引用

74th International Astronautical Congress, IAC 2023

作者： Sheppard, Anja Skinner, Katherine A. Department of Robotics University of Michigan 2505 Hayward St Ann ArborMI48109 United States

Autonomous terrain classification is an important problem in planetary navigation, whether the goal is to identify scientific sites of interest or to traverse treacherous areas safely. Past Martian rovers have relied on human operators to manually identify a navigable path from transmitted imagery. Our goals on Mars in the next few decades will eventually require rovers that can autonomously move farther, faster, and through more dangerous landscapes-demonstrating a need for improved terrain classification for traversability. Autonomous navigation through extreme environments will enable the search for water on the Moon and Mars as well as preparations for human habitats. Advancements in machine learning techniques have demonstrated potential to improve terrain classification capabilities for ground vehicles on Earth. However, classification results for space applications are limited by the availability of training data suitable for supervised learning methods. This paper contributes an open source automatic data processing pipeline that uses camera geometry to co-locate Curiosity and Perseverance Mastcam image products with Mars overhead maps via ray projection over a terrain model. In future work, this automated data processing pipeline will be leveraged for development of machine learning methods for terrain classification. Copyright © 2023 by the International Astronautical Federation (IAF). All rights reserved.

关键词： computer vision geographic information systems open source robotics space

来源：评论

学校读者我要写书评

暂无评论

Understanding DeepFool Adversarial Attack and Defense with Skater Interpretations

Understanding DeepFool Adversarial Attack and Defense with S...

引用

2023 International Conference on Wireless Communications, Signal processing and Networking, WiSPNET 2023

作者： Ramesh, Dhivyashri Sriram, Ishwarya Sridhar, Kavya Dunston, Snofy D Mary Anita Rajam, v. Ceg Campus Anna University Dept. of Cse Chennai India Centre for Cybersecurity College of Engineering Guindy Anna University Dept. of Computer Science and Engineering Chennai India

ISBN: (纸本)9798350300451

With the incorporation of artificial intelligence in businesses, particularly features like computer vision, it has become increasingly important to ensure the robustness of the models being used. A popular technique used to exploit machine learning models is an adversarial attack. Adversarial attacks mis-lead a predictive model by providing it with perturbed input. In the context of computer vision, it involves creating perturbations in an image to deceive a model. One such adversarial attack is the DeepFool attack, which aims to create the most minimal perturbations to an image to deceive the model. These attacks can also affect the way in which interpretations are made. In this paper, we analyze the DeepFool attack and its countermeasures on the ResNet-50 model running on the NIH malarial dataset. To assess the efficiency of the attack and subsequent adversarial training, we have used accuracy and loss. The nature and impact of the attack and adversarial training are analysed using skater, a model interpretation framework. The variations in the interpretations when adversarial attacks are in place are also analysed. © 2023 IEEE.

关键词： Computer vision

来源：评论

学校读者我要写书评

暂无评论

Towards Intelligent Auditing: Exploring the Future of Artificial Intelligence in Auditing 11th

Towards Intelligent Auditing: Exploring the Future of Artifi...

引用

11th International Conference on applications and Techniques in Cyber Intelligence, ATCI 2023

作者： Huang, Ling Liu, Dongbing School of Management Wuhan Technology and Business University Hubei Wuhan China College of Mathematics and Computer Panzhihua University Sichuan Panzhihua China

Recent years have witnessed an increasingly broad application of artificial intelligence (AI) technologies such as speech recognition, computer vision, natural language processing, machine learning, algorithmic framework, cognitive computing, deep learning and neural networks in the field of auditing, producing a far-reaching impact on traditional audit work. However, the application of AI technologies in auditing practices is still in its infancy stage and further exploration and development are needed. Based on an in-depth investigation of AI-powered auditing practices, this paper proposes four innovative paths towards intelligent auditing in response to the key problems and challenges in practices, namely, audit procedure design, audit data processing, audit approach transformation and audit model exploration, with a view of achieving full coverage of intelligent auditing and making it standardized, normalized, popularized and practically effective. These innovations will effectively advance the improvement in auditing competencies and promote the high-quality development of audit work. © 2024 The Authors. Published by Elsevier B.v.

关键词： Audit Approach Transformation Audit Data processing Audit Model Exploration Audit Procedure Design

来源：评论

学校读者我要写书评

暂无评论

Review of Surface-Defect Detection Methods for Industrial Products Based on machine vision

引用

IEEE ACCESS 2025年 13卷 90668-90697页

作者： Wang, Quan Wang, Mengnan Sun, Jiadong Chen, Deji Shi, Pei Wuxi Univ Sch IoT Engn Wuxi 214105 Peoples R China Nanjing Univ Informat Sci & Technol Sch Comp Sci Nanjing 210044 Peoples R China

Industrial defect detection is crucial for ensuring product quality and production efficiency, playing a pivotal role in advancing smart manufacturing. This paper reviews defect detection technologies for various industrial products, including metals, textiles, and printed circuit boards, and introduces an innovative classification system. It also offers a detailed analysis of recent developments and practical applications of large models in industry defect detection. First, the basic principles of industrial defect detection are outlined. The detection methods are then categorized into three main groups: traditional image processing, machine learning, and deep learning, with their principles, case studies, limitations, and future development directions analyzed. Traditional methods consist of image preprocessing, segmentation, and feature extraction. machine learning methods are divided into point-distance-based, hyperplane-based, tree-based, and neural network-based classification algorithms. Deep learning models are classified into two types: accuracy-oriented and efficiency-oriented. The paper organizes industrial defect datasets by type (multi-product and single-product), evaluates data quality and availability, and summarizes common evaluation metrics for accuracy, efficiency by task requirements. It also compares the latest methods on two public datasets to guide further research in defect detection. Real-world examples illustrate the end-to-end process, from data processing and hardware configuration to model training and deployment, while exploring the value and limitations of these technologies from the perspective of industry stakeholders. Finally, a systematic analysis of the key challenges and corresponding solutions is presented at the data and performance levels, and looks forward to the future direction of technological development, highlighting innovative paths and application potentials.

关键词： Defect detection image processing Deep learning Production Reviews machine vision Inspection Surface treatment Classification algorithms Hardware Industrial defect detection machine vision machine learning deep learning

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：