Search Results - Inner Mongolia University Library

Conference on Optical Metrology and Inspection for Industrial Applications X

Authors: Voronin, V. Gapon, N. Zhdanova, M. Semenishchev, E. Zelensky, A. Moscow State Univ Technol STANKIN Ctr Cognit Technol & Machine Vis Moscow Russia Don State Tech Univ Rostov Na Donu Russia Sci Mfg Complex Technol Ctr Zelenograd Russia

ISBN: (print) 9781510667877; 9781510667884

We present a new haze removal algorithm based on attention map-guided multi-scale image processing. The proposed method is based on the frequency-domain coefficient correction of a set of images followed by their fusion based on the Laplacian pyramid. A new stage is presented in obtaining a local-global estimate of high-contrast images, also used in the attention map-guided fusion model. The algorithm consists of the following steps: gamma correction with different gamma parameters; the weight map calculation by multiplying the saturation, contrast, and attention for each image; decomposition of the weight map into a Gaussian pyramid; 3-D block-rooting enhancement; decomposition of images after 3-D block-rooting and gamma correction into the Laplacian pyramid; merging by multiplying multi-scale images and weights. The experimental results on the D-HAZE dataset confirmed the high efficiency of the proposed enhancement method compared to the state-of-the-art techniques for industrial inspection systems.
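A rough illustration of the first, second, and last steps listed in the abstract (gamma-corrected variants, weight maps formed as a product of saturation, contrast, and attention, and weighted merging) might look like the sketch below. It is not the authors' implementation: it fuses at a single scale only, omits the Gaussian/Laplacian pyramids and the 3-D block-rooting stage, uses a constant placeholder for the attention map, and the gamma values and the function names (`gamma_correct`, `weight_map`, `fuse`, `dehaze_sketch`) are assumptions made for illustration.

```python
from statistics import pstdev


def gamma_correct(img, gamma):
    """Raise every channel of an RGB image (values in [0, 1]) to the power gamma."""
    return [[tuple(c ** gamma for c in px) for px in row] for row in img]


def weight_map(img, attention=None, eps=1e-6):
    """Per-pixel weight = saturation * contrast * attention.

    Saturation is the spread across R, G, B; contrast is the absolute
    4-neighbour Laplacian of the grey image. The attention map is a
    placeholder (all ones) unless the caller supplies one.
    """
    h, w = len(img), len(img[0])
    grey = [[sum(px) / 3.0 for px in row] for row in img]
    weights = []
    for y in range(h):
        row = []
        for x in range(w):
            sat = pstdev(img[y][x])
            up = grey[max(y - 1, 0)][x]
            down = grey[min(y + 1, h - 1)][x]
            left = grey[y][max(x - 1, 0)]
            right = grey[y][min(x + 1, w - 1)]
            con = abs(up + down + left + right - 4.0 * grey[y][x])
            att = 1.0 if attention is None else attention[y][x]
            # eps keeps the weights strictly positive so normalisation is safe
            row.append((sat + eps) * (con + eps) * att)
        weights.append(row)
    return weights


def fuse(images, weight_maps):
    """Single-scale fusion: per-pixel weighted average with normalised weights."""
    h, w = len(images[0]), len(images[0][0])
    fused = []
    for y in range(h):
        row = []
        for x in range(w):
            total = sum(wm[y][x] for wm in weight_maps)
            row.append(tuple(
                sum(wm[y][x] * im[y][x][c] for im, wm in zip(images, weight_maps)) / total
                for c in range(3)
            ))
        fused.append(row)
    return fused


def dehaze_sketch(img, gammas=(1.0, 2.0, 3.0)):
    """Build gamma-corrected variants, weight them, and fuse them (assumed gammas)."""
    variants = [gamma_correct(img, g) for g in gammas]
    maps = [weight_map(v) for v in variants]
    return fuse(variants, maps)
```

Because the weights are normalised per pixel, each fused value is a convex combination of the corresponding values in the gamma-corrected variants. A multi-scale version would blend Laplacian-pyramid levels of the images with Gaussian-pyramid levels of the weights instead.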

Keywords: image enhancement image haze removal multi-scale processing frequency-domain transform industrial inspection attention map-guided

Low-Cost Hardware-Accelerated Vision-Based Depth Perception for Real-Time Applications

International Conference on Computer Vision and Machine Intelligence, CVMI 2022

Authors: Aditya, N.G. Dhruval, P.B. Shylaja, S.S. Katharguppe, Srinivas Department of Computer Science PES University Karnataka Bengaluru 560085 India

ISBN: (print) 9789811978661

Depth estimation and 3D object detection are critical for autonomous systems to gain context of their surroundings. In recent times, compute capacity has improved tremendously, enabling computer vision and AI on the edge. In this paper, we harness the power of CUDA and OpenMP to accelerate ELAS (a stereoscopic vision-based disparity calculation algorithm) and 3D projection of the estimated depth while performing object detection and tracking. We also examine the utility of Bayesian inference in achieving real-time object tracking. Finally, we build a drive-by-wire car equipped with a stereo camera setup to test our system in the real world. The entire system has been made public and easily accessible through a Python module. © 2023, The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
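The step from an estimated disparity to a 3D point can be sketched with the standard rectified pinhole stereo relations. This is not the authors' CUDA/OpenMP code or the ELAS algorithm itself, only the geometry applied to a disparity value once it is known. The function names and the camera parameters in the usage note (focal length, baseline, principal point) are assumed values for illustration.

```python
def disparity_to_depth(disparity_px, focal_px, baseline_m):
    """Depth Z = f * B / d for a rectified stereo pair.

    Returns None for non-positive disparities, which carry no valid depth.
    """
    if disparity_px <= 0:
        return None
    return focal_px * baseline_m / disparity_px


def backproject(u, v, depth_m, focal_px, cx, cy):
    """Project pixel (u, v) with known depth to camera coordinates (X, Y, Z).

    Assumes square pixels (one focal length) and principal point (cx, cy).
    """
    x = (u - cx) * depth_m / focal_px
    y = (v - cy) * depth_m / focal_px
    return (x, y, depth_m)
```

For example, with an assumed focal length of 700 px and a 0.5 m baseline, a disparity of 35 px corresponds to a depth of 10 m; halving the disparity doubles the depth, which is why distant objects are the hardest to range accurately.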

Keywords: Stereo image processing

16th International Workshop on Design and Architectures for Signal and Image Processing, DASIP 2023

ISBN: (print) 9783031299698

The proceedings contain 9 papers. The special focus in this conference is on Design and Architectures for Signal and Image Processing. The topics include: Brain Blood Vessel Segmentation in Hyperspectral Images Through Linear Operators; Neural Network Predictor for Fast Channel Change on DVB Set-Top-Boxes; AINoC: New Interconnect for Future Deep Neural Network Accelerators; Real-Time FPGA Implementation of the Semi-global Matching Stereo Vision Algorithm for a 4K/UHD Video Stream; TaPaFuzz - An FPGA-Accelerated Framework for RISC-V IoT Graybox Fuzzing; Adaptive Inference for FPGA-Based 5G Automatic Modulation Classification; High-Level Online Power Monitoring of FPGA IP Based on Machine Learning.

Advanced Encryption Standard (AES) Based Robust Watermarking Scheme

2nd IEEE International Conference on Computer Vision and Machine Intelligence, CVMI 2023

Authors: Rai, Deepak Singh Rajput, Shayam Arya, K.V. Pranav, Prince Singh, Shikha Sinha, Aditi School of Cse and Technology Bennett University Greater Noida India National Institute of Technology Patna Department of Cse Patna India ABV-IIITM Gwalior Department of Cse Gwalior India

ISBN: (print) 9798350305142

Digital watermarking is a widely used technique for embedding information into digital media to protect intellectual property rights. However, digital watermarks are vulnerable to various types of malicious attacks. In this paper, we propose an AES-based watermarking scheme to further improve the security and robustness of an existing watermark technique. The proposed scheme involves embedding the original watermark into the host image using a watermarking technique, and then encrypting the embedded watermark using the AES algorithm. The watermarked image is then subjected to a series of attacks and then is decrypted using the same AES algorithm. To assess the performance of the proposed approach in terms of security and robustness, it is implemented and tested on a set of images. The experimental findings demonstrate that, while maintaining a low distortion rate, the proposed technique offers high robustness against various attacks. The proposed AES-based watermarking scheme can be considered as an effective and secure solution for protecting digital watermarks in various applications. © 2023 IEEE.

Keywords: Cryptography

Barriers to Integrated Flotation Control Solutions: Lessons from an Industrial Implementation

22nd World Congress of the International Federation of Automatic Control (IFAC)

Authors: Oosthuizen, Daniel J. Williams, Bradley A. van der Spuy, Daniel D. V. Proc IQ Perth Australia

ISBN: (print) 9781713872344

Different aspects of froth flotation have received varying levels of interest from the automation community over the past 30 years. Model-based level stabilisation and masspull based grade control strategies continue to deliver significant benefit to industry. However, industry seems slow to adopt the use of image processing applications and more comprehensive flotation models in industrial Advanced Process Control (APC) applications - despite the benefits reported in the literature. In this paper an industrial flotation control system that includes basic visual froth imaging functionality is presented as a case study, to highlight some of the challenges experienced, and to identify reasons why integrated industrial APC implementations including advanced models and machine learning components remain scarce. Copyright (c) 2023 The Authors.

Keywords: System identification and modelling Advanced process control Machine learning methods and applications

Transfer Learning Models for CNN Fusion With Fisher Vector for Codebook Optimization of Foreground Features

IEEE ACCESS, 2024, vol. 12, pp. 5648-5658

Authors: Kamaleldin, Mohamed Gamal M. Abu-Bakar, Syed A. R. Sheikh, Usman Ullah Arab Acad Sci Technol & Maritime Transport Elect & Commun Engn Dept Cairo 11799 Egypt Univ Teknol Malaysia Sch Elect Engn Comp Vis Video & Image Proc Res Lab Skudai 81300 Johor Malaysia Univ Teknol Malaysia Fac Elect Engn Johor Baharu Malaysia

Human action recognition has become one of the main topics in the computer vision field due to its high demand and competitiveness in real-world applications. The main goals of human action recognition are to improve classification accuracy and reduce computational complexity. Previous studies have mainly used two approaches: the hand-crafted feature extraction approach and the deep learning approach. The hand-crafted approach is simple, which confers it with an added advantage in terms of computational complexity. However, this method is low in accuracy. Conversely, the deep learning approach achieves high accuracy even for complex datasets, but it suffers in terms of computational complexity and long training time as it needs to process huge datasets during training. Other approaches include the use of pre-trained deep learning networks to fuse both methods. In this paper, we will introduce a combination of pre-trained convolutional neural networks (CNN) to extract features, an improved Fisher vector (IFV) codebook, and an optimized support vector machine (SVM) to achieve improved human action recognition. We leveraged three pre-trained CNNs, namely, Inception-ResNet-v2, NASNet-Large, and Xception, to extract the features. Then, we applied the improved Fisher vector codebook to encode them. We subsequently trained the codebook using SVM for classification and readjusted the SVM weights using five different optimization techniques, which are SGD, Adadelta, ADAM, Adamax, and Nadam. To evaluate the performance, we utilized UCF101 and HMDB51 datasets. The results demonstrate that the accuracy and computational complexity of our approach are comparable to state-of-the-art techniques.

Keywords: Human action recognition pre-trained convolutional neural networks long short-term memory (LSTM) features encoding optimization

Enhanced Magnetic Resonance Imaging for Accurate Classification of Benign and Malignant Brain Cells

5th IEEE International Conference for Emerging Technology, INCET 2024

Authors: Gowda, Dankan V. Kumar, Pullela S.V.V.S.R. Prasad, K.D.V. Ashreetha, B. Kumar, Mekala Bharath Karthikeya, Karanam Department of Electronics and Communication Engineering BMS Institute of Technology and Management Karnataka Bangalore India Department of Computer Science & Engineering Aditya College of Engineering Andhra Pradesh Surampalem India Symbiosis Institute of Business Management Hyderabad India Pune India Department of Electronics and Communication Engineering School of Engineering Mohan Babu University Andhra Pradesh Tirupati India Department of Electronics and Communication Engineering Sree Vidyanikethan Engineering College Andhra Pradesh Tirupati India

ISBN: (print) 9798350361155

The difficulty in differentiating between the normal and cancerous cells in the brain through the frequent magnetic resonance imaging approaches is one of the major obstacles to realization of diagnostic precision. The presented work reveals a new MRI image processing technology, which includes an original software that contains complex algorithms and trained machine learning models, as the programs that make the images much better than before. Carefully calibrated computer vision dataset which features brain scans is subjected to the well-defined novel approach, whose performance is compared with the traditional MRI methods by calculating multiple metrics such as classification accuracy, sensitivity, specificity, ROC are and so on. This paper is the climax of a deep cognition of various cell types revealed by the newest MRI method which presents a better contrast than the old techniques. Next, the quality in the detection and recognition of the tumour after, comparison of these modalities displays that the modality of higher resolution, the ability to detect the tumour earlier and better. Such technological improvements in MRI machines will enable the surgeons to identify the growth of tumors at the early stages that will lay the right groundwork for the design of personalized treatment plans and also will have positive impact on the lives of the patients. Through this effect, the level of quality followed by MR imaging has been improved as well as arising new alliances between major imaging companies and machine learning technologies. It can be thought as the border - eraser diagnostics and imaging which will obey medical laws. Therefore, it states that the present hypothetical world should be improved while the advanced and proven diagnostics systems should be developed. The daily clinical applications of these advanced MRIs may well be the beginning of a new era in diagnostic oncology, which will be a very important way forward in improving treatment combined wi

Keywords: Magnetic resonance imaging

Beyond clean data: Exploring the effects of label noise on object detection performance

KNOWLEDGE-BASED SYSTEMS, 2024, vol. 304

Authors: Freire, Agostinho Silva, Leandro H. de S. de Andrade, Joao V. R. Azevedo, George O. A. Fernandes, Bruno J. T. Univ Pernambuco Escola Politecn Pernambuco R Benf 455 BR-50720001 Recife Pe Brazil Fed Inst Paraiba Unidade Academ Area Ind R Jose Leoncio Silva 300 Lot Jardim Oasis BR-58900000 Cajazeiras Paraiba Brazil

In recent years, the growth of large-scale datasets has significantly propelled the progress of deep learning applications. Yet, annotating these datasets remains a labor-intensive endeavor, pushing the reliance on cost-effective but less specialized data collection methods and internet data sources. This often results in noisy and inaccurate labels, compromising data quality. Traditional machine learning models assume clean data, but real-world datasets often exhibit significant label noise. This paper examines the impact of such noise on object detection performance, a pivotal aspect of computer vision. We analyze the influence of noisy labels using three renowned object detection frameworks: YOLOv5, Faster R-CNN, and the recent YOLOv8, alongside established datasets: MS COCO, VOC, and ExDARK. Additionally, experiments with the UvM dataset explore domain-specific tasks in dense object scenarios. Two new metrics - Model Health and Detection Capability - were introduced to evaluate the results. Findings indicate that models maintain over 80% of their health (a 20% decline in mAP from the baseline) with up to 40% label corruption. However, Detection Capability deteriorates more sharply under the same conditions. The research also employs the D-RISE method for model explainability, highlighting crucial image regions affecting detection outcomes. Despite the noise, critical detection areas in models remain similar to those in clean data up to the 40% corruption level, as verified by similarity metrics. This study underscores the resilience of object detection models to label noise and provides insights into maintaining performance amidst data quality challenges.
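The abstract's parenthetical ("80% of their health" corresponding to "a 20% decline in mAP from the baseline") suggests that Model Health can be read as the ratio of mAP under noisy labels to mAP on clean labels. The sketch below works under that assumption, which the abstract does not state as a formula, and pairs it with a simple class-flipping routine to show what "40% label corruption" could mean in practice. The paper may use other noise types (for example box perturbations); the function names and the seeded sampling scheme are illustrative only.

```python
import random


def corrupt_labels(labels, classes, fraction, seed=0):
    """Flip the class of round(fraction * n) randomly chosen labels.

    Each selected label is replaced by a different class, so exactly that
    many labels end up wrong. Assumes at least two classes.
    """
    rng = random.Random(seed)
    noisy = list(labels)
    k = round(fraction * len(noisy))
    for i in rng.sample(range(len(noisy)), k):
        alternatives = [c for c in classes if c != noisy[i]]
        noisy[i] = rng.choice(alternatives)
    return noisy


def model_health(map_noisy, map_baseline):
    """Assumed reading of the metric: fraction of baseline mAP that is retained."""
    if map_baseline <= 0:
        raise ValueError("baseline mAP must be positive")
    return map_noisy / map_baseline
```

Under this reading, a detector whose mAP falls from 0.50 on clean labels to 0.40 when trained on corrupted labels would have a Model Health of 0.8.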

Keywords: Object detection Label noise Deep learning Data corruption Explainability

Granular Privacy Control for Geolocation with Vision Language Models

2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024

Authors: Mendes, Ethan Chen, Yang Hays, James Das, Sauvik Xu, Wei Ritter, Alan Georgia Institute of Technology United States Carnegie Mellon University United States

ISBN: (print) 9798891761643

Vision Language Models (VLMs) are rapidly advancing in their capability to answer information-seeking questions. As these models are widely deployed in consumer applications, they could lead to new privacy risks due to emergent abilities to identify people in photos, geolocate images, etc. As we demonstrate, somewhat surprisingly, current open-source and proprietary VLMs are very capable image geolocators, making widespread geolocation with VLMs an immediate privacy risk, rather than merely a theoretical future concern. As a first step to address this challenge, we develop a new benchmark, GPTGEOCHAT, to test the capability of VLMs to moderate geolocation dialogues with users. We collect a set of 1,000 image geolocation conversations between in-house annotators and GPT-4V, which are annotated with the granularity of location information revealed at each turn. Using this new dataset we evaluate the ability of various VLMs to moderate GPT-4V geolocation conversations by determining when too much location information has been revealed. We find that custom fine-tuned models perform on par with prompted API-based models when identifying leaked location information at the country or city level; however, fine-tuning on supervised data appears to be needed to accurately moderate finer granularities, such as the name of a restaurant or building. © 2024 Association for Computational Linguistics.

Keywords: Visual languages

Automatic target detection utilizing an Edge IR Vision Transformer (EIR-ViT)

Conference on Automatic Target Recognition XXXIII

Authors: Adams, Ethan R. Depoian, Arthur C., II Kurz, Aidan G. Bailey, Colleen P. Guturu, Parthasarathy Univ North Texas Dept Elect Engn Denton TX 76207 USA

ISBN: (print) 9781510661561; 9781510661578

The detection and recognition of targets within imagery and video analysis is vital for military and commercial applications. The development of infrared sensor devices for tactical aviation systems imagery has increased the performance of target detection. Due to advances in infrared sensor capabilities, their use for field operations such as visual operations (visops) or reconnaissance missions that take place in a variety of operational environments has become paramount. Many techniques implemented stretch back to 1970, but were limited due to computational power. The AI industry has recently been able to bridge the gap between traditional signal processing tools and machine learning. Current state-of-the-art target detection and recognition algorithms are too bloated to be applied for on ground or aerial mission reconnaissance. Therefore, this paper proposes Edge IR Vision Transformer (EIR-ViT), a novel algorithm for automatic target detection utilizing infrared images that is lightweight and operates on the edge for easier deployability.

Keywords: Automatic Target Detection Vision Transformer Infrared Imaging Object Detection FLIR image FLIR dataset Edge Computing
