检索结果-内蒙古大学图书馆

2nd International Conference on Signal processing, Communication, Power and Embedded systems, SCOPES 2024

作者： Kumar, Ashis Sahoo, Ashis Kumar Bhaul, Nishant Kumar Nayak, Sidhant Rath, Adyasha Panda, Ganapati C. V. Raman Global University Bhubaneswar India

ISBN: (纸本)9798331506452

Automating the diagnosing process has shown promising results in recent years due to the advancements in deep learning approaches. In this work, we provide a unique skin disease diagnosis system that uses Convolutional Neural Networks (CNN) and Generative Adversarial Networks (GAN) to provide effective diagnosis. In our work, the system is trained on the HAM10000 dataset, which contains a diverse collection of skin lesion images categorized into seven classes, including melanoma, basal cell carcinoma, and others. GANs are used to address the challenges of dataset imbalance and missing data by generating realistic synthetic images that augment the existing dataset and CNNs are used for image pre-processing, image classification, etc. Utilizing these techniques, our proposed system achieves an impressive classification accuracy of 90.7%. In conclusion, using deep learning methods helps our research improve the skin disease diagnosis system and also gives medical professionals a useful tool for the accurate identification of skin diseases. © 2024 IEEE.

关键词： Convolution

来源：评论

学校读者我要写书评

暂无评论

Deep Learning-Driven Black Box Approach For image Segmentation

Deep Learning-Driven Black Box Approach For Image Segmentati...

引用

Advances in Computing, Communication and Networking (ICAC2N), International Conference on

作者： Nuthalapati Sudha Sai Bhargav Kasetty K. Sai Madhuri Jajala Nikitha Issac Neha Margret K. Rajakumar School of Engineering Mahindra University Hyderabad Telangana India School of Computer Science and Engineering V.I.T – University Vellore Tamilnadu India Dept. of Computer Science and Data Science BVRIT Narsapur Hyderabad Telangana India Dept. of CSE – AIML & IoT VNR VIET Hyderabad Telangana India

ISBN: (数字)9798350356816

ISBN: (纸本)9798350356823

The diagnosis of a range of eye disorders needs to categorize the retinal vessels. Computerized implementation of this process is becoming increasingly essential for automated screening systems for retinal diseases. To achieve a more accurate extraction of the retinal vessels, a new pre-processing step is proposed. These proposed pre-processes are also compared to other algorithms to assess their impact. The proposed pre-processing process consists of two phases. The first phase is the implementation and validation of the pre-processing modules, and the second phase is the implementation of these pre-processes onto the retinal vessels that were to be extracted. To achieve a significantly improved segmented vessel image, the proposed pre-process phase employs a common image-processing technique. In recent years, there has been a great deal of focus on retinal vessel identification studies, and the importance of assessing and confirming the findings of retinal vessel segmentation.

关键词： image segmentation Analytical models Accuracy Computational modeling Retinal vessels Diseases

来源：评论

学校读者我要写书评

暂无评论

video Smoke Detection Algorithm Based on a Spatial-Temporal Neural Network Model

Video Smoke Detection Algorithm Based on a Spatial-Temporal ...

引用

6th International Conference on Intelligent Computing and Signal processing (ICSP)

作者： Zhen Cao Xi Zhang Ministry of Emergency Management Shenyang Fire Science and Technology Research Institute Shengyang China

ISBN: (数字)9798350376548

ISBN: (纸本)9798350376555

a kind of spatial-temporal neural network video smoke detection algorithm is proposed in order to solve the problems associated with the incorrect classification of the static approximate smoke background in the face of the detection of smoke in video detection networks, and the problem of false alarms and of the original test model algorithms being different in different detection environments. Based on the original YOLO v4 neural network algorithm, this paper introduces a k-means + + algorithm and genetic algorithm, while using the algorithm's clustering function to classify the sample points of the real boxes of the image data set, which make it a more suitable anchor. At the same time, the genetic algorithm is used to adjust its anchor in order to allow the generated anchor to adapt to the needs related to smoke detection. In the original neural network model, the dual-stream network model algorithm is used to extract information from the first step of the YOLO algorithm in order to further filter the smoke's characteristics as well as filter out error information, all to improve the detection capabilities of the overall neural network for video smoke fog images. Compared with traditional YOLOv4 networks, the algorithm obtained by the model algorithm has been improved by 8.51°/0. In actual tests, the alarm time requirements of the smoke alarm test program for early fire monitoring and the alarm systems for visual images were improved, and the detection accuracy of the network was also improved based on the assurance of the detection speed, while the performance of the model algorithm was also improved for different scenes.

关键词： YOLO image recognition Neural networks Clustering algorithms Lighting Filtering algorithms Approximation algorithms Information filters Classification algorithms Genetic algorithms

来源：评论

学校读者我要写书评

暂无评论

From Linear to Nonlinear Unfolded Condat-vú Algorithm for Spectro-Polarimetric Hight-Constrast image Recovery

From Linear to Nonlinear Unfolded Condat-Vú Algorithm for S...

引用

European Signal processing Conference (EUSIPCO)

作者： E. Chappon N. Pustelnik J. Tachella L. Denneulin A. Ferrari M. Langlois Laboratoire de Physique ENSL CNRS UMR 5672 Lyon France Laboratoire de Recherche de l'EPITA EPITA Le Kremlin-Bicêtre France Universite Côte d'Azur Observatoire de la Côte d'Azur CNRS Lab. J.-L. Lagrange France Univ Lyon Univ. Lyon 1 ENS de Lyon CNRS Centre de Recherche Astrophysique de Lyon UMR5574 Saint-Genis-Laval France

ISBN: (数字)9789464593617

ISBN: (纸本)9798331519773

Studying circumstellar environments is crucial for understanding exoplanets and stellar systems. Instruments like SPHERE can extract information about these environments by leveraging advanced image reconstruction methods, possibly based on deep learning. This work focuses on unfolded proximal neural networks based on Condat- vii iterations and proposes a new nonlinear formulation. To evaluate and compare the performance of the proposed reconstruction strategies, two datasets dedicated to circumstellar environments analysis in the context of high-contrast imagery have been created offering different level of complexity in the evaluation of the performance.

关键词： Deep learning Training image color analysis Neural networks Signal processing algorithms Signal processing image reconstruction Standards Synthetic data Finite difference methods

来源：评论

学校读者我要写书评

暂无评论

Optimized double transformer residual super-resolution network-based X-ray images for classification of pneumonia identification

引用

KNOWLEDGE-BASED systems 2025年 311卷

作者： Prasath, G. Jerald Prabu, S. Mayil, v. valli Saini, Sumit T John Inst Technol Dept Comp Sci & Engn Bangalore India SRM Inst Sci & Technol Sch Comp Tiruchirapalli Campus Tiruchirapalli Tamil Nadu India Koneru Lakshmaiah Educ Fdn Dept Comp Sci & Engn Guntur 522302 Andhra Pradesh India Cent Univ Haryana Dept Elect Engn Mahendergarh India

Pneumonia is an infectious disease characterized by inflammation of the lungs' air sacs, which results in the accumulation of fluid or pus. Medical images is important for the timely identification and precise diagnosis of illnesses;chest X-rays are a commonly utilized modality for respiratory disorders including pneumonia. In this research, optimized double transformer residual super-resolution network-related chest x-ray imageries for the classification of pneumonia identification (DTRSN-XRI-CPI). The procedure involves pre-processing the input image using region-aware neural graph collaborative filtering (RNGCF) to reduce noise, enhance contrast, and eliminate high and low frequencies from the collected dataset. Next, the Synchro-squeezed fractional wavelet transform (SFWT) is utilized for the feature extraction to extract color features such as color, shape, spatial, texture, and relation from the image. Hence, the weight parameters for DTRSN are optimized using the Hunter Prey Optimization algorithms (HPOA). Then the DTRSN-XRI-CPI is implemented in Python and the performance metrics like precision, accuracy, recall, specificity, F1-score, and ROC are analysed. The performance of the DTRSN-XRI-CPI approach attains 20.7 %, 22.6 % and 30.5 % higher accuracy;21.8 %, 29.3 % and 30.5 %higher precision and 21.8 %, 29.5 % and 32.6 % higher recall when analysed through existing an intelligent computational framework based on deep learning for the identification and classification of pneumonia illness (ICPDDL-ICF), an adaptive and altruistic deep feature selection approach based on PSO for pneumonia detection from chest X-rays (APSO-DFSM-PDCX) and a deep learning system that uses explainable AI (DLAIB-PI-EAI) techniques respectively.

关键词： Pneumonia Hunter-prey optimization algorithms Region-aware neural graph collaborative filtering Double transformer residual super-resolution network and synchrosqueezed fractional wavelet transform

来源：评论

学校读者我要写书评

暂无评论

Why are visually-grounded language models bad at image classification? 24

Why are visually-grounded language models bad at image class...

引用

Proceedings of the 38th International Conference on Neural Information processing systems

作者： Yuhui Zhang Alyssa Unell Xiaohan Wang Dhruba Ghosh Yuchang Su Ludwig Schmidt Serena Yeung-Levy Stanford University University of Washington Tsinghua University Stanford University and University of Washington

ISBN: (纸本)9798331314385

image classification is one of the most fundamental capabilities of machine vision intelligence. In this work, we revisit the image classification task using visually-grounded language models (vLMs) such as GPT-4v and LLavA. We find that existing proprietary and public vLMs, despite often using CLIP as a vision encoder and having many more parameters, significantly underperform CLIP on standard image classification benchmarks like imageNet. To understand the reason, we explore several hypotheses concerning the inference algorithms, training objectives, and data processing in vLMs. Our analysis reveals that the primary cause is data-related: critical information for image classification is encoded in the vLM's latent space but can only be effectively decoded with enough training data. Specifically, there is a strong correlation between the frequency of class exposure during vLM training and instruction-tuning and the vLM's performance in those classes; when trained with sufficient data, vLMs can match the accuracy of state-of-the-art classification models. Based on these findings, we enhance a vLM by integrating classification-focused datasets into its training, and demonstrate that the enhanced classification performance of the vLM transfers to its general capabilities, resulting in an improvement of 11.8% on the newly collected imageWikiQA dataset. Project page: https://***/vLMClassifier-Website/.

关键词：

来源：评论

学校读者我要写书评

暂无评论

Assessing Machine Learning algorithms for Real-Time Fake Currency Detection

Assessing Machine Learning Algorithms for Real-Time Fake Cur...

引用

Sustainable Communication Networks and Application (ICSCNA), International Conference on

作者： Ajanthaa Lakkshmanan Revanth Sai Grandhi v. Girish Department of Computing Technologies School of Computing SRM Institute of Science and Technology Chennai India

ISBN: (数字)9798331530013

ISBN: (纸本)9798331530020

Counterfeiting poses a significant threat as the circulation of fake currency diminishes the value of genuine notes, thereby disrupting the country's economic stability. Addressing this issue is critical before the proliferation of counterfeit notes becomes unmanageable. Manual detection methods are often unreliable since counterfeit notes are produced with materials and inks that closely resemble the original. Across the country, counterfeit detection is typically carried out using hardware-based systems. However, these methods are time-consuming and struggle to process large volumes efficiently. To address these challenges and streamline the process of counterfeit currency detection, this study proposes an image processing-based computational technique. The objective is to accurately determine whether a given note is genuine or counterfeit, with a high prediction rate. This detection is enhanced using deep learning algorithms, which analyze key attributes such as color, form, paper thickness, serial numbers, and image filters on the currency. The proposed model is trained on a real-time dataset consisting of both genuine and counterfeit notes. Experimental results demonstrate that this method achieves an overall accuracy of 96.41%.

关键词： Deep learning Machine learning algorithms Manuals Computer architecture Streaming media Feature extraction Real-time systems Convolutional neural networks Counterfeiting Currencies

来源：评论

学校读者我要写书评

暂无评论

A wandb based corrosion detection of metals through live video analytics

A wandb based corrosion detection of metals through live vid...

引用

IoT, Communication and Automation Technology (ICICAT), International Conference on

作者： John Deva Prasanna D S Kuruvella Sai Neeraj G. Jaya Krishna Department of DSBS Faculty of Engineering and Technology SRM Institute of Science and Technology Department of DSBS SRM Institute of Science and Technology

ISBN: (数字)9798350368109

ISBN: (纸本)9798350368116

Detection of corrosion in moving objects like ships is challenging due to the dynamic nature of the input image. Existing machine learning techniques are suitable for static images and the algorithms suffer in performance when is a live video. In this paper, image processing for detecting corrosion using YOLOv8 which more suitable for processing live videos as speed and accuracy is better. This makes YOLO v8 for corrosion detection in live videos. In addition, Weights and Biases (W&B). is used in the algorithm as it is pivotal in establishing the connections between neurons and biases helps in circumventing flexible inputs. By combining YOLOv8's and W&B approach the accuracy and efficiency of corrosion detection systems is improved. This can ultimately assist in better maintenance and preservation of essential infrastructure resources.

关键词： Industries YOLO Technological innovation Accuracy Corrosion visual analytics Real-time systems Safety Maintenance Sustainable development

来源：评论

学校读者我要写书评

暂无评论

Feature Map Guided Adapter Network for Object Detection in Low-light Conditions

Feature Map Guided Adapter Network for Object Detection in L...

引用

IEEE International Symposium on Circuits and systems (ISCAS)

作者： Cong Pang Wei Zhou Haoyan Li Xiangyu Zhang Xin Lou School of Information Science and Technology ShanghaiTech University Key Laboratory of Intelligent Perception and Human-Machine Collaboration Ministry of Education Shanghai China

ISBN: (数字)9798350330991

ISBN: (纸本)9798350331004

Conventional ISP pipelines and image enhancement methods are designed and optimized for human vision, creating a gap between the requirements of computer and human visions. To bridge the requirement gap, we present a co-design framework in which backend computer vision plays a pivotal role in shaping the proceeding image processing algorithm. It features a pre- processing adapter network, responsible for the restoration and enhancement of RAW images from computer vision perspective, especially in challenging environmental conditions. Specifically, we extract feature maps from the backend vision network, utilizing them as constraints for optimizing the preprocessing adapter network. To validate the effectiveness of our proposed framework, we employ object detection in low-light conditions as the computer vision task, with YOLO-v5 as the backbone. Given the considerable noise in low-light images, we compare our results with state-of-the-art denoising algorithms, showcasing the superior performance of our framework.

关键词： Computer vision Circuits and systems Noise reduction Noise Pipelines Object detection Feature extraction

来源：评论

学校读者我要写书评

暂无评论

Breast Cancer Detection Using Texture Features and KNN Algorithm 20th

Breast Cancer Detection Using Texture Features and KNN Algor...

引用

20th International Conference on Hybrid Intelligent systems, HIS 2020 and 12th World Congress on Nature and Biologically Inspired Computing, NaBIC 2020

作者： Murugan, Tevar Durgadevi Kanojia, Mahendra G. Sheth L.U.J. & Sir M.V. College MumbaiMaharashtra India

ISBN: (纸本)9783030730499

Breast Cancer is the most common form of cancer in women, majorly occurring in the age group of 40–70 years and the second most common cancer worldwide. There are several advances in image processing techniques and machine learning algorithms which aids the medical domain. The image processing works with image enhancement and object localization. Machine learning algorithms input image features to train the breast cancer detection model. It is important to extract the image features accurately to achieve promising results. This paper gives detailed insights of histopathological image features describing their technical and usability aspects. The study covers the whole spectrum of the histopathological images which include Haralick texture features and KNN Algorithm using Dimension Reduction Algorithm (LDA and PCA) for the detection of breast cancer. © 2021, The Author(s), under exclusive license to Springer Nature Switzerland AG.

关键词： image enhancement

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：