检索结果-内蒙古大学图书馆

A skin lesion classification method based on expanding the surrounding lesion-shaped border for an end-to-end Inception-ResNet-v2 classifier

引用

SIGNAL image AND vIDEO processing 2023年第7期17卷 3525-3533页

作者： Dakhli, Rym Barhoumi, Walid Univ Tunis El Manar Inst Super Informat Lab Rech Informat Modelisat & Traitement Informat Res Team Intelligent Syst Imaging & Artificial Vis 2 Rue Abou Rayhane Bayrouni Ariana 2080 Tunisia Univ Carthage Ecole Natl Ingenieurs Carthage Tunis 2035 Tunisia

The latest computer vision and machine learning technologies have introduced various computer-aided diagnosis (CAD) systems to automate the early diagnosis of skin lesions. Nevertheless, improvements made by CAD systems are not optimal because of the similarity in the appearance of skin lesions of different classes as well as the limitations of segmentation. In addition, according to dermatologists, the shape of the lesion and its infiltration into the surrounding skin are decisive information for the diagnosis. Inspired by this idea, the proposed method is based on gradually expanding the lesion border by including the lesion-shaped border area in order to associate the input image with the corresponding skin cancer type using an end-to-end Inception-ResNet-v2 classifier. The main contribution of this work lies in investigating the Inception-ResNet-v2 model exclusively on the expanded lesion-shaped border. In fact, the obtained results showed that the proposed method is effective in achieving precision rates of 95.6% and 97.26% on HAM10000 and PH2 datasets, respectively.

关键词： Skin lesion classification Border region Lesion shape Inception-ResNet-v2 Soft attention

来源：评论

学校读者我要写书评

暂无评论

vision Based Runway Identification with marked or unmarked terrain for Automatic Landing applications of UAv 8

Vision Based Runway Identification with marked or unmarked t...

引用

8th Conference on Advances in Control and Optimization of Dynamical Systems (ACODS)

作者： Tripathi, Amit Kumar Patel, vijay v. Padhi, Radhakant Aeronaut Dev Agcy Bangalore Karnataka India Indian Inst Sci Fac Aerosp Engn Dept Bangalore Karnataka India

vision based runway identification using 'marked or unmarked terrain' image sequences captured from a fixed wing unmanned aerial vehicle through onboard stereovision sensor is presented in this paper. An innovative convolutional neural netwok (CNN) based YOLO-v8 object detection algorithm is used to detect the runway during approach segment of UAv. This deep learning algorithm detects the region of interest in real time and in a computationally efficient manner. The captured unknown road segment or runway image frames are processed and examined for width, length, level and smoothness aspects to qualify as a suitable runway for UAv landings. Also, it is ensured that there are no obstacles, patches or holes on the detected road or runway. Runway start and end threshold lines and regions, touchdown point and runway edge lines are considered as the region of interest. image processing algorithms are applied on the captured runway or road images to detect strong features in the region of interest. Feature detector based image processing algorithm with stereo vision constraint is used to establish the relation between unmanned aerial vehicle's center of gravity and detected runway feature points image processing algorithms like hough line detection, RANSAC, Oriented FAST and Rotated BRIEF (ORB), median filters, morphological methods are applied to extract terrain features. Based on the detected runway orientation and position with respect to UAv position. An automatic landing manoeuvre is performed by UAv autopilot to land the UAv on intended touchdown point on runway computed through detected feature points.

关键词： image processing Runway Identification Feature detection Region of Interest threshold lines touch down point YOLOv8 ORB CNN runway width slant distance

来源：评论

学校读者我要写书评

暂无评论

A Practical Approach to Tracking Estimation Using Object Trajectory Linearization

引用

INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS 2024年第1期17卷 175-175页

作者： Yousefi, Seyed Mohammad Mehdi Mohseni, Seyed Saleh Dehbovid, Hadi Ghaderi, Reza Islamic Azad Univ Elect Engn Dept Nour Branch Mazandaran Iran Shahid Beheshti Univ Dept Elect Engn Tehran Iran

In the field of image processing and machine vision, object tracking is a significant and rapidly developing subfield. The numerous potential applications of object tracking have garnered much attention in recent years. The effectiveness of tracking and detecting moving targets is directly related to the quality of motion detection algorithms. This paper presents a new method for estimating the tracking of objects by linearizing their trajectories. Estimating the movement paths of objects in dynamic and complex environments is one of the fundamental challenges in various fields, such as surveillance systems, autonomous navigation, and robotics. Existing methods, such as the Kalman filter and particle filter, each have their strengths and weaknesses. The Kalman filter is suitable for linear systems but less efficient in nonlinear systems, while the particle filter can better handle system nonlinearity but requires more computations. The main goal of this research is to improve the accuracy and efficiency of estimating the movement paths of objects by combining path linearization techniques with existing advanced methods. In this method, the nonlinear model of the object's path is first transformed into a simpler linear model using linearization techniques. The Kalman filter is then used to estimate the states of the linearized system. This approach simplifies the calculations while increasing the estimation accuracy. In the subsequent step, a particle filter-based method is employed to manage noise and sudden changes in the object's trajectory. This combination of two different methods allows leveraging the advantages of both, resulting in a more accurate and robust estimate. Experimental results show that the proposed method performs better than traditional methods, achieving higher accuracy in various conditions, including those with high noise and sudden changes in the movement path. Specifically, the proposed approach improves movement forecasting accuracy by abo

关键词： Trajectory of objects Estimation of tracking image processing Linearization

来源：评论

学校读者我要写书评

暂无评论

Classifying beams carrying orbital angular momentum with machine learning: tutorial

引用

JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS image SCIENCE AND vision 2023年第1期40卷 64-77页

作者： Avramov-zamurovic, S. v. E. T. L. A. N. A. Esposito, Joel M. Nelson, C. H. A. R. L. E. S. US Naval Acad Weap Robot & Control Engn Dept 597 McNair Rd Hopper Hall Annapolis MD 21402 USA US Naval Acad Elect & Comp Engn Dept 597 McNair Rd Hopper Hall Annapolis MD 21402 USA

This tutorial discusses optical communication systems that propagate light carrying orbital angular momentum through random media and use machine learning (aka artificial intelligence) to classify the distorted images of the received alphabet symbols. We assume the reader is familiar with either optics or machine learning but is likely not an expert in both. We review select works on machine learning applications in various optics areas with a focus on beams that carry orbital angular momentum. We then discuss optical experimental design, including generating Laguerre-Gaussian beams, creating and characterizing optical turbulence, and engineering considerations when capturing the images at the receiver. We then provide an accessible primer on convolutional neural networks, a machine learning technique that has proved effective at image classification. We conclude with a set of best prac-tices for the field and provide an example code and a benchmark dataset for researchers looking to try out these techniques.(c) 2022 Optica Publishing Group

关键词： Atmospheric turbulence Effective refractive index Laser beam propagation Optical turbulence Partial coherence Structured light

来源：评论

学校读者我要写书评

暂无评论

machine-learning methods for detecting tuberculosis in Ziehl-Neelsen stained slides: A systematic literature review

引用

INTELLIGENT SYSTEMS WITH applications 2024年 22卷

作者： Tamura, Gabriel Llano, Gonzalo Aristizabal, Andres valencia, Juan Sua, Luz Fernandez, Liliana Univ Icesi Dept Comp & Intelligent Syst Cali Valle Del Cauca Colombia Univ Icesi Ctr Artificial Intelligence & Data Sci Cali Valle Del Cauca Colombia Univ Icesi Res Grp Informat Technol & Telecommun i2T Cali Valle Del Cauca Colombia Univ Icesi Fac Hlth Sci Cali Valle Del Cauca Colombia Fdn Valle Lili Dept Pathol & Lab Med Cali Valle Del Cauca Colombia Fdn Valle Lili Dept Internal Med Pulmonol Serv Cali Valle Del Cauca Colombia

Tuberculosis (TB) remains a global health threat, and rapid, automated and accurate diagnosis is crucial for effective control. The tedious and subjective nature of Ziehl-Neelsen (ZN) stained smear microscopy for identifying Mycobacterium tuberculosis (MTB) motivates the exploration of alternative approaches. In recent years, machine learning (ML) methods have emerged as promising tools for automated TB detection in ZN-stained images. This systematic literature review (SLR) comprehensively examines the application of ML methods for TB detection between 2017 and 2023, focusing on their performance metrics and employed dataset characteristics. The study identifies advancements, establishes the state of the art, and pinpoints areas for future research and development in this domain. It sheds light on the discussion about the readiness of machine-learning methods to be confidently, reliably and cost-effectively used to automate the process of tuberculosis detection in ZN slides, being it significant for the health systems worldwide. Following established SLR guidelines, we defined research questions, retrieved 175 papers from 7 well-known sources, and discarded those not complying with the inclusion criteria. Data extraction and analysis were performed on the resulting 65 papers to address our research questions. The key contributions of this review are as follows. First, it presents a characterization of the state of the art of ML methods for ZN-stained TB detection, especially in sputum and tissue. Second, it analyzes top-performing methods and pre-processing techniques. Finally, it pinpoints key research gaps and opportunities.

关键词： Tuberculosis Digital pathology Medical image processing Artificial intelligence Computer vision machine learning

来源：评论

学校读者我要写书评

暂无评论

International Conference on Internet of Everything and Quantum Information processing, IEQIP 2023

International Conference on Internet of Everything and Quant...

引用

International Conference on Internet of Everything and Quantum Information processing, IEQIP 2023

ISBN: (纸本)9783031619281

The proceedings contain 31 papers. The special focus in this conference is on Internet of Everything and Quantum Information processing. The topics include: Revolutionizing Agriculture: A Mobile App for Rapid Plant Disease Prediction and Sustainable Food Security;EMG Based Human machine Integration for IoT Based Instruments;medrack: Bridging Trust and Technology for Safer Drug Supply Chain Using Ethereum and IoT;a Review on Tuberculosis Pattern Detection Based on various machine Learning Techniques;sensor Based Hand Gesture Identification for Human machine Interface;an Improved Detection System Using Genetic Algorithm and Decision Tree;a Detailed Analysis of Colorectal Polyp Segmentation with U-Network;a Review on Internet of Things (IoT): Parkinson’s Disease Monitoring Device;machine Learning-Based Prediction of Temperature Rise in Squirrel Cage Induction Motor (SCIM);quantum Many-Body Problems: Quantum machine Learning applications;Experimental Study on the Impact of Airborne Dust Deposition on Pv Modules Using Internet of Things;bidirectional Converter with Time Utilization-Based Tariff Investigation and IoT Monitoring of Charging Parameters Based on G2v and v2G Operations;predictive Analysis of Telecom Customer Churn Using machine Learning Techniques;baker’s Map Based Chaotic image Encryption in Military Surveillance Systems;Cyber Security Investigation of GPS-Spoofing Attack in Military UAv Networks;ioT Based Enhanced Safety Monitoring System for Underground Coal Mines Using LoRa Technology;ioT Based Hydroponic System for Sustainable Organic Farming;predicting Stride Length from Acceleration Signals Using Lightweight machine Learning Algorithms;unveiling Hate: Multimodal Perspectives and Knowledge Graphs;vision-Based Toddler Activity Recognition: Challenges and applications;automated W-Sitting Posture Detection in Toddlers.

关键词：

来源：评论

学校读者我要写书评

暂无评论

image Generation from Arabic Text: Comparative Study of Proposed Architectures 8th

Image Generation from Arabic Text: Comparative Study of Prop...

引用

8th International Conference on Arabic Language processing

作者： Bahani, Mourad El Ouaazizi, Aziza Maalmi, Khalil Essahlaoui, Abdelouahed Sidi Mohamed Ben Abdellah Univ Natl Sch Appl Sci Artificial Intelligence Data Sci & Emerging Syst Fes Morocco Sidi Mohamed Ben Abdellah Univ Lab Engn Sci FPT Taza Morocco

ISBN: (纸本)9783031804373;9783031804380

Text-to-image generation is a cutting-edge technology that enables computers to generate images from textual descriptions. While this technology has been extensively researched and applied to English language text, applying it to Arabic language text is still in its early stages. Additionally, the Arabic language is challenging due to its right-to-left writing system and extensive vocabulary of 1.3 million words. In this paper, we explore text-to-image generation for generating images from Arabic language text descriptions. Firstly, we fine-tune a transformer-based model pre-trained on the Arabic text to transform the text information into affine transformation within the DF-GAN generator. Secondly, we present a text transformer that combines LSTM layers to address the limitation of unrecognized words. Thirdly, a mask predictor is trained into the generator using a weakly supervised method and incorporated into the affine transformation for a more effective integration of image and text features. In addition, we add the DAMSM loss function as a regularization to the loss function to achieve convergences and stability in the training phase. The experiment on two challenging datasets CUB and Oxford-flower shows that our architectures can accurately generate high-quality images faithfully representing the Arabic textual descriptions. We believe the scaling of this task could have critical applications in fields such as Arabic visual learning, e-commerce, advertising, and entertainment.

关键词： machine Learning Deep Learning Computer vision Generative Adversarial Networks Text-to-image Generation Arabic Text processing

来源：评论

学校读者我要写书评

暂无评论

Chromosome analysis using a hybrid deep CNN and structural feature-based grouping model

引用

Multimedia Tools and applications 2025年 1-30页

作者： Isfahani, Farahnaz Peiravi Pourghassem, Hossein Mahdavi-Nasab, Homayoun Naghsh, Alireza Department of Electrical Engineering Najafabad Branch Islamic Azad University Najafabad Iran Digital Processing and Machine Vision Research Center Najafabad Branch Islamic Azad University Najafabad Iran

Chromosome analysis and classification are essential in clinical applications to diagnose various structural and numerical abnormalities. Recently, karyotype analysis using intelligent image processing methods, especially deep learning, has attracted significant attention as a genetic abnormality test. This paper presents a novel chromosome classification algorithm that uses high-level features extracted from deep convolutional neural networks (DCNN) along with morphological features designed to identify and modify the classes of misclassified chromosomes. Initially, chromosomes are classified using a DCNN. Some structural features, such as centromere and banding profile, are then extracted to group chromosomes again. Based on the results of the two preceding methods, a decision strategy is utilized to identify misclassified chromosomes. Here, a final DCNN-based strategy is introduced to assign misclassified chromosomes to the associated classes. The proposed method can be used in parallel with other chromosome classification methods to modify misclassified chromosomes and promote the accuracy of the classification. Evaluation results show that the proposed algorithm outperforms relevant state-of-the-art algorithms regarding the classification precision and accuracy of 99.66 and 96.52%, respectively. © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2024.

关键词： Convolutional neural networks

来源：评论

学校读者我要写书评

暂无评论

The Background Also Matters: Background-Aware Motion-Guided Objects Discovery

The Background Also Matters: Background-Aware Motion-Guided ...

引用

IEEE/CvF Winter Conference on applications of Computer vision (WACv)

作者： Kara, Sandra Ammar, Hejer Chabot, Florian Quoc-Cuong Pham Univ Paris Saclay CEA List F-91120 Palaiseau France

ISBN: (纸本)9798350318920;9798350318937

Recent works have shown that objects discovery can largely benefit from the inherent motion information in video data. However, these methods lack a proper background processing, resulting in an over-segmentation of the non-object regions into random segments. This is a critical limitation given the unsupervised setting, where object segments and noise are not distinguishable. To address this limitation we propose BMOD, a Background-aware Motion-guided Objects Discovery method. Concretely, we leverage masks of moving objects extracted from optical flow and design a learning mechanism to extend them to the true foreground composed of both moving and static objects. The background, a complementary concept of the learned foreground class, is then isolated in the object discovery process. This enables a joint learning of the objects discovery task and the object/non-object separation. The conducted experiments on synthetic and real-world datasets show that integrating our background handling with various cutting-edge methods brings each time a considerable improvement. Specifically, we improve the objects discovery performance with a large margin, while establishing a strong baseline for object/non-object separation.

关键词： Algorithms Algorithms Algorithms and algorithms formulations image recognition and understanding machine learning architectures video recognition and understanding

来源：评论

学校读者我要写书评

暂无评论

FPGA-Based Implementation of Real-Time Cardiologist-Level Arrhythmia Detection and Classification in Electrocardiograms Using Novel Deep Learning

引用

INTERNATIONAL JOURNAL OF CIRCUIT THEORY AND applications 2024年第0期

作者： Chandrasekaran, Saravanakumar Chandran, Srinivasan Selvam, Immaculate Joy SRM Valliammai Engn Coll ECE Chennai Tamil Nadu India Rajalakshmi Engn Coll EEE Chennai Tamil Nadu India Saveetha Engn Coll ECE Chennai Tamil Nadu India

Cardiac arrhythmia refers to irregular heartbeats caused by anomalies in electrical transmission in the heart muscle, and it is an important threat to cardiovascular health. Conventional monitoring and diagnosis still depend on the laborious visual examination of electrocardiogram (ECG) devices, even though ECG signals are dynamic and complex. This paper discusses the need for an automated system to assist clinicians in efficiently recognizing arrhythmias. The existing machine-learning (ML) algorithms have extensive training cycles and require manual feature selection;to eliminate this, we present a novel deep learning (DL) architecture. Our research introduces a novel approach to ECG classification by combining the vision transformer (viT) and the capsule network (CapsNet) into a hybrid model named viT-Cap. We conduct necessary preprocessing operations, including noise removal and signal-to-image conversion using short-time Fourier transform (SIFT) and continuous wavelet transform (CWT) algorithms, on both normal and abnormal ECG data obtained from the MIT-BIH database. The proposed model intelligently focuses on crucial features by leveraging global and local attention to explore spectrogram and scalogram image data. Initially, the model divides the images into smaller patches and linearly embeds each patch. Features are then extracted using a transformer encoder, followed by classification using the capsule module with feature vectors from the viT module. Comparisons with existing conventional models show that our proposed model outperforms the original viT and CapsNet in terms of classification accuracy for both binary and multi-class ECG classification. The experimental findings demonstrate an accuracy of 99% on both scalogram and spectrogram images. Comparative analysis with state-of-the-art methodologies confirms the superiority of our framework. Additionally, we configure a field-programmable gate array (FPGA) to implement the proposed model for real-time ar

关键词： accuracy arrhythmia capsule network deep learning electrocardiogram field-programmable gate array power processing time vision transformer

来源：评论

学校读者我要写书评

暂无评论

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案：

请选择收藏分类：

通借通还

建议与咨询 留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

时间限定

文献类型

馆藏选择

核心期刊

语言

文献类型

帮助

文字说明：

检索规则说明：

检索范例：

分类表

所选分类

限定检索结果

文献类型

馆藏范围

日期分布

学科分类号

主题

机构

作者

语言

请选择保存的检索档案： 新增检索档案 确定 取消

请选择收藏分类： 新增自定义分类 确定 取消

通借通还

建议与咨询留下您的常用邮箱和电话号码，以便我们向您反馈解决方案和替代方法

请选择保存的检索档案：

请选择收藏分类：