Retinal optical coherence tomography (OCT) images are widely used to diagnose and grade macular diseases, such as age-related macular degeneration (AMD). However, manual interpretation of OCT images is time-consuming ...
Retinal optical coherence tomography (OCT) images are widely used to diagnose and grade macular diseases, such as age-related macular degeneration (AMD). However, manual interpretation of OCT images is time-consuming and subjective. Therefore, automated and accurate classification of OCT images is essential for assisting ophthalmologists in clinical decision-making. This paper proposes a pyramidal deep neural network that can diagnose normal and two types of AMD (dry and wet) in OCT images. Our network leverages features from different scales of a pre-trained convolutional neural network (CNN) and integrates them with two advanced versions of feature pyramid networks: bidirectional feature pyramid network (BiFPN) and path aggregation network (PANet). We evaluate our network on the NEH dataset and compare it with its predecessor. Our results show that our BiFPN-VGG16 and PAN-VGG16 models achieve accuracies of 94.S% and 95.0%, respectively, which are 2.8 to 3% higher than the previous models. Our approach demonstrates the potential of multi-scale feature networks for OCT image classification and can serve as an auxiliary diagnostic tool for ophthalmologists.
This paper presents a system design for a smart bike helmet with multiple safety features that are intended to empower bicycle riders to proactively avoid potential sources of danger or injury. A Smart Sensor/Actuator...
This paper presents a system design for a smart bike helmet with multiple safety features that are intended to empower bicycle riders to proactively avoid potential sources of danger or injury. A Smart Sensor/Actuator Node (SSAN), driven by an Arduino Uno single-board microcontroller, contains input sensors and actuators to provide riders the ability to send and receive warnings promptly on their helmet. A Vision Node, driven by an NVIDIA Jetson Nano and a cable pin-connected camera, executes AI object detection algorithms for any dangerous objects that are out of sight of the rider and sends alerts to the SSAN as needed. By combining safety features of the SSAN and Vision Node while continuously sending data to an IoT-enabled backend web server, the safety operation of a typical bike ride can be substantially improved.
Traditional multiframe Track-Before-Detect(TBD)may incur adverse integration loss resulting from model mismatch in sensor *** suboptimal integration strategy may cause target envelope *** address these issues,a pseudo...
详细信息
Traditional multiframe Track-Before-Detect(TBD)may incur adverse integration loss resulting from model mismatch in sensor *** suboptimal integration strategy may cause target envelope *** address these issues,a pseudo-spectrum-based multiframe TBD in mixed coordinates is proposed *** data search for energy integration is conducted based on an accurate model in the x-y plane while target energy is integrated based on pseudo-spectrum in sensor *** algorithm performance is improved since the model mismatch is eliminated,and the pseudo-spectrum based integration facilitates well maintained target *** detailed multiframe integration procedure and theoretical target integrated envelope are ***,to cope with the unknown target velocity,a velocity filter bank based on pseudo-spectrum in mixed coordinates is *** effect of velocity mismatch on algorithm performance is analyzed and an efficient method for filter bank design is ***,a parameter estimation method using characteristics of integrated envelope is presented for improved target polar position and Cartesian velocity ***,numerical results are provided to demonstrate the effectiveness of the proposed method.
In this work, we draw connections between the classical Shannon interpolation of bandlimited deterministic signals and the literature on estimating continuous-time random processes from their samples (known in various...
详细信息
Chunk-level speech emotion recognition (SER) is a common modeling scheme to obtain better recognition performance than sentence-level formulations. A key open question is the role of lexical boundary information in th...
详细信息
Chunk-level speech emotion recognition (SER) is a common modeling scheme to obtain better recognition performance than sentence-level formulations. A key open question is the role of lexical boundary information in the process of splitting a sentence into small chunks. Is there any benefit in providing precise lexical boundary information to segment the speech into chunks (e.g., word-level alignments)? This study analyzes the role of lexical boundary information by exploring alternative segmentation strategies for chunk-level SER. We compare six chunk-level segmentation strategies that either consider word-level alignments or traditional time-based segmentation methods by varying the number of chunks and the duration of the chunks. We conduct extensive experiments to evaluate these chunk-level segmentation approaches using multiples corpora, and multiple acoustic feature sets. The results show a minor contribution of the word-level timing boundaries, where centering the chunks around words does not lead to significant performance gains. Instead, the critical factor to effectively segment a sentence into data chunks is to define the number of chunks according to the number of spoken words in the sentence.
The pansharpening of high-resolution panchromatic (Pan) image and low-resolution multispectral (MS) images represents an important task in the remote sensing field. It allows the joint exploitation of the information ...
详细信息
This paper addresses the classical problem of one-bit compressed sensing using a deep learning-based reconstruction algorithm that leverages a trained generative model to enhance the signal reconstruction performance....
详细信息
Early diagnosis and treatment of polyps during colonoscopy are essential for reducing the incidence and mortality of Colorectal Cancer (CRC). However, the variability in polyp characteristics and the presence of artif...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Early diagnosis and treatment of polyps during colonoscopy are essential for reducing the incidence and mortality of Colorectal Cancer (CRC). However, the variability in polyp characteristics and the presence of artifacts in colonoscopy images and videos pose significant challenges for accurate and efficient polyp detection and segmentation. This paper presents a novel approach to polyp segmentation by integrating the Segment Anything Model (SAM 2) with the YOLOv8 model. Our method leverages YOLOv8’s bounding box predictions to autonomously generate input prompts for SAM 2, thereby reducing the need for manual annotations. We conducted exhaustive tests on five benchmark colonoscopy image datasets and two colonoscopy video datasets, demonstrating that our method exceeds state-of-the-art models in both image and video segmentation tasks. Notably, our approach achieves high segmentation accuracy using only bounding box annotations, significantly reducing annotation time and effort. This advancement holds promise for enhancing the efficiency and scalability of polyp detection in clinical settings https://***/sajjad-sh33/YOLO_SAM2.
In this paper, we are going to propose a novel structure of all-optical NOT, XOR and XNOR logic gates are presented using a two-dimensional photonic crystal (2D-PhC). This structure is optimized by varying the radius ...
详细信息
Cancer is one of the deadliest diseases for human health. The classification of cancers poses many challenges in biomedical research because it allows an accurate and effective diagnosis and guarantees the success of ...
详细信息
暂无评论