Chromosome analysis and classification are essential in clinical applications to diagnose various structural and numerical abnormalities. Recently, karyotype analysis using intelligent imageprocessing methods, especi...
详细信息
Recent works have shown that objects discovery can largely benefit from the inherent motion information in video data. However, these methods lack a proper background processing, resulting in an over-segmentation of t...
详细信息
ISBN:
(纸本)9798350318920;9798350318937
Recent works have shown that objects discovery can largely benefit from the inherent motion information in video data. However, these methods lack a proper background processing, resulting in an over-segmentation of the non-object regions into random segments. This is a critical limitation given the unsupervised setting, where object segments and noise are not distinguishable. To address this limitation we propose BMOD, a Background-aware Motion-guided Objects Discovery method. Concretely, we leverage masks of moving objects extracted from optical flow and design a learning mechanism to extend them to the true foreground composed of both moving and static objects. The background, a complementary concept of the learned foreground class, is then isolated in the object discovery process. This enables a joint learning of the objects discovery task and the object/non-object separation. The conducted experiments on synthetic and real-world datasets show that integrating our background handling with various cutting-edge methods brings each time a considerable improvement. Specifically, we improve the objects discovery performance with a large margin, while establishing a strong baseline for object/non-object separation.
Cardiac arrhythmia refers to irregular heartbeats caused by anomalies in electrical transmission in the heart muscle, and it is an important threat to cardiovascular health. Conventional monitoring and diagnosis still...
详细信息
Cardiac arrhythmia refers to irregular heartbeats caused by anomalies in electrical transmission in the heart muscle, and it is an important threat to cardiovascular health. Conventional monitoring and diagnosis still depend on the laborious visual examination of electrocardiogram (ECG) devices, even though ECG signals are dynamic and complex. This paper discusses the need for an automated system to assist clinicians in efficiently recognizing arrhythmias. The existing machine-learning (ML) algorithms have extensive training cycles and require manual feature selection;to eliminate this, we present a novel deep learning (DL) architecture. Our research introduces a novel approach to ECG classification by combining the vision transformer (ViT) and the capsule network (CapsNet) into a hybrid model named ViT-Cap. We conduct necessary preprocessing operations, including noise removal and signal-to-image conversion using short-time Fourier transform (SIFT) and continuous wavelet transform (CWT) algorithms, on both normal and abnormal ECG data obtained from the MIT-BIH database. The proposed model intelligently focuses on crucial features by leveraging global and local attention to explore spectrogram and scalogram image data. Initially, the model divides the images into smaller patches and linearly embeds each patch. Features are then extracted using a transformer encoder, followed by classification using the capsule module with feature vectors from the ViT module. Comparisons with existing conventional models show that our proposed model outperforms the original ViT and CapsNet in terms of classification accuracy for both binary and multi-class ECG classification. The experimental findings demonstrate an accuracy of 99% on both scalogram and spectrogram images. Comparative analysis with state-of-the-art methodologies confirms the superiority of our framework. Additionally, we configure a field-programmable gate array (FPGA) to implement the proposed model for real-time ar
Geometric Deep Learning has recently made striking progress with the advent of continuous deep implicit fields. They allow for detailed modeling of watertight surfaces of arbitrary topology while not relying on a 3D e...
详细信息
Geometric Deep Learning has recently made striking progress with the advent of continuous deep implicit fields. They allow for detailed modeling of watertight surfaces of arbitrary topology while not relying on a 3D euclidean grid, resulting in a learnable parameterization that is unlimited in resolution. Unfortunately, these methods are often unsuitable for applications that require an explicit mesh-based surface representation because converting an implicit field to such a representation relies on the Marching Cubes algorithm, which cannot be differentiated with respect to the underlying implicit field. In this work, we remove this limitation and introduce a differentiable way to produce explicit surface mesh representations from Deep Implicit Fields. Our key insight is that by reasoning on how implicit field perturbations impact local surface geometry, one can ultimately differentiate the 3D location of surface samples with respect to the underlying deep implicit field. We exploit this to define DeepMesh - an end-to-end differentiable mesh representation that can vary its topology. We validate our theoretical insight through several applications: Single view 3D Reconstruction via Differentiable Rendering, Physically-Driven Shape Optimization, Full Scene 3D Reconstruction from Scans and End-to-End Training. In all cases our end-to-end differentiable parameterization gives us an edge over state-of-the-art algorithms.
As one of the most indispensable means of transportation in modern society, vehicles guarantee our daily commuting and logistics transportation. However, with the increasing number of vehicles, vehicles have also caus...
详细信息
As one of the most indispensable means of transportation in modern society, vehicles guarantee our daily commuting and logistics transportation. However, with the increasing number of vehicles, vehicles have also caused increasingly serious traffic safety problems while providing convenience to our lives. One of the most common of these is traffic accidents caused by vehicle yaw due to driver distraction. As a potential solution to this problem, lane departure warning systems (LDWS) focus on detecting and determining whether the vehicle is deviating from the driveway, considered an essential part of autonomous driving technology, and have received significant attention in recent years. A large number of different types of LDWS systems have been developed, especially in recent years, with the development of artificial intelligence technology, many methods based on deep learning and machinevision have been proposed. However, it is well known that due to the complexity of the network structure in deep learning-based object detection algorithms, the operation of such methods relies on a large amount of computing power support. However, due to the limitation of the overall energy supply of the vehicle, it is usually unable to support computing power similar to the laboratory level. Therefore, how to realize efficient lane departure warnings under the condition of limited computing power is a critical problem to be solved. Accordingly, in this paper, we propose a novel lightweight LDWS model. Different from deep learning methods of LDWS, our LDWS model LEHA can achieve high accuracy and efficiency by relying only on simple hardware. The proposed LEHA consists of three modules: the imageprocessing module, the lane detection module, and the lane departure recognition module. The image pre-processing module is applied to pre-process the original road image, which can improve the accuracy and efficiency of the following lane detection module. After obtaining the processed i
Human civilization is based on agriculture, and one of the biggest threats to agricultural productivity is the existence of wild animals on farmlands. Animals are taking over some farmlands, causing significant crop l...
详细信息
The integration of human-robot interaction (HRI) technologies with industrial automation has become increasingly essential for enhancing productivity and safety in manufacturing environments. In this paper, we propose...
详细信息
machine learning algorithms have had a profound impact on the field of computer science over the past few decades. The performance of these algorithms heavily depends on the representations derived from the data durin...
详细信息
machine learning algorithms have had a profound impact on the field of computer science over the past few decades. The performance of these algorithms heavily depends on the representations derived from the data during the learning process. Successful learning processes aim to produce concise, discrete, meaningful representations that can be effectively applied to various tasks. Recent advancements in deep learning models have proven to be highly effective in capturing high-dimensional, non-linear, and multi-modal characteristics. In this work, we provide a comprehensive overview of the current state-of-the-art in deep representation learning and the principles and developments made in the process of representation learning. Our study encompasses both supervised and unsupervised methods, including popular techniques such as autoencoders, self-supervised methods, and deep neural networks. Furthermore, we explore a wide range of applications, including image recognition and natural language processing. In addition, we discuss recent trends, key issues, and open challenges in the field. This survey endeavors to make a significant contribution to the field of deep representation learning, fostering its understanding and facilitating further advancements.
Alzheimer's disease (AD) is a progressive and irreversible neurological disorder that leads to memory loss and cognitive decline. It is a prevalent form of dementia among individuals aged 65 and above. Accurate an...
详细信息
Alzheimer's disease (AD) is a progressive and irreversible neurological disorder that leads to memory loss and cognitive decline. It is a prevalent form of dementia among individuals aged 65 and above. Accurate and early diagnosis of AD is of utmost importance. Diagnostic neuroimaging and software techniques have emerged as crucial tools for assessing early-stage dementia. The aim of this study is to provide a comprehensive review of recent research that employs deep learning (DL) techniques for the assessment of dementia, particularly the early stages of AD. The objective is to analyze the current state of research and explore the future directions of this field. The study involves a systematic review of literature that focuses on the utilization of DL techniques in the assessment of dementia and early AD diagnosis. Various datasets commonly used for AD prediction are examined. The study encompasses the discussion of different applications of contemporary AI algorithms in AD detection, as well as their merits, limitations, and performance. The review reveals that DL techniques have shown promise in the detection and diagnosis of AD. The use of DL, particularly in image classification and natural language processing, has demonstrated significant advancements in the field of AI. The study also highlights the potential of AI in AD genetic studies, providing valuable insights into the broad scope of this research..
This article proposes a simple and effective method for image subject segmentation. Our research mainly focuses on the characteristics of material images in the experimental platform. Through in-depth research, we hav...
详细信息
暂无评论