作者:
Wang, YunkeDu, BoXu, ChangSchool of Computer Science
National Engineering Research Center for Multimedia Software Institute of Artificial Intelligence Hubei Key Laboratory of Multimedia and Network Communication Engineering Wuhan University Wuhan China School of Computer Science
Faculty of Engineering The University of Sydney Australia
Adversarial imitation learning has become a widely used imitation learning framework. The discriminator is often trained by taking expert demonstrations and policy trajectories as examples respectively from two catego...
详细信息
The study of human engagement has significantly grown in recent years, particularly accelerated by the interaction with a growing number of smart computing machines [1, 2, 3]. Engagement estimation has significant imp...
详细信息
Planar grasp detection is one of the most fundamental tasks to robotic manipulation, and the recent progress of consumer-grade RGB-D sensors enables delivering more comprehensive features from both the texture and sha...
Planar grasp detection is one of the most fundamental tasks to robotic manipulation, and the recent progress of consumer-grade RGB-D sensors enables delivering more comprehensive features from both the texture and shape modalities. However, depth maps are generally of a relatively lower quality with much stronger noise compared to RGB images, making it challenging to acquire grasp depth and fuse multi-modal clues. To address the two issues, this paper proposes a novel learning based approach to RGB-D grasp detection, namely Depth Guided Cross-modal Attention Network (DGCAN). To better leverage the geometry information recorded in the depth channel, a complete 6-dimensional rectangle representation is adopted with the grasp depth dedicatedly considered in addition to those defined in the common 5-dimensional one. The prediction of the extra grasp depth substantially strengthens feature learning, thereby leading to more accurate results. Moreover, to reduce the negative impact caused by the discrepancy of data quality in two modalities, a Local Cross-modal Attention (LCA) module is designed, where the depth features are refined according to cross-modal relations and concatenated to the RGB ones for more sufficient fusion. Extensive simulation and physical evaluations are conducted and the experimental results highlight the superiority of the proposed approach.
Medical dialogue systems have attracted growing research attention as they have the potential to provide rapid diagnoses, treatment plans, and health consultations. In medical dialogues, a proper diagnosis is crucial ...
详细信息
The rapid advancements in artificial intelligence (AI) and deep learning have revolutionized various sectors, enabling unprecedented levels of innovation and efficiency. This paper delves into the transformative impac...
详细信息
ISBN:
(数字)9798331512965
ISBN:
(纸本)9798331512972
The rapid advancements in artificial intelligence (AI) and deep learning have revolutionized various sectors, enabling unprecedented levels of innovation and efficiency. This paper delves into the transformative impact of deep learning across multiple domains, highlighting key innovations and their broader implications. By examining cutting-edge applications in healthcare, finance, autonomous systems, and natural language processing, the study elucidates how deep learning models, such as convolutional neural networks (CNNs) and recurrent neural networks (RNNs), are pushing the boundaries of what's possible. Moreover, the paper explores the integration of deep learning with emerging technologies such as the Internet of Things (IoT), edge computing, and quantum computing, underscoring the synergistic potential of these combinations. The role of deep learning in predictive analytics and personalized solutions is discussed, showcasing its ability to enhance decision-making processes and improve user experiences. Furthermore, the paper discusses the ethical considerations, challenges, and future prospects associated with the widespread adoption of deep learning technologies. The insights provided aim to foster a deeper understanding of how harnessing AI with deep learning can drive progress while addressing critical societal and technological issues.
This paper introduces a novel telehealth communication system, designed to enhance the security and integrity of medical data exchange. In the rapidly evolving digital healthcare landscape, the protection of sensitive...
详细信息
ISBN:
(数字)9798350350548
ISBN:
(纸本)9798350350555
This paper introduces a novel telehealth communication system, designed to enhance the security and integrity of medical data exchange. In the rapidly evolving digital healthcare landscape, the protection of sensitive patient information is paramount. To address this, our system uniquely combines quantum cryptography, specifically the BB84 protocol, with blockchain technology, offering a dual-layered security framework. The Quantum Layer, underpinned by the BB84 protocol, establishes quantum-secure communication channels, effectively encrypting data exchanges between patients, doctors, and hospitals. This layer guarantees that medical information remains confidential and safe from potential quantum-level eavesdropping threats. The subsequent Blockchain Layer further strengthens the system by storing these encrypted communications in an immutable blockchain ledger. This approach not only secures the data against unauthorized alterations but also provides a transparent and permanent record of all transactions, thereby enhancing the auditability of medical communications.
Deep learning algorithms are becoming more potent and producing human-synthesized, undifferentiated footage is a simple process thanks to advances in computing power. A method of synthesizing human images called deep ...
详细信息
ISBN:
(数字)9798350352689
ISBN:
(纸本)9798350352696
Deep learning algorithms are becoming more potent and producing human-synthesized, undifferentiated footage is a simple process thanks to advances in computing power. A method of synthesizing human images called deep fakes is built on neural network techniques like GAN. These technologies use deep learning algorithms to overlay target pictures onto the source films, producing realistic-looking deep-fake photos and videos. In this work, we provide a novel deep-learning approach that can reliably discriminate between actual and AI-generated fake videos. In this work, we test our method on a large number of balanced and mixed datasets created by mixing various publicly available datasets, such as Face-Forensic++, Deepfake detection, Celeb DF, and other publicly available datasets as images and videos, to emulate real-time scenarios and improve model performance with real-time data by using LSTM and ResNet.
As a fundamental operation, sparse matrix-vector multiplication (SpMV) plays a key role in solving a number of scientific and engineering problems. This paper presents a NUMA-Aware optimization technique for the SpMV ...
详细信息
Alzheimer's disease is a neurodegenerative disorder affecting millions worldwide, causing cognitive decline. Early detection is crucial for effective treatment, but manual diagnosis is error-prone and time-consumi...
Alzheimer's disease is a neurodegenerative disorder affecting millions worldwide, causing cognitive decline. Early detection is crucial for effective treatment, but manual diagnosis is error-prone and time-consuming. This study proposes a deep learning-based model for diagnosing and classifying Alzheimer's disease using VGG-16 CNN architectures. The model uses the Alzheimer's Disease Neuroimaging Initiative (ADNI) dataset, which includes MRI scans from individuals with Alzheimer's and healthy controls. The model uses a pre-trained VGG-16 convolutional neural network as the base, fine-tuning on the dataset, and data augmentation techniques. The trained model achieves a 96.75% accuracy on the testing dataset, distinguishing MRI scans between Alzheimer's patients and healthy controls. The VGG-16 architecture demonstrated superior performance during training and testing.
Non-intrusive Load Monitoring (NILM) is becoming a paramount in both industrial and residential sectors to achieve efficient energy consumption. Thus, research on this matter flourished in recent years, where deep neu...
详细信息
暂无评论