Cyber-Physical Systems(CPS)represent an integration of computational and physical elements,revolutionizing industries by enabling real-time monitoring,control,and optimization.A complementary technology,Digital Twin(D...
详细信息
Cyber-Physical Systems(CPS)represent an integration of computational and physical elements,revolutionizing industries by enabling real-time monitoring,control,and optimization.A complementary technology,Digital Twin(DT),acts as a virtual replica of physical assets or processes,facilitating better decision making through simulations and predictive *** and DT underpin the evolution of Industry 4.0 by bridging the physical and digital *** survey explores their synergy,highlighting how DT enriches CPS with dynamic modeling,realtime data integration,and advanced simulation *** layered architecture of DTs within CPS is examined,showcasing the enabling technologies and tools vital for seamless *** study addresses key challenges in CPS modeling,such as concurrency and communication,and underscores the importance of DT in overcoming these *** in various sectors are analyzed,including smart manufacturing,healthcare,and urban planning,emphasizing the transformative potential of CPS-DT *** addition,the review identifies gaps in existing methodologies and proposes future research directions to develop comprehensive,scalable,and secure CPSDT *** synthesizing insights fromthe current literature and presenting a taxonomy of CPS and DT,this survey serves as a foundational reference for academics and *** findings stress the need for unified frameworks that align CPS and DT with emerging technologies,fostering innovation and efficiency in the digital transformation era.
The self-attention mechanism of Transformers,which captures long-range contextual information,has demonstrated significant potential in image ***,their ability to learn local,contextual relationships between pixels re...
详细信息
The self-attention mechanism of Transformers,which captures long-range contextual information,has demonstrated significant potential in image ***,their ability to learn local,contextual relationships between pixels requires further *** methods face challenges in efficiently managing multi-scale fea-tures of different granularities from the encoder backbone,leaving room for improvement in their global representation and feature extraction *** address these challenges,we propose a novel Decoder with Multi-Head Feature Receptors(DMHFR),which receives multi-scale features from the encoder backbone and organizes them into three feature groups with different granularities:coarse,fine-grained,and full *** groups are subsequently processed by Multi-Head Feature Receptors(MHFRs)after feature capture and modeling *** include two Three-Head Feature Receptors(THFRs)and one Four-Head Feature Receptor(FHFR).Each group of features is passed through these MHFRs and then fed into axial transformers,which help the model capture long-range dependencies within the *** three MHFRs produce three distinct feature *** output from the FHFR serves as auxiliary auxiliary features in the prediction head,and the prediction output and their losses will eventually be *** results show that the Transformer using DMHFR outperforms 15 state of the arts(SOTA)methods on five public ***,it achieved significant improvements in mean DICE scores over the classic Parallel Reverse Attention Network(PraNet)method,with gains of 4.1%,2.2%,1.4%,8.9%,and 16.3%on the CVC-ClinicDB,Kvasir-SEG,CVC-T,CVC-ColonDB,and ETIS-LaribPolypDB datasets,respectively.
This study explores the effectiveness of Convolutional Neural Networks (CNNs) in automatically classifying skin cancer for e-health applications. The trained model showcases impressive performance by leveraging the HA...
详细信息
Reliable Human Orientation Estimation (HOE) from a monocular image is critical for autonomous agents to understand human intention. Significant progress has been made in HOE under full observation. However, the existi...
详细信息
ISBN:
(数字)9798350377705
ISBN:
(纸本)9798350377712
Reliable Human Orientation Estimation (HOE) from a monocular image is critical for autonomous agents to understand human intention. Significant progress has been made in HOE under full observation. However, the existing methods easily make a wrong prediction under partial observation and give it an unexpectedly high confidence. To solve the above problems, this study first develops a method called Part-HOE that estimates orientation from the visible joints of a target person so that it is able to handle partial observation. Subsequently, we introduce a confidence-aware orientation estimation method, enabling more accurate orientation estimation and reasonable confidence estimation under partial observation. The effectiveness of our method is validated on both public and custom-built datasets, and it shows great accuracy and reliability improvement in partial observation scenarios. In particular, we show in real experiments that our method can benefit the robustness and consistency of the Robot Person Following (RPF) task.
In the rapidly advancing field of intelligent transportation systems, integrating artificial intelligence (AI) with edge computing presents a promising way to enhance the safety and efficiency of the Internet of Vehic...
详细信息
The robotic autonomous luggage trolley collection system employs robots to gather and transport scattered luggage trolleys at airports. However, existing methods for detecting and locating these luggage trolleys often...
详细信息
Skin cancer diagnosis, a critical task in the medical domain, can be revolutionized through the application of advanced deep-learning techniques. This work investigates the efficacy of Convolutional Neural Networks (C...
详细信息
Skin cancer diagnosis, a critical task in the medical domain, can be revolutionized through the application of advanced deep-learning techniques. This work investigates the efficacy of Convolutional Neural Networks (CNNs) in the automated classification of skin cancer. The process begins with a comprehensive explanation of key CNN layers: Conv2D, MaxPool2D, Dropout, and Dense. The Conv2D layers employ learnable filters that transform localized image segments, while MaxPool2D contributes to downsampling, effectively reducing computational cost and overfitting risk. Integrating these layers enables the network to capture local and global characteristics, which is crucial for accurate classification. Adding Dropout layers enhances generalization and mitigates overfitting by introducing randomness during training. ReLU activation functions infuse non-linearity, and the Flatten layer facilitates the transition to fully connected layers. The proposed CNN architecture is meticulously designed considering filter counts, kernel sizes, and pooling dimensions. The trained model demonstrates promising performance by utilizing the HAM10000 dataset, encompassing diverse skin lesion images across seven classes. The CNN model’s parameters and architecture are systematically presented, offering insights into its design rationale. The model undergoes optimization with the Adam optimizer and annealing techniques to facilitate convergence. The model’s effectiveness is evaluated on validation and test datasets, demonstrating an accuracy of 78.55% and 76.49%, respectively, for skin cancer classification. Data augmentation strategies are introduced to enhance model generalization further. The results underscore CNN’s potential as a robust tool for automating skin cancer diagnosis, aligning with the broader trend of leveraging deep learning for medical image analysis
Nine-degrees-of-freedom (9-DoF) object pose and size estimation is crucial for enabling augmented reality and robotic manipulation. Category-level methods have received extensive research attention due to their potent...
详细信息
Nine-degrees-of-freedom (9-DoF) object pose and size estimation is crucial for enabling augmented reality and robotic manipulation. Category-level methods have received extensive research attention due to their potential for generalization to intra-class unknown objects. However, these methods require manual collection and labeling of large-scale real-world training data. To address this problem, we introduce a diffusion-based paradigm for domain-generalized category-level 9-DoF object pose estimation. Our motivation is to leverage the latent generalization ability of the diffusion model to address the domain generalization challenge in object pose estimation. This entails training the model exclusively on rendered synthetic data to achieve generalization to real-world scenes. We propose an effective diffusion model to redefine 9-DoF object pose estimation from a generative perspective. Our model does not require any 3D shape priors during training or inference. By employing the Denoising Diffusion Implicit Model, we demonstrate that the reverse diffusion process can be executed in as few as 3 steps, achieving near real-time performance. Finally, we design a robotic grasping system comprising both hardware and software components. Through comprehensive experiments on two benchmark datasets and the real-world robotic system, we show that our method achieves state-of-the-art domain generalization performance.
Edge computing emerges as a pivotal model in the era of next-generation consumer electronics and the emerging challenges of multimodal data-driven decision-making. Specifically, edge computing offers an open computing...
详细信息
Edge computing emerges as a pivotal model in the era of next-generation consumer electronics and the emerging challenges of multimodal data-driven decision-making. Specifically, edge computing offers an open computing architecture for the vast and diverse consumer multimodal data generated by mobile computing and Internet of Things (IoT) technologies. While edge computing is instrumental in optimizing latency and bandwidth control in processing consumer multimodal data, the viability of employing edge resources is complicated by high service costs and the complexities of managing multimodal data diversity. This study introduces an innovative optimization method for distributing multimodal data on edge storage, considering both the I/O (input/output) speed and the overall distribution cost. The core part of our approach is the deployment of intelligent algorithms that ensure equitable data distribution across storage servers, thus eliminating unused space and reducing extra costs. Given the complexity of this NP-hard (non-deterministic polynomial-time) challenge, our study reveals a unique model incorporating an edge-broker component in combination with novel algorithms. The proposed algorithms aim to harmonize data distribution and reduce resource allocation expenses in a multimodal edge environment. Our proposed approach achieves excellent results, highlighting the efficacy of the proposed algorithms in several parameters such as makespan, cost, multimodal data security, and total processing time. Empirical tests reveal that the BCA (Best Clustering Algorithm) performs best, achieving a minimum load balancing rate of 92.2%, an average variance of 0.04, and an average run time of 0.56 seconds. IEEE
This study explores the effectiveness of Convolutional Neural Networks (CNNs) in automatically classifying skin cancer for e-health applications. The trained model showcases impressive performance by leveraging the HA...
详细信息
ISBN:
(数字)9798350351255
ISBN:
(纸本)9798350351262
This study explores the effectiveness of Convolutional Neural Networks (CNNs) in automatically classifying skin cancer for e-health applications. The trained model showcases impressive performance by leveraging the HAM10000 dataset, which includes a wide range of skin lesion images from seven different classes. The parameters and architecture of the CNN model are presented in a systematic manner, providing valuable insights into the reasoning behind its design. The model is optimized using the Adam optimizer and annealing techniques to ensure efficient convergence. The model’s performance is assessed on validation and test datasets, showcasing an accuracy of 78.55% and 76.49%, respectively, for skin cancer classification. This study highlights the significant potential of CNN as a powerful tool for automating the diagnosis of skin cancer, which is in line with the growing trend of using deep learning for medical image analysis.
暂无评论