Due to the aging of the world's population, the incidence of retinal diseases is on the rise. Machine learning is expected to have a crucial role in identifying retinal disease. Multiple medical institutions coope...
详细信息
In the recent years, the development of generative models has stimulated the health care progress, specially medical image generation. The synthetic medical images can be applied to several fields and have many utiliz...
详细信息
Entity matching is a crucial aspect of data management systems, requiring the identification of real-world entities from diverse expressions. Despite the human ability to recognize equivalences among entities, machine...
详细信息
Recently, Graph Neural Networks (GNNs) using aggregating neighborhood collaborative information have shown effectiveness in recommendation. However, GNNs-based models suffer from over-smoothing and data sparsity probl...
详细信息
As a result of its aggressive nature and late identification at advanced stages, lung cancer is one of the leading causes of cancer-related deaths. Lung cancer early diagnosis is a serious and difficult challenge that...
详细信息
1 Introduction The process of complex diseases is closely linked to the disruption of key biological pathways,it is crucial to identify the dysfunctional pathways and quantify the degree of dysregulation at the indivi...
详细信息
1 Introduction The process of complex diseases is closely linked to the disruption of key biological pathways,it is crucial to identify the dysfunctional pathways and quantify the degree of dysregulation at the individual sample level[1].
This paper deals with the problem of estimator-based sliding mode control against denial-of-service(DoS) attacks and discrete events via a time-delay approach. A networked system is considered an uncertain dynamical s...
详细信息
This paper deals with the problem of estimator-based sliding mode control against denial-of-service(DoS) attacks and discrete events via a time-delay approach. A networked system is considered an uncertain dynamical system with matched and mismatched perturbations and exogenous disturbance in the network environment. A network-resource-aware event-triggering mechanism is designed with aperiodically releasing system measurements. Furthermore, to describe the DoS attack duration and inter-event time, a time-delay modeling approach considers the DOS attack duration and inter-event time as a “time delay” of the measurements between the sensor and controller over the network is proposed. Consequently, an intervaltime-delay system with uncertainties is formulated. A state-observer-based sliding mode controller, by which the ideal sliding mode can be achieved, is proposed against the DoS attacks. The resulting sliding motion is proved to be robust and stable with an Lgain performance. Finally, the effectiveness and applicability of the present sliding mode control are validated in a simulated pendulum system.
With recent advancements in robotic surgery,notable strides have been made in visual question answering(VQA).Existing VQA systems typically generate textual answers to questions but fail to indicate the location of th...
详细信息
With recent advancements in robotic surgery,notable strides have been made in visual question answering(VQA).Existing VQA systems typically generate textual answers to questions but fail to indicate the location of the relevant content within the *** limitation restricts the interpretative capacity of the VQA models and their abil-ity to explore specific image *** address this issue,this study proposes a grounded VQA model for robotic surgery,capable of localizing a specific region during answer *** inspiration from prompt learning in language models,a dual-modality prompt model was developed to enhance precise multimodal information ***,two complementary prompters were introduced to effectively integrate visual and textual prompts into the encoding process of the model.A visual complementary prompter merges visual prompt knowl-edge with visual information features to guide accurate *** textual complementary prompter aligns vis-ual information with textual prompt knowledge and textual information,guiding textual information towards a more accurate inference of the ***,a multiple iterative fusion strategy was adopted for comprehensive answer reasoning,to ensure high-quality generation of textual and grounded *** experimental results vali-date the effectiveness of the model,demonstrating its superiority over existing methods on the EndoVis-18 and End-oVis-17 datasets.
Rapid urbanization has made road construction and maintenance imperative, but detecting road diseases has been time-consuming with limited accuracy. To overcome these challenges, we propose an efficient YOLOv7 road di...
详细信息
People-centric activity recognition is one of the most critical technologies in a wide range of real-world applications,including intelligent transportation systems, healthcare services, and brain-computer interfaces....
详细信息
People-centric activity recognition is one of the most critical technologies in a wide range of real-world applications,including intelligent transportation systems, healthcare services, and brain-computer interfaces. Large-scale data collection and annotation make the application of machine learning algorithms prohibitively expensive when adapting to new tasks. One way of circumventing this limitation is to train the model in a semi-supervised learning manner that utilizes a percentage of unlabeled data to reduce the labeling burden in prediction tasks. Despite their appeal, these models often assume that labeled and unlabeled data come from similar distributions, which leads to the domain shift problem caused by the presence of distribution gaps. To address these limitations, we propose herein a novel method for people-centric activity recognition,called domain generalization with semi-supervised learning(DGSSL), that effectively enhances the representation learning and domain alignment capabilities of a model. We first design a new autoregressive discriminator for adversarial training between unlabeled and labeled source domains, extracting domain-specific features to reduce the distribution gaps. Second, we introduce two reconstruction tasks to capture the task-specific features to avoid losing information related to representation learning while maintaining task-specific consistency. Finally, benefiting from the collaborative optimization of these two tasks, the model can accurately predict both the domain and category labels of the source domains for the classification task. We conduct extensive experiments on three real-world sensing datasets. The experimental results show that DGSSL surpasses the three state-of-the-art methods with better performance and generalization.
暂无评论