We introduce a novel task, called Generalized Relation Discovery (GRD), for open-world relation extraction. GRD aims to identify unlabeled instances in existing pre-defined relations or discover novel relations by ass...
详细信息
The remaining useful life (RUL) of bearings is critical to the proper operation of mechanical equipment, maintenance of equipment costs and availability. The existing domain adaptation methods have had great success i...
详细信息
The remaining useful life (RUL) of bearings is critical to the proper operation of mechanical equipment, maintenance of equipment costs and availability. The existing domain adaptation methods have had great success in RUL prediction. However, when the target bearing data are unavailable or unknown to be involved in model training, the domain adaptation approaches also incapable. To solve the problem, we propose a parallel reversible instance normalization method based on adaptive threshold stage division for remaining useful life prediction of unknown bearings. First, we design an adaptive threshold method to find degradation points to divide the healthy and degradation stages. Then according to time series, we merge the original vibration data and its instance normalized data to increase the data distribution diversity. Finally, we combine instance normalization and parallel reversible normalization of the source bearing data into unified RUL learning framework to solve the uncertainty of counterfactual data and improve RUL prediction performance. The results show that the method is superior to the state-of-the-art methods for RUL prediction of unknown bearings.
Service robots play an increasingly important role in people's daily life. The density of pedestrians is large and the movement is irregular in pedestrian-robot mixed traffic flows. Robots are prone to collision w...
详细信息
Service robots play an increasingly important role in people's daily life. The density of pedestrians is large and the movement is irregular in pedestrian-robot mixed traffic flows. Robots are prone to collision with pedestrians, and the tasks to be offloaded are closely related to pedestrians. How to analyze the tasks of robots and select the appropriate roadside unit is an important issue. In this paper, the social force model is used to predict the positions of pedestrians and robots, taking into account the influence of various forces to avoid collisions. A task offloading resource optimization algorithm with position prediction is proposed. According to the predicted information, the size and position distribution of all tasks in the scenario are obtained, and then the neural network trained beforehand based on deep Q-Iearning is used to generate a task offloading strategy. The simulation results show that the running time of the proposed algorithm is very short, and the resource allocation required for task offloading is completed in advance based on the predicted information before robots arriving the corresponding positions. Besides, the algorithm significantly reduces the task offloading delay.
Domain adaptation (DA) -based RUL prediction methods have achieved great success for the adaptation ability of the distribution discrepancy between the source and target domains. However, DA methods are powerless when...
详细信息
Domain adaptation (DA) -based RUL prediction methods have achieved great success for the adaptation ability of the distribution discrepancy between the source and target domains. However, DA methods are powerless when the target domain data are not available for training. To solve this problem, we propose an inter-domain intra-domain normalized generalization (IDIDNG) network, which consists of three modules, respectively, the pre-processing module, the feature transformation module, and the RUL prediction module. First, we design the pre-processing module to process the bearing vibration data with peak-to-peak and Z-score. Finally, it is connected into a four-dimensional array. In the feature transformation module, via intra-domain and inter-domain normalization as well as mean-variance cross-swap, we transform the data distribution expressions of invariant features of bearings from the perspective of different bearings and different degradation stage of one bearing, such that the model enables to learn the intra-domain and inter-domain discrepancy. Further we design four adaptable weighting parameters into the intra-domain normalization to learn the appropriate normalized mean and variance via the model training. Finally, we design the GRU-based RUL prediction module to predict the unknown bearings. We conducted experiments under the PHM2012 dataset, experimental results show that our method achieves satisfactory prediction accuracy in the unknown bearings.
Offline Reinforcement Learning (RL) optimizes policy using pre-collected data instead of direct environment interaction, offering a safe and cost-effective solution for sequential decision-making in the real world. Ho...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Offline Reinforcement Learning (RL) optimizes policy using pre-collected data instead of direct environment interaction, offering a safe and cost-effective solution for sequential decision-making in the real world. However, it faces challenges such as distribution shift issues and vulnerability under perturbations. Researchers have developed various conservative methods to improve the robustness of offline RL. Nevertheless, model-based methods can result in transition distribution shift issues, while model-free value-based uncertainty penalty methods may not be sufficiently robust. To address these problems, we propose a new method called Robust Offline RL via Conservative Smoothing and Dynamics Controlling (RCSD). To achieve reliable value estimation of out-of-distribution (OOD) actions, RCSD uses both model-free uncertainty penalty and model-based simulation methods. It introduces a new one-step simulation method with conservative dynamics controlling to avoid value overestimation caused by transition distribution shifts. Moreover, RCSD considers both current and next states when generating OOD states to ensure cautious value estimation and efficient data utilization. RCSD uses conservative Q-smoothing and policy smoothing to strengthen the policy against sudden changes under perturbations. Experiments on D4RL benchmark demonstrate that RCSD can achieve state-of-the-art performance compared to baselines in either benchmark or adversarial attack tests.
Logical reasoning consistently plays a fundamental and significant role in the domains of knowledgeengineering and artificial intelligence. Recently, Large Language Models (LLMs) have emerged as a noteworthy innovati...
详细信息
Kolmogorov-Arnold Networks (KAN) is an emerging neural network architecture in machine learning. It has greatly interested the research community about whether KAN can be a promising alternative to the commonly used M...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Kolmogorov-Arnold Networks (KAN) is an emerging neural network architecture in machine learning. It has greatly interested the research community about whether KAN can be a promising alternative to the commonly used Multi-Layer Perceptions (MLP). Experiments in various fields demonstrated that KAN-based machine learning can achieve comparable if not better performance than MLP-based methods, but with much smaller parameter scales and are more explainable. In this paper, we explore the incorporation of KAN into the actor and critic networks for offline reinforcement learning (RL). We evaluated the performance, parameter scales, and training efficiency of various KAN and MLP-based conservative Q-learning (CQL) on the classical D4RL benchmark for offline RL. Our study demonstrates that KAN can achieve performance close to the commonly used MLP with significantly fewer parameters. This allows us to choose the base networks according to the offline RL task requirements.
Temporal information is pervasive and crucial in medical records and other clinical text,as it formulates the development process of medical conditions and is vital for clinical decision ***,providing a holistic knowl...
详细信息
Temporal information is pervasive and crucial in medical records and other clinical text,as it formulates the development process of medical conditions and is vital for clinical decision ***,providing a holistic knowledge representation and reasoning framework for various time expressions in the clinical text is *** order to capture complex temporal semantics in clinical text,we propose a novel Clinical Time Ontology(CTO)as an extension from OWL *** specifically,we identified eight timerelated problems in clinical text and created 11 core temporal classes to conceptualize the fuzzy time,cyclic time,irregular time,negations and other complex aspects of clinical ***,we extended Allen’s and TEO’s temporal relations and defined the relation concept description between complex and simple ***,we provided a formulaic and graphical presentation of complex time and complex time *** carried out empirical study on the expressiveness and usability of CTO using real-world healthcare ***,experiment results demonstrate that CTO could faithfully represent and reason over 93%of the temporal expressions,and it can cover a wider range of time-related classes in clinical domain.
Since image editing methods in real world scenarios cannot be exhausted, generalization is a core challenge for image manipulation detection, which could be severely weakened by semantically related features. In this ...
Since image editing methods in real world scenarios cannot be exhausted, generalization is a core challenge for image manipulation detection, which could be severely weakened by semantically related features. In this paper we propose SAFL-Net, which constrains a feature extractor to learn semantic-agnostic features by designing specific modules with corresponding auxiliary tasks. Applying constraints directly to the features extracted by the encoder helps it learn semantic-agnostic manipulation trace features, which prevents the biases related to semantic information within the limited training data and improves generalization capabilities. The consistency of auxiliary boundary prediction task and original region prediction task is guaranteed by a feature transformation structure. Experiments on various public datasets and comparisons in multiple dimensions demonstrate that SAFL-Net is effective for image manipulation detection.
Diffusion models are powerful generative models, and this capability can also be applied to discrimination. The inner activations of a pre-trained diffusion model can serve as features for discriminative tasks, namely...
暂无评论