With the development of information security, localization of image manipulations has become a hot topic. In this paper, a hybrid loss network is proposed for manipulated image forensics. First, the patch predict...
We investigate the emergence of an unconventional corner mode in two-dimensional topolectrical circuits induced by asymmetric couplings. The non-Hermitian skin effect of two kinked one-dimensional lattices with multipl...
Cooperative hunting is a typical and significant scenario for studying multi-agent behaviors, one that conventional control strategies struggle to handle because of the high dimensionality of the state space and the locality of communication. Reinforcement learning provides a framework and a set of tools for this problem through trial-and-error interaction with the environment. Though promising, it often requires a large amount of empirical sample data to learn effective hunting strategies, leading to low sample efficiency, understood here in terms of the number of training episodes the agent needs to learn effective behavior strategies. To improve sample efficiency, we propose a data enhancement strategy integrated into the centralized training with decentralized execution (CTDE) framework to train the multi-agent system. The strategy is based on a state transfer dynamics model, which we call the dynamic prediction model, that generates additional predicted data; these data are combined with the empirical data collected by interacting with the environment to achieve higher sample efficiency. Simulation results on the Webots platform show that our method outperforms some state-of-the-art methods, such as MAPPO, while achieving high sample efficiency.
Mobile robots can provide non-contact medical and rehabilitation services during the COVID-19 outbreak, and the Mobile Robot Platform (MRP) is the key component of a mobile robot. By using Mecanum wheels, the om...
In Doppler biological radar-based noncontact measurement of vital signs, effectively extracting heartbeat information from weak thoracic mechanical motion is an important problem to be solved. This...
This paper concerns fault prediction modeling for Condition-based Maintenance (CBM) of on-board railway train control systems. Based on the field data from the CTCS2-200H on-board equipment, the imbalance problem ...
With the development of online education, the problem of emotional deficiency has gradually emerged. In recent years, research on emotional interaction in online education, and the application of Affective Comp...
The high-speed train (HST) has garnered significant attention from both academia and industry due to the rapid development of railways worldwide. Millimeter wave (mmWave) communication, known for its large bandwidth, is an...
In this paper, we consider a hierarchical distributed multi-task learning (MTL) system where distributed users wish to jointly learn different models orchestrated by a central server with the help of a layer of multip...