Offline safe reinforcement learning (RL) aims to train a constraint satisfaction policy from a fixed dataset. Current state-of-the-art approaches are based on supervised learning with a conditioned policy. However, th...
详细信息
Offline safe reinforcement learning (RL) aims to train a constraint satisfaction policy from a fixed dataset. Current state-of-the-art approaches are based on supervised learning with a conditioned policy. However, these approaches fall short in real-world applications that involve complex tasks with rich temporal and logical structures. In this paper, we propose temporal logic Specification-conditioned Decision Transformer (SDT), a novel framework that harnesses the expressive power of signal temporal logic (STL) to specify complex temporal rules that an agent should follow and the sequential modeling capability of Decision Transformer (DT). Empirical evaluations on the DSRL benchmarks demonstrate the better capacity of SDT in learning safe and high-reward policies compared with existing approaches. In addition, SDT shows good alignment with respect to different desired degrees of satisfaction of the STL specification that it is conditioned on. Copyright 2024 by the author(s)
Braess s paradox is a counterintuitive and undesirable phenomenon, in which for a given graph with prescribed source and sink vertices and cost functions for all edges, removal of edges decreases the cost of a Nash fl...
详细信息
We show that a classical spin liquid phase can emerge from an ordered magnetic state in the two-dimensional frustrated Shastry-Sutherland Ising lattice due to lateral confinement. Two distinct classical spin liquid st...
详细信息
We show that a classical spin liquid phase can emerge from an ordered magnetic state in the two-dimensional frustrated Shastry-Sutherland Ising lattice due to lateral confinement. Two distinct classical spin liquid states are stabilized: (i) long-range spin-correlated dimers, and (ii) exponentially decaying spin-correlated disordered states, depending on widths of W=3n, 3n+1 or W=3n+2,n being a positive integer. Stabilization of spin liquids in a square-triangular lattice moves beyond the conventional geometric paradigm of kagome, triangular, or tetrahedral arrangements of antiferromagnetic ions, where spin liquids have been discussed conventionally.
Damage to parcels reduces customer satisfactionwith delivery services and increases return-logistics *** can be prevented by detecting and addressing the damage before the parcels reach the ***,various studies have be...
详细信息
Damage to parcels reduces customer satisfactionwith delivery services and increases return-logistics *** can be prevented by detecting and addressing the damage before the parcels reach the ***,various studies have been conducted on deep learning techniques related to the detection of parcel *** study proposes a deep learning-based damage detectionmethod for various types of *** is intended to be part of a parcel information-recognition systemthat identifies the volume and shipping information of parcels,and determines whether they are damaged;this method is intended for use in the actual parcel-transportation *** this purpose,1)the study acquired image data in an environment simulating the actual parcel-transportation process,and 2)the training dataset was expanded based on StyleGAN3 with adaptive discriminator ***,3)a preliminary distinction was made between the appearance of parcels and their damage status to enhance the performance of the parcel damage detection model and analyze the causes of parcel ***,using the dataset constructed based on the proposed method,a damage type detection model was trained,and its mean average precision was *** model can improve customer satisfaction and reduce return costs for parcel delivery companies.
We study the role of material nonlocality (spatial dispersion) in dynamical Casimir effects in time-varying frequency-dispersive nanophotonic systems. We first show that local models may lead to nonphysical prediction...
详细信息
We study the role of material nonlocality (spatial dispersion) in dynamical Casimir effects in time-varying frequency-dispersive nanophotonic systems. We first show that local models may lead to nonphysical predictions, such as diverging emission rates of entangled polariton pairs. We then theoretically demonstrate that nonlocality regularizes this behavior by correcting the asymptotic response of the system for large wave vectors and leads to physical effects missed by local models, including a significant broadening of the emission rate distribution, which are relevant for future experimental observations. Our work sheds light on the importance of nonlocal effects in this new frontier of nanophotonics.
We consider word-of-mouth social learning involving m Kalman filter agents that operate sequentially. The first Kalman filter receives the raw observations, while each subsequent Kalman filter receives a noisy measure...
详细信息
Among various power system disturbances,cascading failures are considered the most serious and extreme threats to grid operations,potentially leading to significant stability issues or even widespread power *** power ...
详细信息
Among various power system disturbances,cascading failures are considered the most serious and extreme threats to grid operations,potentially leading to significant stability issues or even widespread power *** power systems’behaviors during cascading failures is of great importance to comprehend how failures originate and propagate,as well as to develop effective preventive and mitigative control *** intricate mechanism of cascading failures,characterized by multi-timescale dynamics,presents exceptional challenges for their *** paper provides a comprehensive review of simulation models for cascading failures,providing a systematic categorization and a comparison of these *** challenges and potential research directions for the future are also discussed.
This paper proposes a novel ZQ calibration method based on a reference voltage loop operation. ZQ calibration technology improves the integrity of signals transmitted on the channel by calibrating on-die termination (...
详细信息
The next generation of communication devices will require robust connectivity for millions of ground devices such as sensors or mobile devices in remote or disaster-stricken areas to be connected to the network. Non-t...
详细信息
The International Solid-State Circuits Conference (ISSCC) is the flagship conference of the IEEE Solid-State Circuits Society. The theme for ISSCC 2022 is 'Intelligent Silicon for a Sustainable World.' This th...
详细信息
暂无评论