A major challenge in robotics is to design robust policies which enable complex and agile behaviors in the real world. On one end of the spectrum, we have model-free reinforcement learning (MFRL), which is incredibly ...
详细信息
ISBN:
(数字)9798350384574
ISBN:
(纸本)9798350384581
A major challenge in robotics is to design robust policies which enable complex and agile behaviors in the real world. On one end of the spectrum, we have model-free reinforcement learning (MFRL), which is incredibly flexible and general but often results in brittle policies. In contrast, model predictive control (MPC) continually re-plans at each time step to remain robust to perturbations and model inaccuracies. However, despite its real-world successes, MPC often under-performs the optimal strategy. This is due to model quality, myopic behavior from short planning horizons, and approximations due to computational constraints. And even with a perfect model and enough compute, MPC can get stuck in bad local optima, depending heavily on the quality of the optimization algorithm. To this end, we propose Deep Model Predictive Optimization (DMPO), which learns the inner-loop of an MPC optimization algorithm directly via experience, specifically tailored to the needs of the control problem. We evaluate DMPO on a real quadrotor agile trajectory tracking task, on which it improves performance over a baseline MPC algorithm for a given computational budget. It can outperform the best MPC algorithm by up to 27% with fewer samples and an end-to-end policy trained with MFRL by 19%. Moreover, because DMPO requires fewer samples, it can also achieve these benefits with 4.3× less memory. When we subject the quadrotor to turbulent wind fields with an attached drag plate, DMPO can adapt zero-shot while still outperforming all baselines. Additional results can be found at https://***/mr2ywmnw.
Mining dense subgraphs on multilayer graphs offers the opportunity for more in-depth discoveries than classical dense subgraph mining on single-layer graphs. However, the existing approaches fail to ensure the densene...
详细信息
ISBN:
(数字)9798350317152
ISBN:
(纸本)9798350317169
Mining dense subgraphs on multilayer graphs offers the opportunity for more in-depth discoveries than classical dense subgraph mining on single-layer graphs. However, the existing approaches fail to ensure the denseness of a discovered subgraph on layers of users' interest and simultaneously gain partial supports on the denseness from other layers. In this paper, we introduce a novel dense subgraph model called FocusCore (FoCore) for multilayer graphs, which can pay more attention to layers focused on by users. The FoCore decomposition problem, i.e., identifying all nonempty FoCores in a multilayer graph, can be addressed by executing the peeling process with respect to all possible configurations of focus and background layers. By utilizing the nice properties of FoCores, we devise an interleaved peeling algorithm and a vertex-centric algorithm towards efficient FoCore decomposition. As an application, we propose a FoCore-decomposition-based algorithm to approximate the densest subgraph in a multilayer graph with a provable approximation guarantee. The extensive experiments on real-world datasets verify the effectiveness of the FoCore model and the efficiency of the proposed algorithms.
The background of vegetable disease images is complex, with the diseased areas exhibiting minimal contrast against the backdrop, making the precise identification of foliar diseases in vegetables a significant challen...
详细信息
ISBN:
(数字)9798331517090
ISBN:
(纸本)9798331517106
The background of vegetable disease images is complex, with the diseased areas exhibiting minimal contrast against the backdrop, making the precise identification of foliar diseases in vegetables a significant challenge in agricultural production. This study aims to address the insufficiencies in recognition accuracy inherent to traditional identification models by implementing enhancements based on the Swin Transformer model. The paper proposes the design of a local perception module that, by expanding the size of the perceptual field, enhances the capability for local feature extraction to optimize model recognition accuracy. Additionally, the Poly Loss function is employed to address the issue of data imbalance within the vegetable disease dataset. Experimental results indicate that the model proposed herein consistently outperforms other recognition models in comparative assessments, achieving an accuracy rate of 97.14%, which represents an approximate 3.6% improvement over the baseline model. The model holds promising prospects for application in identifying foliar diseases of vegetables.
An opinion illusion refers to a phenomenon in social networks where agents may witness distributions of opinions among their neighbours that do not accurately reflect the true distribution of opinions in the populatio...
详细信息
ISBN:
(纸本)9798400714269
An opinion illusion refers to a phenomenon in social networks where agents may witness distributions of opinions among their neighbours that do not accurately reflect the true distribution of opinions in the population as a whole. A specific case of this occurs when there are only two possible choices, such as whether to receive the COVID-19 vaccine or vote on EU membership, which is commonly referred to as a majority illusion. In this work, we study the topological properties of social networks that lead to opinion illusions and focus on minimizing the number of agents that need to be influenced to eliminate these illusions. To do so, we propose an initial, but systematic study of the algorithmic behaviour of this *** show that the problem is NP-hard even for underlying topologies that are rather restrictive, being planar and of bounded diameter. We then look for exact algorithms that scale well as the input grows (FPT). We argue the in-existence of such algorithms even when the number of vertices that must be influenced is bounded, or when the social network is arranged in a ''path-like'' fashion (has bounded pathwidth). On the positive side, we present an FPT algorithm for networks with a ''star-like'' structure (bounded vertex cover number). Finally, we construct an FPT algorithm for ''tree-like'' networks (bounded treewidth) when the number of vertices that must be influenced is bounded. This algorithm is then used to provide a PTAS for planar graphs.
In this paper, the near-field tracking problem harnessing the non-line-of-sight (NLoS) paths is investigated for reconfigurable intelligent surface (RIS)-aided systems. In order to reduce the complexity of the positio...
详细信息
ISBN:
(数字)9788831299107
ISBN:
(纸本)9798350366327
In this paper, the near-field tracking problem harnessing the non-line-of-sight (NLoS) paths is investigated for reconfigurable intelligent surface (RIS)-aided systems. In order to reduce the complexity of the position estimation, the near-field signal model, where the virtual line-of-sight (VLoS) paths and NLoS paths coexist, is constructed by employing the subarray far-field model. A probability transition model for the tracking problem is established, as well as the corresponding factor graph. Then, a low complexity near-field tracking algorithm based on message passing is developed to jointly estimate the UE's and scatterers' positions in each time slot. Numerical results show that the mean square error (MSE) of the proposed algorithm performs close to the misspecified Cramér-Rao Lower Bound (MCRLB), and validate the performance gain of harnessing the NLoS paths.
Grid-based methods in sparse signal reconstruction (SSR) are well-regarded for their efficacy in direction-of-arrival (DoA) estimation. This paper presents the EP (Expectation Propagation)-SURE (Stein's Unbiased R...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Grid-based methods in sparse signal reconstruction (SSR) are well-regarded for their efficacy in direction-of-arrival (DoA) estimation. This paper presents the EP (Expectation Propagation)-SURE (Stein's Unbiased Risk Estimate)-SBL (Sparse Bayesian Learning) algorithm, designed for single snapshot DoA estimation. The algorithm divides DoA estimation into two parts: grid-on estimation and off-grid error estimation, employing first-order and second-order Taylor expansions. In grid-on estimation, sparse Bayesian learning is employed for sparse modeling. To tackle hyperparameter estimation challenges within sparse Bayesian learning, the algorithm adopts SURE estimator instead of the commonly-used expectation-maximization (EM) approach. For off-grid error estimation, the algorithm utilizes the EP technique to handle high-dimensional, non-tractable integration in posterior mean calculations. The feasibility and effectiveness of the proposed algorithm are validated through extensive simulations.
Two various versions of the Modified Volume Integral Equation Method (MVIEM) for solving the diffraction problem on a permeable, inhomogeneous body located in free space are presented. The results obtained using these...
详细信息
ISBN:
(数字)9781665465687
ISBN:
(纸本)9781665465694
Two various versions of the Modified Volume Integral Equation Method (MVIEM) for solving the diffraction problem on a permeable, inhomogeneous body located in free space are presented. The results obtained using these algorithms are compared to those obtained using the Modified Method of Discrete Sources, by the examples of diffraction problems on a homogeneous sphere, homogeneous spheroid, and round cylinder. MVIEM has also applied to non axisymmetrical bodies specifically, the problem of diffraction on an inhomogeneous rectangular parallelepiped.
The rapidly changing nature of information world-wide often leads to incomplete and obsolete knowledge facts stored in knowledge bases (KBs). Therefore, reasoning over the dynamic KB sequences, which targets at knowle...
详细信息
ISBN:
(数字)9798350317152
ISBN:
(纸本)9798350317169
The rapidly changing nature of information world-wide often leads to incomplete and obsolete knowledge facts stored in knowledge bases (KBs). Therefore, reasoning over the dynamic KB sequences, which targets at knowledge inference from evolving facts, is of great importance to maintain KB completeness as well as freshness. Existing approaches for KB updating mainly either focus on knowledge representation learning methods, which suffer from lack of interpretability, or attempt to mine path-based logical rules, which are limited in capturing structural semantics of KB. In this work, we present KartGPS, a system for KB updating taking advantage of temporal graph pattern-based semantic (tGPS) rules. Specifically, the tGPS rules are learned from KB sequences and thus are capable of capturing both temporal and topological regularities of KBs along the evolving of time. Due to the huge amount and imperfect quality of tGPS rules, directly generating and applying all generated rules in a brute-force manner for knowledge updating over large-scale KB sequences would be highly time-consuming and error-prone. Therefore, we investigate the problem of Knowledge Update Rule Discovery (KURD), which aims at deriving an optimal subset of tGPS rules for performing knowledge updating, considering the rule quality and coverage. We show that the KURD problem is NP-hard and design two effective approximation algorithms with greedy and pruning strategies. We demonstrate the effectiveness and efficiency of proposed approaches by extensive experiments on real-world KB datasets.
We analyze quantum fluctuation effects at the onset of incommensurate 2kF charge- or spin-density wave order in two-dimensional metals, for a model where the ordering wave vector Q connects a single pair of hot spots ...
详细信息
Flexible and scalable decentralized learning solutions are fundamentally important in the application of multi-agent systems. While several recent approaches introduce (ensembles of) kernel machines in the distributed...
详细信息
ISBN:
(数字)9798350368741
ISBN:
(纸本)9798350368758
Flexible and scalable decentralized learning solutions are fundamentally important in the application of multi-agent systems. While several recent approaches introduce (ensembles of) kernel machines in the distributed setting, Bayesian solutions are much more limited. We introduce a fully decentralized, asymptotically exact solution to computing the random feature approximation of Gaussian processes. We further address the choice of hyperparameters by introducing an ensembling scheme for Bayesian multiple kernel learning based on online Bayesian model averaging. The resulting algorithm is tested against Bayesian and frequentist methods on simulated and real-world datasets.
暂无评论