In this article the problem of characterizing sets, described by vector nonlinear inequalities of the form v(x) w, as robustly positively invariant and targets of uniformly ultimate bounded nonlinear systems is invest...
详细信息
Animals use stochastic search strategies to find a potential nest or to localize the food source. Animals do not rely much on the environmental cues or their sensory inputs. Since their cognitive capabilities are rath...
详细信息
We analyze losses resulting from uncertain transition probabilities in Markov decision processes with bounded nonnegative rewards. We assume that policies are precomputed using exact dynamic programming with the estim...
详细信息
ISBN:
(纸本)9781467320658
We analyze losses resulting from uncertain transition probabilities in Markov decision processes with bounded nonnegative rewards. We assume that policies are precomputed using exact dynamic programming with the estimated transition probabilities, but the system evolves according to different, true transition probabilities. Given a bound on the total variation error of estimated transition probability distributions, we derive upper bounds on the loss of expected total reward. The approach analyzes the growth of errors incurred by stepping backwards in time while precomputing value functions, which requires bounding a multilinear program. Loss bounds are given for the finite horizon undiscounted, finite horizon discounted, and infinite horizon discounted cases, and a tight example is shown.
作者:
Marko BunicStjepan BogdanUniversity of Zagreb
Faculty of Electrical Engineering and Computing Department of Control and Computer Engineering LARICS–Laboratory for Robotics and Intelligent Control Systems Zagreb Croatia
This paper presents the extension of the previously proposed method for multi-agent formation control based on potential function. The method derived for 2D space is extended to 3D. It has been shown that the control ...
This paper presents the extension of the previously proposed method for multi-agent formation control based on potential function. The method derived for 2D space is extended to 3D. It has been shown that the control algorithm keeps all the properties of the original scheme in case of multi-agent formation moving in 3D. An adaptation mechanism that assures avoidance of unwanted stable equilibria, used in 2D, is implemented in the same form for 3D formations. The obtained simulation results demonstrate stable behavior of the system for various sets of parameters - the desired 3D formation is reached in finite time and maintained during trajectory execution.
The paper deals with the problem of unveiling the link structure of a network of linear dynamical systems. A technique is provided guaranteeing an exact detection of the links of a network of dynamical systems with no...
详细信息
ISBN:
(纸本)9781467320658
The paper deals with the problem of unveiling the link structure of a network of linear dynamical systems. A technique is provided guaranteeing an exact detection of the links of a network of dynamical systems with no undirected cycles (Linear Dynamic Polytrees). In particular, the presence of unobserved (latent) nodes is taken into account. Knowledge on the specific number of hidden processes is not required. It is proven that the topology can be consistently reconstructed, as long the degree of each latent node is at least three with outdegree of at least two. The result extends previous work that was limited to a more restricted class of dynamical systems (Rooted Trees).
This paper examines the value of storage in securing reliability of a system with uncertain supply and demand, and supply friction. The storage is frictionless as a supply source, but it cannot be filled up instantane...
详细信息
ISBN:
(纸本)9781457710957
This paper examines the value of storage in securing reliability of a system with uncertain supply and demand, and supply friction. The storage is frictionless as a supply source, but it cannot be filled up instantaneously. The focus is on application to an energy network in which the nominal supply and demand are assumed to match perfectly, while deviations from the nominal values are modeled as random shocks with stochastic arrivals. Due to friction, the random shocks cannot be tracked by the main supply sources. Storage, when available, can be used to compensate, fully or partially, for the surge in demand or sudden drop in supply. The problem of optimal utilization of storage with the objective of maximizing system reliability is formulated as minimization of the expected discounted cost of blackouts over an infinite horizon. It is shown that when the stage cost is linear in the size of the blackout, the optimal policy is myopic in the sense that all shocks will be compensated by storage up to the available level of storage. However, when the stage cost is strictly convex, it may be optimal to curtail some of the demand and allow a small blackout in the interest of maintaining a higher level of reserve, which may help avoid a large blackout in the future. The value of storage capacity in improving reliability, as well as the effects of the associated optimal policies under different stage costs on the probability distribution of blackouts are examined.
The implementation of automated regulatory control has been around since the middle of the last century through analog means. It has allowed engineers to operate the plant more consistently by focusing on overall oper...
详细信息
Animals use stochastic search strategies to find a potential nest or to localize the food source. Animals do not rely much on the environmental cues or their sensory inputs. Since their cognitive capabilities are rath...
Animals use stochastic search strategies to find a potential nest or to localize the food source. Animals do not rely much on the environmental cues or their sensory inputs. Since their cognitive capabilities are rather limited, search strategies are usually simplistic and stochastic. And yet such simple search strategies result in high efficiency in finding potential targets in limited time intervals. Similar problems with perception of the environment and limited computational power and sensory inputs are observed on the mobile robots used in search scenarios. In spite of their limitations, such robotic systems usually perform deterministic search strategies which heavily depend on environmental cues and sensory input. The research question we are addressing in this paper is which stochastic search strategy has the best performance in terms of area coverage and can stochastic search strategies outperform deterministic search strategies.
Hybrid zero dynamics extends the Byrnes-Isidori notion of zero dynamics to a class of hybrid models called systems with impulse effects. Specifically, given a smooth submanifold that is contained in the zero set of an...
详细信息
ISBN:
(纸本)9781467320658
Hybrid zero dynamics extends the Byrnes-Isidori notion of zero dynamics to a class of hybrid models called systems with impulse effects. Specifically, given a smooth submanifold that is contained in the zero set of an output function and is invariant under both the continuous flow of the system with impulse effects as well as its reset map, the restriction dynamics is called the hybrid zero dynamics. Prior results on the stabilization of periodic orbits of the hybrid zero dynamics have relied on input-output linearization of the transverse variables. The principal result of this paper shows how control Lyapunov functions can be used to exponentially stabilize periodic orbits of the hybrid zero dynamics, thereby significantly extending the class of stabilizing controllers. An illustration of this result on a model of a bipedal walking robot is provided.
In this paper, a hierarchical neural network with cascading architecture is proposed and its application to classification is analyzed. This cascading architecture consists of multiple levels of neural network structu...
详细信息
暂无评论