Adaptive Critic methods, which approximate dynamic programming, have been used successfully for solving optimal control problems. The Adaptive Critic learning algorithm optimizes a secondary utility function that is t...
详细信息
Adaptive Critic methods, which approximate dynamic programming, have been used successfully for solving optimal control problems. The Adaptive Critic learning algorithm optimizes a secondary utility function that is the sum of the present and an future primary utility. The primary utility function measures the instantaneous cost incurred for the last action taken and the resulting state. The motivation for using a fuzzy primary utility function comes from the set of control problems for which there is only a qualitative definition of performance - for example, success or failure. Previous work in applying Adaptive Critic methods to this type of problem showed that a crisp definition of success resulted in solutions that met the control objective, but in an undesirable manner. An appropriate fuzzy utility function, on the other hand, is able to generate the optimal solution. Another motivation for incorporating fuzzy techniques into the utility function is to overcome measurement noise. Measurement noise has a significant adverse effect on the reliability and speed of Adaptive Critic learning;by incorporating fuzzy sets into the utility function, the effect of the noise can be mitigated.
In this paper we demonstrate the use of Linguistic Adaptive Critics (LACs) for the design of fuzzy control systems. This technique is related to Adaptive Critic based Approximate Dynamic programming as developed and a...
详细信息
In this paper we demonstrate the use of Linguistic Adaptive Critics (LACs) for the design of fuzzy control systems. This technique is related to Adaptive Critic based Approximate Dynamic programming as developed and applied in the neurocontrol context. We have previously shown how such techniques can be conveniently generalized to fuzzy controllers or neuro-fuzzy hybrids. We demonstrate LACs by designing a temperature controller for a simple water bath system.
The neuroethology of song learning, production, and maintenance in songbirds presents interesting similarities to human speech. We have developed a biophysical model of the manner in which song could be maintained in ...
详细信息
The neuroethology of song learning, production, and maintenance in songbirds presents interesting similarities to human speech. We have developed a biophysical model of the manner in which song could be maintained in adult songbirds. This model may inform us about the human counterpart to these processes. In songbirds, signals generated in nucleus High Vocal center (HVc) follow a direct route along a premotor pathway to the robust nucleus of the archistriatum (RA) as well as an indirect route to RA through the anterior forebrain pathway (AFP): the neurons of RA are innervated from both sources. HVc expresses very sparse bursts of spikes having interspike intervals of about 2ms. The expressions of these bursts arrive at the RA with a time difference ΔT≈50±10ms between the two pathways. The observed combination of AMPA and NMDA receptors at RA projection neurons suggests that long-term potentiation and long-term depression can both be induced by spike timing plasticity through the pairing of the HVc and AFP signals. We present a dynamical model that stabilizes this synaptic plasticity through a feedback from the RA to the AFP using known connections. The stabilization occurs dynamically and is absent when the RA→AFP connection is removed. This requires a dynamical selection of ΔT. The model does this, and ΔT lies within the observed range. Our model represents an illustration of a functional consequence of activity-dependent plasticity directly connected with neuroethological observations. Within the model the parameters of the AFP, and thus the magnitude of ΔT, can also be tuned to an unstable regime. This means that destabilization might be induced by neuromodulation of the AFP.
Adaptive critic methods, which approximate dynamic programming, have been used successfully for solving optimal control problems. The adaptive critic learning algorithm optimizes a secondary utility function that is t...
详细信息
Adaptive critic methods, which approximate dynamic programming, have been used successfully for solving optimal control problems. The adaptive critic learning algorithm optimizes a secondary utility function that is the sum of the present and all future primary utility. The primary utility function measures the instantaneous cost incurred for the last action taken and the resulting state. The motivation for using a fuzzy primary utility function comes from the set of control problems for which there is only a qualitative definition of performance - for example, success or failure. Previous work in applying adaptive critic methods to this type of problem showed that a crisp definition of success resulted in solutions that met the control objective, but in an undesirable manner. An appropriate fuzzy utility function, on the other hand, is able to generate the optimal solution. Another motivation for incorporating fuzzy techniques into the utility function is to overcome measurement noise. Measurement noise has a significant adverse effect on the reliability and speed of adaptive critic learning; by incorporating fuzzy sets into the utility function, the effect of the noise can be mitigated.
Finite size effects on the cooperative thermal denaturation of proteins are considered. A dimensionless measure of cooperativity, Ωc, scales as Nζ, where N is the number of amino acids. Surprisingly, we find that ζ...
详细信息
Finite size effects on the cooperative thermal denaturation of proteins are considered. A dimensionless measure of cooperativity, Ωc, scales as Nζ, where N is the number of amino acids. Surprisingly, we find that ζ is universal with ζ=1+γ, where the exponent γ characterizes the divergence of the susceptibility for a self-avoiding walk. Our lattice model simulations and experimental data are consistent with the theory. Our finding rationalizes the marginal stability of proteins and substantiates the earlier predictions that the efficient folding of two-state proteins requires TF≈Tθ, where Tθ and TF are the collapse and folding transition temperatures, respectively.
A simplified set of equations is derived systematically below for the interaction of large scale flow fields and precipitation in the tropical atmosphere. These equations, the Tropical Climate Model, have the form of ...
详细信息
In this work, the radial time-independent Schrödinger equation of a screened Coulomb potential system at the zero energy limit is first converted to a weighted eigenvalue problem of an ordinary differential opera...
详细信息
In this work, a novel approximation scheme based on a recently developed representation, called the High Dimensional Model Representation (HDMR), is proposed to approximate evolution operators. The approximation is no...
详细信息
This work deals with the optimal control of one dimensional quantum harmonic oscillator under an external field characterized by a linear dipole function. The penalty term is taken as kinetic energy. The objective ope...
详细信息
This work aims to apply High Dimensional Model Representation (HDMR) to the sensitivity coefficient determination of the solutions of a multivariate extrema problem. The derivations are made for general functional str...
详细信息
暂无评论