The paper considers the problem of global stabilization of an underactuated autonomous underwater vehicle (AUV) to a point, with a desired orientation. Controllability and stabilizability properties of the vehicle mod...
详细信息
The paper considers the problem of global stabilization of an underactuated autonomous underwater vehicle (AUV) to a point, with a desired orientation. Controllability and stabilizability properties of the vehicle model are discussed and a logic-based hybrid controller is proposed that yields global convergence of the AUV to an arbitrarily small neighborhood of the target point. Convergence and stability of the closed loop system are analyzed. To illustrate the control law developed, simulation results are presented using the model of the Sirene AUV.
The paper discusses new developments of the data fusion paradigm due to Cortesao and Koeppe (1999, 2000). A bank of Kalman filters is analyzed in the fusion process. Experiments for a robotic compliant motion task (pe...
详细信息
The paper discusses new developments of the data fusion paradigm due to Cortesao and Koeppe (1999, 2000). A bank of Kalman filters is analyzed in the fusion process. Experiments for a robotic compliant motion task (peg-in-hole) emerged from human skills are reported. Stereo vision and pose sense are fused to execute the task. Feedforward artificial neural networks (ANNs) are trained to transfer human skills to robotic manipulators.
Cooperation in learning improves the speed of convergence and the quality of learning. Special care is needed when heterogeneous agents cooperate in learning. It is discussed that, cooperation in learning may cause th...
详细信息
Cooperation in learning improves the speed of convergence and the quality of learning. Special care is needed when heterogeneous agents cooperate in learning. It is discussed that, cooperation in learning may cause the learning process to diverge if heterogeneity is not handled properly. In this paper, it is assumed that two heterogeneous Q-learning agents cooperate to learn. The heterogeneity is assumed in their action order (and not in their action set). A Q-learning-based method is introduced for the agents to learn the mapping among their actions. It is shown that, the agents are able to learn this mapping while cooperating in learning. Some simulation results are reported to show the effectiveness of the proposed method.
Q-learning is widely used in many multi agent systems. In most cases, a separate critic is considered for qualifying each individual agent behavior or it is assumed that the critic is completely aware of effects of al...
详细信息
Q-learning is widely used in many multi agent systems. In most cases, a separate critic is considered for qualifying each individual agent behavior or it is assumed that the critic is completely aware of effects of all agents' actions on the team qualification. But, in many cases, the role of each team member in the group performance is not known. In order to distribute a common credit among the agents, a suitable criterion must be provided to estimate the role of each agent in the team performance and to judge if an agent has done a wrong action. In this paper two such criteria, named certainty and expertness, for a team of agents with parallel tasks are introduced. In addition, two methods for reinforcing the agents based on the proposed measures are provided. Some simulation results are also reported to show the effectiveness of the proposed measures and methods.
Using other agents' experiences and knowledge, a learning agent may learn faster, make fewer mistakes, and create some rides for unseen situations. These benefits will be gained if the learning agents know the are...
详细信息
Using other agents' experiences and knowledge, a learning agent may learn faster, make fewer mistakes, and create some rides for unseen situations. These benefits will be gained if the learning agents know the area of expertise and the expertness values of each other. In this paper, some Q-learning agents with different skills and expertness levels cooperate in learning. The agents use some criteria to judge others information and knowledge. Four expertness criterion, certainty and entropy measures are used to assign degrees of importance to others' Q-Tables. Effects of measuring these values based on their whole Q-Table, a portion of Q-Tables that reflects their proficiencies, and the states in Q-Tables on the learning quality are studied. Simple strategy sharing and two different weighted strategy-sharing methods are used to combine the acquired knowledge from different agents.
In multiagent reinforcement learning, inter-agent credit assignment is a fundamental problem, since a single scalar reinforcement signal is the only reliable feedback that teams of learning agents receive. This proble...
详细信息
In multiagent reinforcement learning, inter-agent credit assignment is a fundamental problem, since a single scalar reinforcement signal is the only reliable feedback that teams of learning agents receive. This problem is more critical in groups of independent learners with a joint task. In this research, it is assumed that a critic agent receives the environment feedback and assigns a proper credit to each agent using some measures. Three of such measures for a team of cooperative agents with a parallel and AND-type task are introduced. These measures somehow compare the agents' knowledge. One of these criteria, called normal expertness, is a non-relative measure while two other ones (certainty and relative normal expertness) are relative measure. It is experimentally shown that relative measures work better as they contain more information for the critic agent.
A new approach for an adaptive neuro-fuzzy inference system for modeling and control is proposed. This approach uses a general regression neural network with a different learning capability from the classical clusteri...
详细信息
A new approach for an adaptive neuro-fuzzy inference system for modeling and control is proposed. This approach uses a general regression neural network with a different learning capability from the classical clustering algorithm normally used by this specific network. The antecedent parameters of the regression network are obtained through an iterative grid partition process instead of the usual gradient descent algorithm or the classical grid partition method in the literature of neural network modeling. The membership functions used in the antecedent part are asymmetric and with varying shapes (triangles, gaussian, trapezoidal, etc) which is less common in the fuzzy modeling literature. The consequent parameters are obtained using the least squares estimates algorithm. In the simulation, the adaptive neuro-fuzzy inference system architecture is used to model a nonlinear function and to control the motion of a helicopter in the hover flight mode with promising results.
An essential factor in understanding the motor learning capability of humans, is the coordinate transformation learning of the visual feedback controller. Although a number of learning models for the visual feedback c...
详细信息
An essential factor in understanding the motor learning capability of humans, is the coordinate transformation learning of the visual feedback controller. Although a number of learning models for the visual feedback controller have been proposed, none has been able to establish a definitive approach. In our previous work, we have suggested a learning model that uses disturbance noise and the feedback error signal to learn the human visual feedback controller's coordinate transformation. However, the model does not fully consider the time delay in the visual feedback control loop. This paper presents a modified learning model taking into account the time delay and the convergence properties of the model. Numerical simulations are presented to illustrate the effectiveness of the proposed approach.
Algorithms for tracking generic 2D object boundaries in a video sequence should not make strong assumptions about the shapes to be tracked. When only a weak prior is at hand, the tracker performance becomes heavily de...
详细信息
暂无评论