In the context of multi-agent systems, we are proposing a hierarchical robot control architecture that comprises artificial intelligence (AI) techniques and traditional control methodologies, based on the realization ...
详细信息
In the context of multi-agent systems, we are proposing a hierarchical robot control architecture that comprises artificial intelligence (AI) techniques and traditional control methodologies, based on the realization of a learning team of agents in a continuous problem setting. In a multi-agent system, action selection is important for cooperation and coordination among the agents. By employing reinforcement learning (RL) methods in a fuzzified state-space, we accomplish to design a control architecture and a corresponding methodology, engaged in a continuous space, which enables the agents to learn, over a period of time, to perform sequences of continuous actions in a cooperative manner, in order to reach their goal without any prior generated task model. By organizing the agents in a nested architecture, as proposed in this work, a type of problem-specific recursive knowledge acquisition is attempted. Furthermore, the agents try to exploit the knowledge gathered in order to be in position to execute tasks that indicate certain degree of similarity. The agents correspond in fact to independent degrees of freedom of the system, and achieve to gain experience over the task that they collaboratively perform, by exploring and exploiting their state-to-action mapping space. A numerical experiment is presented in this paper, performed on a simulated planar 4 degrees of freedom (DOF) manipulator, in order to evaluate both the proposed hierarchical multi-agent architecture as well as the proposed methodological framework. It is anticipated that such an approach can be highly scalable for the control of robotic systems that are kinematically more complex, comprising multiple DOFs and potentially redundancies in open or closed kinematic chains, particularly dexterous manipulators.
In the this paper, a CMAC-Q-learning based Dyna agent is presented to relieve the problem of learning speed in reinforcement learning, in order to achieve the goals of shortening training process and increasing the le...
In the this paper, a CMAC-Q-learning based Dyna agent is presented to relieve the problem of learning speed in reinforcement learning, in order to achieve the goals of shortening training process and increasing the learning, speed. We combine CMAC, Q-learning, and prioritized sweeping techniques to construct the Dyna agent in which a Q-learning is trained for policy learning; meanwhile, model approximators, called CMAC-model and CMAC-R-model, are in charge of approximating the environment model. The approximated model provides the Q-learning with virtual interaction experience to further update the policy within the time gap when there is no interplay between the agent and the real environment. The Dyna agent switches seamlessly between the real environment and the virtual environment model for the objective of policy learning. A simulation for controlling a differential-drive mobile robot has been conducted to demonstrate that the proposed method can preliminarily achieve the design goal.
This paper proposes the application of an adaptive impedance control scheme to alleviate some of the problems associated with the presence of time delays in a haptic teleoperation system. Continuous on-line estimation...
详细信息
This paper proposes the application of an adaptive impedance control scheme to alleviate some of the problems associated with the presence of time delays in a haptic teleoperation system. Continuous on-line estimation of the remote environment's impedance is performed, and is then used as a local model for haptic display control. Lyapunov stability of the proposed impedance adaptation law is demonstrated. A series of experiments is performed to evaluate the performance of this teleoperation control scheme. Two performance measures are defined to assess transparency and stability of the teleoperator. Simulation results show the superior performance of the proposed adaptive scheme, with respect to direct teleoperation, particularly in terms of increasing the stability margin and of significantly ameliorating transparency in the presence of large time delays. Experimental results, using a phantom omni as the haptic master device, support this conclusion.
We propose robust adaptive control designs for a twin-rotor aircraft with specific focus on attaining good disturbance attenuation properties to facilitate operation in a shipboard environment wherein the aircraft is ...
详细信息
We propose robust adaptive control designs for a twin-rotor aircraft with specific focus on attaining good disturbance attenuation properties to facilitate operation in a shipboard environment wherein the aircraft is subject to severe aerodynamic disturbances including ship airwake, deck vortices, and rotor downwash especially during ship roll and pitch motions. Furthermore, the controllers can operate on their own or can gracefully co-exist with a baseline controller in a control augmentation fashion thus minimizing flight software change impact. The performance of the proposed controllers was validated through extensive simulation studies.
This paper describes the motivation and learning subsystems of Arisco which is a mechatronic head with interactive capacity which includes high expressivity through gesturing, voice recognition, text to speech generat...
详细信息
ISBN:
(纸本)9781424420575
This paper describes the motivation and learning subsystems of Arisco which is a mechatronic head with interactive capacity which includes high expressivity through gesturing, voice recognition, text to speech generation, visual tracking, and internet information retrieval. The general architecture is first described in the paper. Then, the learning capacity of Arisco is addressed. It learns and performs associations between different stimulus responses through several dynamic neural networks, guided by motivational drives. A number of experiments are discussed, covering stimulus competition, habituation, classical and operant conditioning.
Mapping dynamic environments is an open issue in the field of robotics. In this paper, we extend the well known Occupancy Grid structure to address the problem of generating valid maps for dynamic indoor environments....
详细信息
The problem of Input Output Decoupling is studied for the case of general neutral multi-delay systems, via proportional realizable output feedback. Using a pure algebraic approach, the necessary and sufficient conditi...
详细信息
The necessary and sufficient conditions for the solvability of the exact model matching problem for general neutral single input - single output multi-delay systems, via a realizable dynamic output feedback and a real...
详细信息
暂无评论