Cooperation in learning improves the speed of convergence and the quality of learning. Special care is needed when heterogeneous agents cooperate in learning. It is discussed that, cooperation in learning may cause th...
详细信息
Cooperation in learning improves the speed of convergence and the quality of learning. Special care is needed when heterogeneous agents cooperate in learning. It is discussed that, cooperation in learning may cause the learning process to diverge if heterogeneity is not handled properly. In this paper, it is assumed that two heterogeneous Q-learning agents cooperate to learn. The heterogeneity is assumed in their action order (and not in their action set). A Q-learning-based method is introduced for the agents to learn the mapping among their actions. It is shown that, the agents are able to learn this mapping while cooperate in learning. Some simulation results are reported to show the effectiveness of the proposed method.
In any measuring system the categorization of the error generation factors leads to simplification of complex error problems and to higher suppression of the error. In this paper we categorize, quantify and analyze th...
详细信息
Q-learning is widely used in many multi agent systems. In most cases, a separate critic is considered for qualifying each individual agent behavior or it is assumed that the critic is completely aware of effects of al...
详细信息
Q-learning is widely used in many multi agent systems. In most cases, a separate critic is considered for qualifying each individual agent behavior or it is assumed that the critic is completely aware of effects of all agents' actions on the team qualification. But, in many cases, the role of each team member in the group performance is not known. In order to distribute a common credit among the agents, a suitable criterion must be provided to estimate the role of each agent in the team performance and to judge if an agent has done a wrong action. In this paper two such criteria, named Certainty and Expertness, for a team of agents with parallel tasks are introduced. In addition, two methods for reinforcing the agents based on the proposed measures are provided. Some simulation results are also reported to show the effectiveness of the proposed measures and methods.
Biological snakes' diverse locomotion modes and physiology make them supremely adapted for environment. The special structure of snakes and their unique movement offer them peculiar ability of climbing and moving ...
详细信息
Biological snakes' diverse locomotion modes and physiology make them supremely adapted for environment. The special structure of snakes and their unique movement offer them peculiar ability of climbing and moving even in some ill-conditioned environments such as on the marshland or in narrow tubes. The aim of this study is to elucidate a 3-dimensional motion of the snake, sinus-lifting motion, which is a phenomenon observed during rapid motion of the snake. The sinus-lifting creeping motion is an adaptive function peculiar to regular creeping motion, designed to prevent slippage in the normal direction of the body and to allow application of the greatest possible motive force. In this paper, we first discuss the controllability of the snake robot for analysis of sinus-lifting creeping mode. We utilized a 3-dimensional snake robot model to show the controllability of the robot through the computer simulation.
Entire region filling is a special type of robot path planning strategy that requires the mobile robot to cover every part of the whole workspace, which has many applications such as cleaning robots, vacuum cleaners, ...
详细信息
Entire region filling is a special type of robot path planning strategy that requires the mobile robot to cover every part of the whole workspace, which has many applications such as cleaning robots, vacuum cleaners, painter robots, land mine detectors, lawn mowers, and window cleaners. In this paper, a novel biologically inspired neural network approach is proposed for entire region filling with obstacle avoidance of a mobile cleaning robot in a nonstationary environment. The dynamics of each neuron in the topologically organized neural network is characterized by a shunting equation or an additive equation derived from Hodgkin and Huxley's (1952) membrane equation. There are only local lateral connections among neurons. Thus the computational complexity linearly depends on the neural network size. The robot path is autonomously generated from the dynamic activity landscape of the neural network and the previous robot location. The proposed model algorithm is computationally efficient. It can deal with an unstructured environment with irregular obstacles. The effectiveness of the proposed model is demonstrated by simulation results.
Membranes hold promise as a new water treatment method of the future. In this study, a device is designed to test the efficiency of membranes. The device is implemented to be controlled remotely. An Internet based rem...
详细信息
Membranes hold promise as a new water treatment method of the future. In this study, a device is designed to test the efficiency of membranes. The device is implemented to be controlled remotely. An Internet based remote control system is implemented on the membrane test device to make the users access to it easier. When using the system, a remote operator only needs a general purpose computer with Internet connection to conduct a test. The engineering objective is to perform robust control over the Internet connection. A control architecture that combines computer and the membrane testing hardware is built. This system has two primary parts, the server part and the client part. A server is used to provide the application to the operator to control the hardware. The client part is executed on the remote operator's computer. The client uses a TCP/IP protocol to connect to the server through the Internet. Communication coordination between the client and the server is developed using Java and Common Object Request Broker Architecture (CORBA).
Cooperation in learning improves the speed of convergence and the quality of learning. Special care is needed when heterogeneous agents cooperate in learning. It is discussed that, cooperation in learning may cause th...
详细信息
Cooperation in learning improves the speed of convergence and the quality of learning. Special care is needed when heterogeneous agents cooperate in learning. It is discussed that, cooperation in learning may cause the learning process to diverge if heterogeneity is not handled properly. In this paper, it is assumed that two heterogeneous Q-learning agents cooperate to learn. The heterogeneity is assumed in their action order (and not in their action set). A Q-learning-based method is introduced for the agents to learn the mapping among their actions. It is shown that, the agents are able to learn this mapping while cooperating in learning. Some simulation results are reported to show the effectiveness of the proposed method.
Q-learning is widely used in many multi agent systems. In most cases, a separate critic is considered for qualifying each individual agent behavior or it is assumed that the critic is completely aware of effects of al...
详细信息
Q-learning is widely used in many multi agent systems. In most cases, a separate critic is considered for qualifying each individual agent behavior or it is assumed that the critic is completely aware of effects of all agents' actions on the team qualification. But, in many cases, the role of each team member in the group performance is not known. In order to distribute a common credit among the agents, a suitable criterion must be provided to estimate the role of each agent in the team performance and to judge if an agent has done a wrong action. In this paper two such criteria, named certainty and expertness, for a team of agents with parallel tasks are introduced. In addition, two methods for reinforcing the agents based on the proposed measures are provided. Some simulation results are also reported to show the effectiveness of the proposed measures and methods.
An area-covering operation is a kind of complete coverage path planning, which requires the robot path to cover every part of the workspace, which is an essential issue in cleaning robots and many other robotic applic...
详细信息
An area-covering operation is a kind of complete coverage path planning, which requires the robot path to cover every part of the workspace, which is an essential issue in cleaning robots and many other robotic applications such as vacuum, robots, painter robots, land mine detectors, lawn mowers, and windows cleaners. In this paper, a novel biologically inspired neural network approach is proposed for complete coverage path planning with obstacle avoidance of a cleaning robot in a nonstationary environment. The dynamics of each neuron in the topologically organized neural network is characterized by a shunting equation or an additive equation derived from Hodgkin and Huxley's (1952) membrane equation. There are only local lateral connections among neurons. Thus the computational complexity linearly depends on the neural network size. The proposed model algorithm is computationally efficient, and can also deal with changing environment. Simulation results show that the proposed model is capable of planning collision-free complete coverage robot path.
暂无评论