Some of manufacturing tasks such as sawing task often requires continuous impulsive motion. In case of sawing task, such impulsive motions can be observed between the teeth of the saw and the object. The amount of the...
详细信息
ISBN:
(纸本)0780372727
Some of manufacturing tasks such as sawing task often requires continuous impulsive motion. In case of sawing task, such impulsive motions can be observed between the teeth of the saw and the object. The amount of the external impulse exerted on the object has been treated as an important control paramete. Also, the internal impulses experienced at the joints should be simultaneously taken into account to avoid serious damage or injury at the joints. The purpose of this work is to improve the efficacy of sawing task by using dual arm. For this, an external impulse model for sawing task is suggested. Also, the closed-form internal impulse models are proposed for both a single and dual arms. Based on these models, a new measure for internal impulse is proposed. A normalized impulse ellipsoid reflecting the velocity direction is employed to visualize the impact geometry. Finally, the optimal sawing region is identified in which the amount of external impulse is maximized and the amount of the internal impulse is minimized. It is demonstrated through simulation that the dual arm exhibits better sawing performance over a single arm in aspects of external and internal impulses.
Using other agents' experiences and knowledge, a learning agent may learn faster, make fewer mistakes, and create some rides for unseen situations. These benefits will be gained if the learning agents know the are...
详细信息
Using other agents' experiences and knowledge, a learning agent may learn faster, make fewer mistakes, and create some rides for unseen situations. These benefits will be gained if the learning agents know the area of expertise and the expertness values of each other. In this paper, some Q-learning agents with different skills and expertness levels cooperate in learning. The agents use some criteria to judge others information and knowledge. Four expertness criterion, certainty and entropy measures are used to assign degrees of importance to others' Q-Tables. Effects of measuring these values based on their whole Q-Table, a portion of Q-Tables that reflects their proficiencies, and the states in Q-Tables on the learning quality are studied. Simple strategy sharing and two different weighted strategy-sharing methods are used to combine the acquired knowledge from different agents.
In multiagent reinforcement learning, inter-agent credit assignment is a fundamental problem, since a single scalar reinforcement signal is the only reliable feedback that teams of learning agents receive. This proble...
详细信息
In multiagent reinforcement learning, inter-agent credit assignment is a fundamental problem, since a single scalar reinforcement signal is the only reliable feedback that teams of learning agents receive. This problem is more critical in groups of independent learners with a joint task. In this research, it is assumed that a critic agent receives the environment feedback and assigns a proper credit to each agent using some measures. Three of such measures for a team of cooperative agents with a parallel and AND-type task are introduced. These measures somehow compare the agents' knowledge. One of these criteria, called normal expertness, is a non-relative measure while two other ones (certainty and relative normal expertness) are relative measure. It is experimentally shown that relative measures work better as they contain more information for the critic agent.
We present a result on uniform attractivity of the origin for nonlinear time-varying systems. Our theorem generalizes Matrosov's theorem which extends, in a certain manner, Krasovskii-LaSalle's invariance prin...
详细信息
We present a result on uniform attractivity of the origin for nonlinear time-varying systems. Our theorem generalizes Matrosov's theorem which extends, in a certain manner, Krasovskii-LaSalle's invariance principle to the case of general nonlinear time-varying systems. We show the utility of our theorem by analyzing uniform asymptotic stability of the origin for nonholonomic: systems in closed loop with particular smooth time-varying feedbacks previously proposed in the literature.
Substantial progress has been made recently towards designing, building and test-flying remotely piloted Micro Air Vehicles (MAVs) and small UAVs. We seek to complement this progress in overcoming the aerodynamic obst...
详细信息
Substantial progress has been made recently towards designing, building and test-flying remotely piloted Micro Air Vehicles (MAVs) and small UAVs. We seek to complement this progress in overcoming the aerodynamic obstacles to flight at very small scales with a vision-guided flight stability and autonomy system, based on a robust horizon detection algorithm. In this paper, we first motivate the use of computer vision for MAV autonomy, arguing that given current sensor technology, vision may be the only practical approach to the problem. We then describe our statistical vision-based horizon detection algorithm, which has been demonstrated at 30 Hz with over 99.9% correct horizon identification. Next, we develop robust schemes for the detection of extreme MAV attitudes, where no horizon is visible, and for the detection of horizon estimation errors, due to external factors such as video transmission noise. Finally, we discuss our feedback controller for self-stabilized flight, and report results on vision-based autonomous flights of duration exceeding ten minutes.
A Markov process model for contour curvature is introduced via a stochastic differential equation. We analyze the distribution of such curves, and show that its mode is the Euler spiral, a curve minimizing changes in ...
详细信息
In the design of model predictive controller (MPC), the traditional approach of expanding the projected control signal uses the forward operator to obtain the linear-in-the-parameters relation for predicted output. In...
详细信息
In the design of model predictive controller (MPC), the traditional approach of expanding the projected control signal uses the forward operator to obtain the linear-in-the-parameters relation for predicted output. In this paper, by using a performance specification on the exponential change rate of the projected control signal, a more appropriate expansion, related to Laguerre networks, is introduced. It is shown that the number of terms used in the optimization procedure can be reduced to a fraction of that required by the usual procedure in the case of rapid sampling, and as a result, the numerical condition of the optimization algorithm can be significantly improved. Furthermore, the proposed algorithm is easy to tune for closed-loop performance with two explicit tuning parameters. By relaxing the constraint on the exponential change rate of the control signal and allowing arbitrary complexity in describing the trajectory, the proposed approach becomes equivalent to the traditional approach in MPC design.
Delay in the computation of the signal-to-interference ratio in communication systems is unavoidable. In the the case of mobile communication, delay is a very critical problem due the fast variation of the communicati...
详细信息
Delay in the computation of the signal-to-interference ratio in communication systems is unavoidable. In the the case of mobile communication, delay is a very critical problem due the fast variation of the communication channel and the need for effective, fast and accurate power control. In this paper we present our approach to dealing with the delay in mobile communication systems as well as our controller to achieve and maintain the desired signal quality. We will concentrate on the code division multiple access (CDMA) systems.
This paper concerns delay-dependent guaranteed cost control via memoryless state feedback controllers for a class of linear state-delayed systems with norm-bounded time-varying parametric uncertainties. New delay-depe...
详细信息
This paper concerns delay-dependent guaranteed cost control via memoryless state feedback controllers for a class of linear state-delayed systems with norm-bounded time-varying parametric uncertainties. New delay-dependent conditions for the existence of the guaranteed cost controller are presented in terms of matrix inequalities for both nominal state-delayed systems and uncertain state-delayed systems. An algorithm involving convex optimization is proposed to design a controller achieving a suboptimal guaranteed cost such that the system can be stabilized for all admissible uncertainties. Through numerical examples, it is shown that the proposed method can even give a lower guaranteed cost than the delay-independent method.
暂无评论