In the this paper, a CMAC-Q-learning based Dyna agent is presented to relieve the problem of learning speed in reinforcement learning, in order to achieve the goals of shortening training process and increasing the le...
In the this paper, a CMAC-Q-learning based Dyna agent is presented to relieve the problem of learning speed in reinforcement learning, in order to achieve the goals of shortening training process and increasing the learning, speed. We combine CMAC, Q-learning, and prioritized sweeping techniques to construct the Dyna agent in which a Q-learning is trained for policy learning; meanwhile, model approximators, called CMAC-model and CMAC-R-model, are in charge of approximating the environment model. The approximated model provides the Q-learning with virtual interaction experience to further update the policy within the time gap when there is no interplay between the agent and the real environment. The Dyna agent switches seamlessly between the real environment and the virtual environment model for the objective of policy learning. A simulation for controlling a differential-drive mobile robot has been conducted to demonstrate that the proposed method can preliminarily achieve the design goal.
A generalized architecture of an efficient digital signal processor using the residue number system (RNS) is proposed. It is based on using our new residue multipliers-accumulators (MACs) as the main building blocks. ...
ISBN:
(纸本)9780863419317
A generalized architecture of an efficient digital signal processor using the residue number system (RNS) is proposed. It is based on using our new residue multipliers-accumulators (MACs) as the main building blocks. This architecture offers potentially higher throughput thanks to the possibility of implementing very low-level pipelining. The maximal applicable clock frequency could be determined by the delay of only a few stages of full-adders.
The design of multiplier-accumulators (MACs) that could be used to build digital filters using the residue number system (RNS) is considered. The generalized architectures of residue MACs modulo A built using carry-sa...
ISBN:
(纸本)9780863419317
The design of multiplier-accumulators (MACs) that could be used to build digital filters using the residue number system (RNS) is considered. The generalized architectures of residue MACs modulo A built using carry-save adders (CSAs) are proposed. Unlike some earlier designs, the MACs are not limited to the moduli A of the form (2degplusmn1) and 2deg, which hence provides a designer with more choices of moduli A that could be used to form an RNS with the required dynamic range.
The aging effect can be understood as a deterioration of a machine (tool) that increases a time required to manufacture products. It is highly desirable to minimize the negative influence of this effect, which in case...
详细信息
The aging effect can be understood as a deterioration of a machine (tool) that increases a time required to manufacture products. It is highly desirable to minimize the negative influence of this effect, which in case of CNC machine is usually compensated by the increasing of its working speed. Therefore, although a tool efficiency decreases the processing times of jobs (products) can be still kept on the desired level. On the other hand, the increasing of a working speed cases the faster decreasing of a durability of a tool and its premature failure. In this paper, we focus on optimization problems in manufacturing systems, where occur the tool wear of CNC drilling/cutting machine and its compensation at the cost of a durability of a tool. We model such problems by single machine scheduling problems with the aging effect and additional resources with two criteria: time criterion (makespan and maximum lateness) and resource consumption penalty. The objectives are the minimization of time criteria under a given resource consumption and the minimization of resource consumption under a given time criteria. Computational analysis of these problems together with solution algorithms are provided.
The paper introduces accuracy boosting extension to a novel induction of fuzzy rules from raw data using artificial immune system methods. Accuracy boosting relies on fuzzy partition learning. The modified algorithm w...
详细信息
The paper introduces accuracy boosting extension to a novel induction of fuzzy rules from raw data using artificial immune system methods. Accuracy boosting relies on fuzzy partition learning. The modified algorithm was experimentally proved to be more accurate for all learning sets containing non-crisp attributes.
The paper deals with a single machine scheduling problem with job processing times dependent on continuously-divisible resource, e.g. gas, power, energy, raw materials, catalyzer, financial outlay. Ready and delivery ...
详细信息
The paper deals with a single machine scheduling problem with job processing times dependent on continuously-divisible resource, e.g. gas, power, energy, raw materials, catalyzer, financial outlay. Ready and delivery times are also given for each job. The problem is to find a schedule of jobs and resource allocation that minimize the time by which all jobs are delivered. Genetic approach, using some proved problem properties is constructed to solve the problem being strongly NP-hard. Analysis of some computational experiment and conclusion remarks are also given.
A series of experiments aimed to generate and learn fuzzy models for the valuation of residential premises was conducted using the KEEL tool (Knowledge Extraction based on Evolutionary Learning). Four regression and f...
详细信息
This paper presents a method to increase the reliability of UAV sensor fault detection in a multi-UAV context. The method uses additional position estimations that augment individual UAV fault detection system. These ...
This paper presents a method to increase the reliability of UAV sensor fault detection in a multi-UAV context. The method uses additional position estimations that augment individual UAV fault detection system. These additional estimations are obtained using images from the same planar scene taken from two different UAVs. Since accuracy and noise level of the estimation depends on several factors, dynamic replanning of the multiUAV team can be used to obtain a better estimation in case of faults caused by slow growing errors of absolute position estimation that cannot be detected by using local FDI in the UAVs. Experimental results with data from two real UAVs are also presented.
In this paper, we show that the tracking performance of a hard disk drive actuator can be improved by using two adaptive neural networks, each of which is tailored for a specific task. The first neural network utilize...
详细信息
ISBN:
(纸本)9781424431236
In this paper, we show that the tracking performance of a hard disk drive actuator can be improved by using two adaptive neural networks, each of which is tailored for a specific task. The first neural network utilizes accelerometer signal to detect external vibrations, and compensates for its effect on hard disk drive position via feedforward action. In particular, no information on the plant, sensor and disturbance dynamics is needed in the design of this neural network disturbance compensator. The second neural network, designed to compensate for the pivot friction, uses a signum activation function to introduce nonlinearities inherent to pivot friction, thus reducing the neural network¿s burden of expectation. The stability of the proposed scheme is analyzed by the Lyapunov criterion. Simulation results show that the tracking performance of the hard disk drives can be improved significantly with the use of both neural networks compared to the case without compensation, or when only one of the networks is activated.
A series of experiments aimed to generate and learn fuzzy models for the valuation of residential premises was conducted using the KEEL tool (knowledge extraction based on evolutionary learning). Four regression and f...
详细信息
A series of experiments aimed to generate and learn fuzzy models for the valuation of residential premises was conducted using the KEEL tool (knowledge extraction based on evolutionary learning). Four regression and four post-processing algorithms were applied to several data sets. They referred to sales/purchase transactions of residential premises, which were derived from the cadastral system and the registry of real estate transactions. The results proved the usefulness of the tool to carry out laborious and tedious investigation in a relatively short time. Further research is needed to determine to what extent data coming from such sources allow to build the real estate valuation models.
暂无评论