In general-sum games, multiagent cooperation has no global objective, and only individual rationality is concerned. Agent's learning is based on the assumption of opponents' policies, and this assumption may b...
详细信息
In general-sum games, multiagent cooperation has no global objective, and only individual rationality is concerned. Agent's learning is based on the assumption of opponents' policies, and this assumption may be wrong. By defining the global objective of agents, a novel multiagent reinforcement learning algorithm was proposed. All agents selected negotiated policies during learning, and punished those agents deviating from negotiated policies to ensure the execution of these policies. It was proved that the learned Q values on each stage games converge under certain restrictions. An example was given to analyze the proven result.
This paper focuses on genetic optimization and filtering efficiency ofa recently developed class ofWeighted Vector Directional Filters (WVDFs), which minimize the aggregated weighted angular distances between the samp...
详细信息
In this paper, we provide a new noise reduction method for the enhancement of the images of gene chips. We demonstrate that the new technique is capable of reducing outliers present in microarray images while preservi...
详细信息
This paper encompasses the study of the output impedance impact of parallel-connected UPS inverters. Two novel nonlinear control strategies are proposed. The first one is based on the single-wire current-sharing schem...
详细信息
This paper encompasses the study of the output impedance impact of parallel-connected UPS inverters. Two novel nonlinear control strategies are proposed. The first one is based on the single-wire current-sharing scheme, which is well known in parallel dc-to-dc converter systems. The second one is a wireless control technique derived from the droop method. The output impedance of the inverters is investigated in both cases. Results of two parallel-connected 1-kVA UPS inverters show the feasibility of the proposed approach. Finally, the two proposed controllers are compared between them and those with the existing solutions.
We solve the problem of the distribution of petroleum products through oil pipelines networks. This problem is modeled and solved using two techniques: A heuristic method, a multiobjective evolutionary algorithm and m...
详细信息
We solve the problem of the distribution of petroleum products through oil pipelines networks. This problem is modeled and solved using two techniques: A heuristic method, a multiobjective evolutionary algorithm and mathematical programming. In the multiobjective evolutionary algorithm, several objective functions are defined to express the goals of the solutions as well as the preferences among them. Some constraints are included as hard objective functions and some are evaluated through a repairing function to avoid infeasible solutions. In the mathematical programming approach the multiobjective optimization is solved using the constraint method in mixed integer linear programming. Some constraints of the mathematical model are nonlinear, so they are linearized. The results obtained with both methods for three concrete networks are presented. They are compared with a hybrid solution, where we use the results obtained by mathematical programming as the seed of the evolutionary algorithm.
The paper is devoted to the problem of modeling demand for inventory management of slow-moving items in the case of reporting errors. It is proposed a generalization of the beta-binomial demand model that takes into a...
详细信息
The paper is devoted to the problem of modeling demand for inventory management of slow-moving items in the case of reporting errors. It is proposed a generalization of the beta-binomial demand model that takes into account possible reporting errors in the learning sample. For the new model, there are developed identification and forecasting algorithms that provide consistent estimators of the model parameters and mean square optimal forecasts. The efficiency of the proposed approach is illustrated by an application example for slow-moving car parts.
Our research group, interested in outdoor scenes, has developed a methodology to register a CAD model of a city, with images taken with video cameras installed in a car, while driving city streets. So, we can merge vi...
详细信息
Based on Bayes theory of hypothesis testing, a new DWTdomain decoder structure for image watermarking has been proposed in this work. The statistical distribution of wavelet coefficients is deliberately described with...
详细信息
The problem of model-based fault detection in the presence of both parametric uncertainty and noise is addressed in this paper. Intervals are used to represent the uncertainty in the system parameters and interval ext...
详细信息
The problem of model-based fault detection in the presence of both parametric uncertainty and noise is addressed in this paper. Intervals are used to represent the uncertainty in the system parameters and interval extensions of parity equations are used as adaptive threshold selectors. A proper combination in time of different (interval) parity equations, together with a robust indicator, is used to maintain a good sensitivity to faults whilst avoiding the effect of noise. The results in the application to the detection of different faults in a DC motor are used to show the good properties of the proposed method.
Magnetic measurement is a typical inverse problem in Biomedical field. In this kind of problem we always need to locate the positions and moments of one or more magnetic dipoles. Although using the traditional methods...
详细信息
Magnetic measurement is a typical inverse problem in Biomedical field. In this kind of problem we always need to locate the positions and moments of one or more magnetic dipoles. Although using the traditional methods to solve this kind of inverse problem has all kinds of shortcomings, BPNN (Back Propagation Neural Networks) method can be used to solve this typical inverse problem fast enough for real time measurement. In the traditional BPNN method, gradient descent search method is performed for error propagation. In this paper the authors propose a new algorithm that Newton method is performed for error propagation. For the cost function is highly nonconvex in the magnetic measurement problem, the new kind of BPNN can get convergent results quickly and precisely. A simulation result for this method is also presented.
暂无评论