This paper develops a novel off-policy game Q-learning algorithm to solve the anti-interference control problem for discrete-time linear multi-player systems using only data without requiring system matrices to be kno...
详细信息
This paper develops a novel off-policy game Q-learning algorithm to solve the anti-interference control problem for discrete-time linear multi-player systems using only data without requiring system matrices to be known. The primary contribution of this paper lies in that the Q-learning strategy employed in the proposed algorithm is implemented in an off-policy policy iteration approach other than on-policy learning due to the well-known advantages of off-policy Q-learning over on-policy Q-learning. All of the players work hard together for the goal of minimizing their common performance index meanwhile defeating the disturbance that tries to maximize the specific performance index, and finally they reach the Nash equilibrium of the game resulting in satisfying disturbance attenuation condition. In order to find the solution to the Nash equilibrium, the anti-interference control problem is first transformed into an optimal control problem. Then an off-policy Q-learning algorithm is proposed in the framework of typical adaptive dynamic programming (ADP) and game architecture, such that control policies of all players can be learned using only measured data. Comparative simulation results are provided to verify the effectiveness of the proposed method.
The article deals with the navigation system elements development process. The movement and positioning of a two-wheeled mobile robot with a high level of accuracy are realized through this system. Also, the algorithm...
The article deals with the navigation system elements development process. The movement and positioning of a two-wheeled mobile robot with a high level of accuracy are realized through this system. Also, the algorithms mechanisms based on the construction of the optimal path for the autonomous device movement and based on a map building in an unknown area and avoiding obstacles are described. Using mathematical models, computer modeling of the device executive system is carried out using the engineering program MatLab.
This paper explores the evolution of geoscientific inquiry,tracing the progression from traditional physics-based models to modern data-driven approaches facilitated by significant advancements in artificial intellige...
详细信息
This paper explores the evolution of geoscientific inquiry,tracing the progression from traditional physics-based models to modern data-driven approaches facilitated by significant advancements in artificial intelligence(AI)and data collection *** models,which are grounded in physical and numerical frameworks,provide robust explanations by explicitly reconstructing underlying physical ***,their limitations in comprehensively capturing Earth’s complexities and uncertainties pose challenges in optimization and real-world *** contrast,contemporary data-driven models,particularly those utilizing machine learning(ML)and deep learning(DL),leverage extensive geoscience data to glean insights without requiring exhaustive theoretical *** techniques have shown promise in addressing Earth science-related ***,challenges such as data scarcity,computational demands,data privacy concerns,and the“black-box”nature of AI models hinder their seamless integration into *** integration of physics-based and data-driven methodologies into hybrid models presents an alternative *** models,which incorporate domain knowledge to guide AI methodologies,demonstrate enhanced efficiency and performance with reduced training data *** review provides a comprehensive overview of geoscientific research paradigms,emphasizing untapped opportunities at the intersection of advanced AI techniques and *** examines major methodologies,showcases advances in large-scale models,and discusses the challenges and prospects that will shape the future landscape of AI in *** paper outlines a dynamic field ripe with possibilities,poised to unlock new understandings of Earth’s complexities and further advance geoscience exploration.
The distributed nonconvex optimization problem of minimizing a global cost function formed by a sum of n local cost functions by using local information exchange is considered. This problem is an important component o...
详细信息
A novel engineering model reduction method is proposed in this paper that can be applied to a chemical reaction network (CRN) with chains of linear reactions. The reduced model is a delayed CRN with possibly different...
详细信息
Small area (or small domain) estimation is still rarely applied in business statistics, because of challenges arising from the skewness and variability of variables such as turnover. We examine a range of small area e...
详细信息
Tablet manufacturing in the pharmaceutical industry involves batch fluidized bed drying for particle moisture removal. This paper introduces five approaches for moisture content monitoring, relying either on a complex...
详细信息
Tablet manufacturing in the pharmaceutical industry involves batch fluidized bed drying for particle moisture removal. This paper introduces five approaches for moisture content monitoring, relying either on a complex phenomenological model or its simplified version. The first two soft sensors consist of open-loop estimators, i.e. they simply simulate the models fed by the manipulated variables. Three closed-loop moving horizon estimators based on the simplified model are also proposed for improved robustness. In the first one, the measurements of the inlet gas and particle temperatures feed back the soft sensor. The last two closed-loop observers additionally can take into account infrequent delayed moisture content measurements, such as at-line loss on drying analysis. A validation of the soft sensors is performed with experimental data collected on a pilot scale fluidized bed dryer. Results show that the closed-loop observer with the delayed moisture content measurements still has an accuracy that is equivalent (and sometimes better) than the complex phenomenological model.
In this paper CRNs containing linear reaction chains with multiple joint complexes were considered in order to obtain an equivalent reduced order delayed CRN model with distributed time delays. For this purpose, our e...
详细信息
In this paper CRNs containing linear reaction chains with multiple joint complexes were considered in order to obtain an equivalent reduced order delayed CRN model with distributed time delays. For this purpose, our earlier method (Lipták and Hangos (2018)) for decomposing the chains of linear reactions with multiple joint complexes was used together with the "linear chain trick". An analytical expression for the kernel function of the distributed delay was also derived from the reaction rate coefficients of the linear reaction chains. Our approach was demonstrated using the example of the well known McKeithan’s network model of kinetic proofreading.
Internet Gateway(IGW) main role is detecting availability nodes and providing internet to Mobile Ad Hoc Network(MANET) have whenever connected to internet. Discovery time of gateway is changed based on the throughput ...
Internet Gateway(IGW) main role is detecting availability nodes and providing internet to Mobile Ad Hoc Network(MANET) have whenever connected to internet. Discovery time of gateway is changed based on the throughput and packet delay. Many of the situations the mobile nodes are have fixed host connection to the internet using minimum hop path, it is not good for waiting packets because of that packets are have longer path interface queue. This research paper object is avoided above the problem mentioning using a novel approach of SPAHN (Solving Problem of Ad-Hoc Network). This paper mainly focus is classify the routing protocols of load-aware in MANET and from this classify discover the proactive load-aware gateway (PL-AG) from a device into interface queue size and min-hop-metric. This novel approach has been allowing and gives better handoff between two internet gateways for fixed host seamless connectivity. We justify the performance of the SPAHN approach using two metrics like average end-to-end delay and throughput based on this examination the SPAHN system yield good simulation results comparably existing systems.
暂无评论