Search Results - Inner Mongolia University Library

Off-Policy Q-Learning for Anti-Interference Control of Multi-Player Systems

IFAC-PapersOnLine, 2020, Vol. 53, No. 2, pp. 9189-9194

Authors: Jinna Li, Zhenfei Xiao, Tianyou Chai, Frank L. Lewis, Sarangapani Jagannathan. Affiliations: School of Information and Control Engineering, Liaoning Shihua University, Fushun 113001, China; State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, China; UTA Research Institute, the University of Texas at Arlington, Arlington, TX 76118, USA; Department of Electrical and Computer Engineering, Missouri University of Science and Technology, Rolla, MO 65409, USA

This paper develops a novel off-policy game Q-learning algorithm to solve the anti-interference control problem for discrete-time linear multi-player systems using only data, without requiring the system matrices to be known. The primary contribution is that the Q-learning strategy in the proposed algorithm is implemented through off-policy policy iteration rather than on-policy learning, owing to the well-known advantages of off-policy Q-learning over on-policy Q-learning. All players cooperate to minimize their common performance index while defeating the disturbance, which tries to maximize the performance index; they finally reach the Nash equilibrium of the game, which satisfies the disturbance attenuation condition. To find the Nash equilibrium solution, the anti-interference control problem is first transformed into an optimal control problem. Then an off-policy Q-learning algorithm is proposed in the framework of typical adaptive dynamic programming (ADP) and game architecture, such that the control policies of all players can be learned using only measured data. Comparative simulation results are provided to verify the effectiveness of the proposed method.

Keywords: H∞ control; off-policy Q-learning; game theory; Nash equilibrium

Retraction Note to: An improved approach for automatic spine canal segmentation using probabilistic boosting tree (PBT) with fuzzy support vector machine

Journal of Ambient Intelligence and Humanized Computing, 2022, Vol. 14, No. 1, p. 303

Authors: Viji, C.; Rajkumar, N.; Suganthi, S. T.; Venkatachalam, K.; Rajesh kumar, T.; Pandiyan, Sanjeevi. Affiliations: Department of CSE, Akshaya College of Engineering and Technology, Coimbatore, India; Department of Computer Engineering, Lebanese French University, Erbil, Iraq; School of CSE, VIT University Bhopal, Bhopal, India; Department of Information Technology, Sri Krishna College of Technology, Kovaipudur, India; Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Jiangnan University, Wuxi, China

Development and modeling of some elements of the two-wheeled mobile robot system

Journal of Physics: Conference Series, 2021, Vol. 2134, No. 1

Authors: Alexander O Karpov, Alexey O Karpov, M Yu Vasilyeva, E S Belashova. Affiliations: Process Dynamics and Control Department, Kazan National Research Technical University named after A.N. Tupolev - KAI, Kazan, Russia; Department of quality management, Kazan (Volga Region) Federal University, Kazan, Russia; Department of Automation and Process Control Systems, Kazan National Research Technological University, Kazan, Russia; Department of Computer Systems, Kazan National Research Technical University named after A.N. Tupolev - KAI, Kazan, Russia

The article deals with the development of navigation system elements. This system realizes high-accuracy movement and positioning of a two-wheeled mobile robot. The article also describes algorithms for constructing an optimal path for autonomous device movement, and for map building in an unknown area with obstacle avoidance. Computer modeling of the device's executive system is carried out from mathematical models using the engineering program MATLAB.

Artificial intelligence for geoscience: Progress, challenges, and perspectives

The Innovation, 2024, Vol. 5, No. 5, pp. 136-160, 135

Authors: Tianjie Zhao, Sheng Wang, Chaojun Ouyang, Min Chen, Chenying Liu, Jin Zhang, Long Yu, Fei Wang, Yong Xie, Jun Li, Fang Wang, Sabine Grunwald, Bryan M Wong, Fan Zhang, Zhen Qian, Yongjun Xu, Chengqing Yu, Wei Han, Tao Sun, Zezhi Shao, Tangwen Qian, Zhao Chen, Jiangyuan Zeng, Huai Zhang, Husi Letu, Bing Zhang, Li Wang, Lei Luo, Chong Shi, Hongjun Su, Hongsheng Zhang, Shuai Yin, Ni Huang, Wei Zhao, Nan Li, Chaolei Zheng, Yang Zhou, Changping Huang, Defeng Feng, Qingsong Xu, Yan Wu, Danfeng Hong, Zhenyu Wang, Yinyi Lin, Tangtang Zhang, Prashant Kumar, Antonio Plaza, Jocelyn Chanussot, Jiabao Zhang, Jiancheng Shi, Lizhe Wang. Affiliations: Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China; School of Computer Science, China University of Geosciences, Wuhan 430078, China; State Key Laboratory of Mountain Hazards and Engineering Resilience, Institute of Mountain Hazards and Environment, Chinese Academy of Sciences, Chengdu 610299, China; Key Laboratory of Virtual Geographic Environment (Ministry of Education of PRC), Nanjing Normal University, Nanjing 210023, China; Data Science in Earth Observation, Technical University of Munich, 80333 Munich, Germany; The National Key Laboratory of Water Disaster Prevention, Yangtze Institute for Conservation and Development, Hohai University, Nanjing 210098, China; Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; School of Geographical Sciences, Nanjing University of Information Science and Technology, Nanjing 210044, China; State Key Laboratory of Soil and Sustainable Agriculture, Institute of Soil Science, Chinese Academy of Sciences, Nanjing 210008, China; Soil Water and Ecosystem Sciences Department, University of Florida, PO Box 110290, Gainesville, FL, USA; Materials Science Engineering Program, Cooperating Faculty Member in the Department of Chemistry and Department of Physics Astronomy, University of California, California, Riverside, CA 92521, USA; Institute of Remote Sensing and Geographical Information System, School of Earth and Space Sciences, Peking University, Beijing 100871, China; Key Laboratory of Computational Geodynamics, University of Chinese Academy of Sciences, Beijing 100049, China; International Research Center of Big Data for Sustainable Development Goals, Beijing 100094, China; College of Geography and Remote Sensing, Hohai University, Nanjing 211100, China; Department of Geography, The University of Hong Kong, Hong Kong 999077, SAR, China; Jiangsu Key Laboratory of Atmospheric Environment Monitoring and Pollution Control, Nanjing 210044, China; School of Environmental Science and Engineering, Nanjing University of Information Science & Technology, Nanjing 210044, China; Collaborative Inno

This paper explores the evolution of geoscientific inquiry, tracing the progression from traditional physics-based models to modern data-driven approaches facilitated by significant advancements in artificial intelligence (AI) and data collection *** models, which are grounded in physical and numerical frameworks, provide robust explanations by explicitly reconstructing underlying physical ***, their limitations in comprehensively capturing Earth’s complexities and uncertainties pose challenges in optimization and real-world *** contrast, contemporary data-driven models, particularly those utilizing machine learning (ML) and deep learning (DL), leverage extensive geoscience data to glean insights without requiring exhaustive theoretical *** techniques have shown promise in addressing Earth science-related ***, challenges such as data scarcity, computational demands, data privacy concerns, and the “black-box” nature of AI models hinder their seamless integration into *** integration of physics-based and data-driven methodologies into hybrid models presents an alternative *** models, which incorporate domain knowledge to guide AI methodologies, demonstrate enhanced efficiency and performance with reduced training data *** review provides a comprehensive overview of geoscientific research paradigms, emphasizing untapped opportunities at the intersection of advanced AI techniques and *** examines major methodologies, showcases advances in large-scale models, and discusses the challenges and prospects that will shape the future landscape of AI in *** paper outlines a dynamic field ripe with possibilities, poised to unlock new understandings of Earth’s complexities and further advance geoscience exploration.

Keywords: Earth; utilizing; landscape

A Primal-Dual SGD Algorithm for Distributed Nonconvex Optimization

arXiv, 2020

Authors: Yi, Xinlei; Zhang, Shengjun; Yang, Tao; Chai, Tianyou; Johansson, Karl H. Affiliations: The Division of Decision and Control Systems, School of Electrical Engineering and Computer Science, KTH Royal Institute of Technology, Stockholm 100 44, Sweden; The Department of Electrical Engineering, University of North Texas, Denton, TX 76203, United States; The State Key Laboratory of Synthetical Automation for Process Industries, Northeastern University, Shenyang 110819, China

The distributed nonconvex optimization problem of minimizing a global cost function formed by a sum of n local cost functions by using local information exchange is considered. This problem is an important component of many machine learning techniques with data parallelism, such as deep learning and federated learning. We propose a distributed primal–dual stochastic gradient descent (SGD) algorithm, suitable for arbitrarily connected communication networks and any smooth (possibly nonconvex) cost functions. We show that the proposed algorithm achieves the linear speedup convergence rate O(1/√(nT)) for general nonconvex cost functions and the linear speedup convergence rate O(1/(nT)) when the global cost function satisfies the Polyak–Łojasiewicz (P–L) condition, where T is the total number of iterations. We also show that the output of the proposed algorithm with constant parameters linearly converges to a neighborhood of a global optimum. We demonstrate through numerical experiments the efficiency of our algorithm in comparison with the baseline centralized SGD and recently proposed distributed SGD algorithms. Copyright © 2020, The Authors. All rights reserved.

Keywords: Cost functions

Delays in Model Reduction of Chemical Reaction Networks

IFAC-PapersOnLine, 2018, Vol. 51, No. 14, pp. 100-105

Authors: Lipták, György; Hangos, Katalin M. Affiliations: Process Control Research Group, Systems and Control Laboratory, Computer and Automation Research Institute, Hungarian Academy of Sciences, P.O. Box 63, Budapest H-1518, Hungary; Department of Electrical Engineering and Information Systems, University of Pannonia, Veszprém, Hungary

A novel engineering model reduction method is proposed in this paper that can be applied to a chemical reaction network (CRN) with chains of linear reactions. The reduced model is a delayed CRN with possibly different delays but with fewer state variables than the original model. As the first step of the model reduction, a decomposition method is also developed to transform chains with joint reactions into independent chains of linear reactions. The well-known example of McKeithan's network is used as a case study to illustrate the basic concepts and the design method. © 2018

Keywords: Reduction

Robust estimation for small domains in business surveys

arXiv, 2020

Authors: Smith, Paul A.; Bocci, Chiara; Tzavidis, Nikos; Krieg, Sabine; Smeets, Marc J.E. Affiliations: S3RI, Department of Social Statistics & Demography, University of Southampton, Highfield, Southampton SO17 1BJ, United Kingdom; Department of Statistics, Computer Science, Applications "G. Parenti", University of Florence, Viale Morgagni 59, Firenze 50134, Italy; Statistics Netherlands, Process Development & Methodology, P.O. Box 4481, Heerlen 6401CZ, Netherlands

Small area (or small domain) estimation is still rarely applied in business statistics, because of challenges arising from the skewness and variability of variables such as turnover. We examine a range of small area estimation methods as the basis for estimating the activity of industries within the retail sector in the Netherlands. We use tax register data and a sampling procedure which replicates the sampling for the retail sector of Statistics Netherlands' Structural Business Survey as a basis for investigating the properties of small area estimators. In particular, we consider the use of the EBLUP under a random effects model and variations of the EBLUP derived under (a) a random effects model that includes a complex specification for the level 1 variance and (b) a random effects model that is fitted by using the survey weights. Although accounting for the survey weights in estimation is important, the impact of influential data points remains the main challenge in this case. The paper further explores the use of outlier robust estimators in business surveys, in particular a robust version of the EBLUP, M‐regression based synthetic estimators, and M‐quantile small area estimators. The latter family of small area estimators includes robust projective (without and with survey weights) and robust predictive versions. M‐quantile methods have the lowest empirical mean squared error and are substantially better than direct estimators, though there is an open question about how to choose the tuning constant for bias adjustment in practice. The paper makes a further contribution by exploring a doubly robust approach comprising the use of survey weights in conjunction with outlier robust methods in small area estimation. Copyright © 2020, The Authors. All rights reserved.

Keywords: Surveys

Monitoring the Moisture Content in Pharmaceutical Batch Fluidized Bed Dryers Using Observer-Based Soft Sensors

IFAC-PapersOnLine, 2020, Vol. 53, No. 2, pp. 12056-12061

Authors: Marc-Olivier Roseberry, Francis Gagnon, André Desbiens, Jocelyn Bouchard, Pierre-Philippe Lapointe-Garant. Affiliations: Department of Electrical and Computer Engineering, LOOP, Université Laval, Québec City, Québec G1V 6A6, Canada; Department of Chemical Engineering, LOOP, Université Laval, Québec City, Québec G1V 6A6, Canada; Process Monitoring Automation and Control, Global Engineering, Pfizer, Montréal, Québec H4R 1J6, Canada

Tablet manufacturing in the pharmaceutical industry involves batch fluidized bed drying for particle moisture removal. This paper introduces five approaches for moisture content monitoring, relying either on a complex phenomenological model or its simplified version. The first two soft sensors consist of open-loop estimators, i.e. they simply simulate the models fed by the manipulated variables. Three closed-loop moving horizon estimators based on the simplified model are also proposed for improved robustness. In the first one, the measurements of the inlet gas and particle temperatures are fed back to the soft sensor. The last two closed-loop observers can additionally take into account infrequent delayed moisture content measurements, such as at-line loss on drying analysis. A validation of the soft sensors is performed with experimental data collected on a pilot scale fluidized bed dryer. Results show that the closed-loop observer with the delayed moisture content measurements still has an accuracy that is equivalent to (and sometimes better than) that of the complex phenomenological model.

Keywords: state estimation; batch fluidized bed dryer; moving horizon estimator; offline measurement; measurement delay

Distributed delay model of the McKeithan’s network

IFAC-PapersOnLine, 2019, Vol. 52, No. 7, pp. 33-38

Authors: György Lipták, Katalin M. Hangos. Affiliations: Process Control Research Group, Systems and Control Laboratory, Computer and Automation Research Institute, Hungarian Academy of Sciences, P.O. Box 63, H-1518 Budapest, Hungary; Department of Electrical Engineering and Information Systems, University of Pannonia, Veszprém, Hungary

In this paper, chemical reaction networks (CRNs) containing linear reaction chains with multiple joint complexes were considered in order to obtain an equivalent reduced-order delayed CRN model with distributed time delays. For this purpose, our earlier method (Lipták and Hangos (2018)) for decomposing the chains of linear reactions with multiple joint complexes was used together with the "linear chain trick". An analytical expression for the kernel function of the distributed delay was also derived from the reaction rate coefficients of the linear reaction chains. Our approach was demonstrated using the example of the well-known McKeithan’s network model of kinetic proofreading.

Keywords: process control; Delay; Chemical Reaction Networks

SPAHN novel approach for PL-AG gateway discovery for internet connectivity

Peer-to-Peer Networking and Applications, 2020, Vol. 14, No. 4, pp. 2275-2284

Authors: Kannan, K.; Sivaranjani, P.; Sathish Kumar, S.; Nalini, M.; Balaji, V. R.; Sanjeevi, P. Affiliations: Department of ECE, R.M.K. College of Engineering and Technology, Chennai, India; Department of Electronics and communication Engineering, Kongu Engineering College, Erode, India; Dept of Electrical and Electronics Engineering, M. Kumarasamy College of Engineering, Thalavapalayam, India; Department of Computer Science and Engineering, Saveetha School of Engineering, Saveetha Institute of Medical and Technical Sciences, Chennai, India; Department of ECE, Sri Krishna College of Engineering and Technology, Coimbatore, India; Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Jiangnan University, Wuxi, China

The main role of an Internet gateway (IGW) is to detect available nodes and provide Internet access to a mobile ad hoc network (MANET) whenever it is connected to the Internet. Gateway discovery time varies with throughput and packet delay. In many situations, mobile nodes connect to a fixed host on the Internet over the minimum-hop path, which is poor for waiting packets because they face a longer interface queue along that path. The objective of this paper is to avoid this problem using a novel approach called SPAHN (Solving Problem of Ad-Hoc Network). The paper focuses on classifying load-aware routing protocols in MANETs and, from this classification, discovering the proactive load-aware gateway (PL-AG) based on interface queue size and the min-hop metric. The approach gives a better handoff between two Internet gateways, providing seamless connectivity to fixed hosts. We evaluate the SPAHN approach using two metrics, average end-to-end delay and throughput; on this basis, the SPAHN system yields good simulation results compared with existing systems.
