Gradient-based methods have been widely used for system design and optimization in diverse application domains. Recently, there has been a renewed interest in studying theoretical properties of these methods in the co...
详细信息
Gradient-based methods have been widely used for system design and optimization in diverse application domains. Recently, there has been a renewed interest in studying theoretical properties of these methods in the context of control and reinforcement learning. This article surveys some of the recent developments on policy optimization, a gradient-based iterative approach for feedback control synthesis that has been popularized by successes of reinforcement learning. We take an interdisciplinary perspective in our exposition that connects control theory, reinforcement learning, and large-scale optimization. We review a number of recently developed theoretical results on the optimization landscape, global convergence, and sample complexityof gradient-based methods for various continuous control problems, such as the linear quadratic regulator (LQR), control, risk-sensitive control, linear quadratic Gaussian (LQG) control, and output feedback synthesis. In conjunction with these optimization results, we also discuss how direct policy optimization handles stability and robustness concerns in learning-based control, two main desiderata in controlengineering. We conclude the survey by pointing out several challenges and opportunities at the intersection of learning and control.
The two-volume set LNCS 7565 and 7566 constitutes the refereed proceedings of three confederated international conferences: Cooperative Information systems (CoopIS 2012), Distributed Objects and Applications - Secure ...
详细信息
ISBN:
(数字)9783642336065
ISBN:
(纸本)9783642336058
The two-volume set LNCS 7565 and 7566 constitutes the refereed proceedings of three confederated international conferences: Cooperative Information systems (CoopIS 2012), Distributed Objects and Applications - Secure Virtual Infrastructures (DOA-SVI 2012), and Ontologies, DataBases and Applications of SEmantics (ODBASE 2012) held as part of OTM 2012 in September 2012 in Rome, Italy. The 53 revised full papers presented were carefully reviewed and selected from a total of 169 submissions. The 22 full papers included in the first volume constitute the proceedings of CoopIS 2012 and are organized in topical sections on business process design; process verification and analysis; service-oriented architectures and cloud; security, risk, and prediction; discovery and detection; collaboration; and 5 short papers.
The 6th edition of International Conference on Intelligent Computing and Optimization took place at G Hua Hin Resort & Mall on April 27–28, 2023, with tremendous support from the global research scholars across t...
详细信息
ISBN:
(数字)9783031503306
ISBN:
(纸本)9783031503290
The 6th edition of International Conference on Intelligent Computing and Optimization took place at G Hua Hin Resort & Mall on April 27–28, 2023, with tremendous support from the global research scholars across the planet. Objective is to celebrate “Research Novelty with Compassion and Wisdom” with researchers, scholars, experts, and investigators in Intelligent Computing and Optimization across the globe, to share knowledge, experience, and innovation—a marvelous opportunity for discourse and mutuality by novel research, invention, and creativity.;This proceedings book of the 6th ICO’2023 is published by Springer Nature—Quality Label of Enlightenment.
The two-volume set LNCS 7565 and 7566 constitutes the refereed proceedings of three confederated international conferences: Cooperative Information systems (CoopIS 2012), Distributed Objects and Applications - Secure ...
详细信息
ISBN:
(数字)9783642336157
ISBN:
(纸本)9783642336140
The two-volume set LNCS 7565 and 7566 constitutes the refereed proceedings of three confederated international conferences: Cooperative Information systems (CoopIS 2012), Distributed Objects and Applications - Secure Virtual Infrastructures (DOA-SVI 2012), and Ontologies, DataBases and Applications of SEmantics (ODBASE 2012) held as part of OTM 2012 in September 2012 in Rome, Italy. The 53 revised full papers presented were carefully reviewed and selected from a total of 169 submissions. The 31 full papers included in the second volume constitute the proceedings of DOA-SVI 2012 with 10 full papers organized in topical sections on privacy in the cloud; resource management and assurance; context, compliance and attack; and ODBASE 2012 with 21 full papers organized in topical sections on using ontologies and semantics; applying probalistic techniques to semantic information; exploiting and querying semantic information; and managing and storing semantic information.
The 6th edition of International Conference on Intelligent Computing and Optimization took place at G Hua Hin Resort & Mall on April 27–28, 2023, with tremendous support from the global research scholars across t...
详细信息
ISBN:
(数字)9783031503276
ISBN:
(纸本)9783031503269
The 6th edition of International Conference on Intelligent Computing and Optimization took place at G Hua Hin Resort & Mall on April 27–28, 2023, with tremendous support from the global research scholars across the planet. Objective is to celebrate “Research Novelty with Compassion and Wisdom” with researchers, scholars, experts, and investigators in Intelligent Computing and Optimization across the globe, to share knowledge, experience, and innovation—a marvelous opportunity for discourse and mutuality by novel research, invention, and creativity.;This proceedings book of the 6th ICO’2023 is published by Springer Nature—Quality Label of Enlightenment.
暂无评论