Search query: subject term = "model-free algorithms"
6 records
Comparison of model-free kinetic methods for modeling the cure kinetics of commercial phenol-formaldehyde resins
THERMOCHIMICA ACTA, 2005, vol. 439, no. 1-2, pp. 68-73
Authors: Wang, JW; Laborie, MPG; Wolcott, MP (Washington State Univ Dept Civil & Environm Engn Wood Mat & Engn Lab Pullman WA 99164 USA)
For many industrial processes it is important to model the cure kinetics of phenol-formaldehyde resoles. Yet the applicability of common model-free kinetic algorithms for the cure of phenolic resins is not known. In t...
VOQL: Towards Optimal Regret in Model-free RL with Nonlinear Function Approximation
36th Annual Conference on Learning Theory (COLT)
Authors: Agarwal, Alekh; Jin, Yujia; Zhang, Tong (Google Res Mountain View CA 94043 USA; Stanford Univ Stanford CA 94305 USA)
We study time-inhomogeneous episodic reinforcement learning (RL) under general function approximation and sparse rewards. We design a new algorithm, Variance-weighted Optimistic Q-Learning (VOQL), based on Q-learning ...
The Efficacy of Pessimism in Asynchronous Q-Learning
IEEE TRANSACTIONS ON INFORMATION THEORY, 2023, vol. 69, no. 11, pp. 7185-7219
Authors: Yan, Yuling; Li, Gen; Chen, Yuxin; Fan, Jianqing (MIT Inst Data Syst & Soc Cambridge MA 02139 USA; Univ Penn Wharton Sch Dept Stat & Data Sci Philadelphia PA 19104 USA; Princeton Univ Dept Operat Res & Financial Engn Princeton NJ 08544 USA)
This paper is concerned with the asynchronous form of Q-learning, which applies a stochastic approximation scheme to Markovian data samples. Motivated by the recent advances in offline reinforcement learning, we devel...
First-Order Active Disturbance Rejection-Virtual Reference Feedback Tuning Control of Tower Crane Systems
24th International Conference on System Theory, Control and Computing (ICSTCC)
Authors: Roman, Raul-Cristian; Precup, Radu-Emil; Petriu, Emil M.; David, Radu-Codrut; Hedrea, Elena-Lorena; Szedlak-Stinean, Alexandra-Iulia (Politehn Univ Timisoara Dept Autom Appl Informat Timisoara Romania)
The current paper combines the main features of first-order Active Disturbance Rejection Control (ADRC) with Virtual Reference Feedback Tuning (VRFT) to automatically determine the parameters of the controller without...
Comparative Analysis of Existing Architectures for General Game Agents
17th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC)
Authors: Hosu, Ionel-Alexandru; Urzica, Andreea (Univ Politehn Bucuresti Fac Automat Control & Comp Sci Bucharest Romania)
This paper addresses the development of general purpose game agents able to learn a vast number of games using the same architecture. The article analyzes the main existing approaches to general game playing, reviews ...
An algorithm for dip point detection in lithium-sulfur battery cells
JOURNAL OF ENERGY STORAGE, 2022, vol. 55
Authors: Nozarijouybari, Zahra; Fang, Catherine; Doosthosseini, Mahsa; Xu, Chu; Fathy, Hosam K. (Univ Maryland Dept Mech Engn College Pk MD 20742 USA)
This article examines the problem of developing a simple, model-free algorithm for detecting and identifying the time instant when a Lithium-Sulfur (Li-S) cell passes through its "dip point" during discharge...