Search query: subject term = "Value function approximation"
111 records; showing 91-100
Learning to select branching rules in the DPLL procedure for satisfiability
Electronic Notes in Discrete Mathematics, 2001, vol. 9, pp. 344-359
Authors: Lagoudakis, Michail G.; Littman, Michael L.
Affiliations: Department of Computer Science Duke University Durham NC 27708 United States; Shannon Laboratory AT and T Labs. Research Florham Park NJ 07932 United States
The DPLL procedure is the most popular complete satisfiability (SAT) solver. While its worst case complexity is exponential, the actual running time is greatly affected by the ordering of branch variables during the s...
EXPERT-BASED REWARD SHAPING AND EXPLORATION SCHEME FOR BOOSTING POLICY LEARNING OF DIALOGUE MANAGEMENT
IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)
Authors: Ferreira, Emmanuel; Lefevre, Fabrice
Affiliation: Univ Avignon LIA F-84911 Avignon 9 France
This paper investigates the conditions under which expert knowledge can be used to accelerate the policy optimization of a learning agent. Recent works on reinforcement learning for dialogue management allowed to devi...
Temporal-Difference Learning: An Online Support Vector Regression Approach
12th International Conference on Informatics in Control Automation and Robotics (ICINCO)
Authors: Teixeira, Hugo Tanzarella; Bottura, Celso Pascoli
Affiliation: State Univ Campinas UNICAMP Sch Elect & Comp Engn FEEC DSIF LCSI Av Albert Einstein 400LE31 BR-13081970 Campinas SP Brazil
This paper proposes a new algorithm for Temporal-Difference (TD) learning using online support vector regression. It benefits from the good generalization properties support vector regression (SVR) has, and also can d...
The Operation Optimization Model of Pumped-Hydro Power Storage Station Based on Approximate Dynamic Programming
International Conference on Power System Technology (PowerCon)
Authors: Liang, Zhencheng; Li, Yu; Wei, Hua
Affiliations: Guangxi Univ Sch Elect Engn Nanning 530004 Peoples R China; Guangxi Key Lab Power Syst Optimizat & Energy Tec Nanning Peoples R China
Based on the hypothesis that pumped-hydro power storage (PHPS) station is available for multi-day optimization and adjustment, the paper has proposed a long-term operation optimization model of PHPS station based on a...
R-learning and Gaussian Process Regression Algorithm for Cloud Job Access Control
3rd IEEE International Conference on Cyber Security and Cloud Computing (IEEE CSCloud) / 2nd IEEE International Conference of Scalable and Smart Cloud (IEEE SSC)
Authors: Peng, Zhiping; Cui, Delong; Ma, Yuanjia; Xiong, Jianbin; Xu, Bo; Lin, Weiwei
Affiliations: Guangdong Univ Petrochem Technol Coll Comp & Elect Informat Maoming Peoples R China; South China Univ Technol Sch Comp Sci & Engn Guangzhou Guangdong Peoples R China
Reinforcement learning is an area of machine learning inspired by behaviorist psychology, concerned with how software agents ought to take actions in an environment so as to maximize some notion of cumulative reward. ...
Energy management of PV-storage systems: ADP approach with temporal difference learning
19th Power Systems Computation Conference (PSCC)
Authors: Keerthisinghe, Chanaka; Verbic, Gregor; Chapman, Archie C.
Affiliation: Univ Sydney Sch Elect & Informat Engn Sydney NSW Australia
In the future, residential energy users can seize the full potential of demand response schemes by using an automated home energy management system (HEMS) to schedule their distributed energy resources. In order to ge...
Tracking in Reinforcement Learning
16th International Conference on Neural Information Processing (ICONIP 2009)
Authors: Geist, Matthieu; Pietquin, Olivier; Fricout, Gabriel
Affiliations: IMS Res Grp Metz France; Arcelor Mittal Res MC Cluster Metz France; INRIA Nancy Grand Est CORIDA project team Nancy France
Reinforcement learning induces non-stationarity at several levels. Adaptation to non-stationary environments is of course a desired feature of a fair RL algorithm. Yet, even if the environment of the learning agent ca...
Advances in Tactical & Operational Planning for Less-than-Truckload Carriers
Author: Baubaid, Ahmad Ali
Affiliation: Georgia Institute of Technology
Degree level: Doctoral
This thesis explores tactical and operational planning problems in the context of the Less-than-Truckload (LTL) industry. LTL carriers transport shipments that occupy a small fraction of trailer capacity, and, thus, r...
Compensation guarantees in crowdsourced delivery: Impact on platform and driver welfare
OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2024, vol. 122
Authors: Alnaggar, Aliaa; Gzara, Fatma; Bookbinder, James H.
Affiliations: Toronto Metropolitan Univ Dept Mech & Ind Engn Toronto ON M5B 2K3 Canada; Univ Waterloo Dept Management Sci Waterloo ON N2L 3G1 Canada
Crowdsourced delivery and other sharing economy platforms attract freelance workers by offering them flexibility in scheduling their own work hours. Those platforms, however, have been criticized for the lack of prote...
Optimal intra-day operations of behind-the-meter battery storage for primary frequency regulation provision: A hybrid lookahead method
ENERGY, 2022, vol. 247, p. 123482
Authors: Wen, Kerui; Li, Weidong; Yu, Samson Shenglong; Li, Ping; Shi, Peng
Affiliations: Dalian Univ Technol Fac Elect Informat & Elect Engn Dalian 116024 Peoples R China; Deakin Univ Sch Engn 75 Pigdons Rd Waurn Ponds Vic 3216 Australia; State Grid Liaoning Elect Power Supply Co Ltd Elect Power Res Inst Shenyang 110006 Peoples R China; Univ Adelaide Sch Elect & Elect Engn North Terrace Adelaide SA 5000 Australia
Battery energy storage systems (BESSs) are being widely installed behind-the-meter to reduce electricity bill. By providing grid ancillary services, behind-the-meter BESSs can increase potential revenue streams. This ...