Author affiliations: State Key Laboratory of Intelligent Technology and Systems, Tsinghua University, Beijing 100084, China; Tsinghua National Laboratory for Information Science and Technology, Tsinghua University, Beijing 100084, China; Department of Computer Science and Technology, Tsinghua University, Beijing 100084, China
Publication: Journal of Southeast University (Natural Science Edition) / Dongnan Daxue Xuebao (Ziran Kexue Ban)
Year/Volume/Issue: 2009, Vol. 39, Suppl. 1
Pages: 146-151
Abstract: Based on the policy search algorithm for partially observable Markov decision processes (POMDPs), an optimal policy search algorithm is proposed, and an algorithm leading to the matching law is then derived from it. The subject's aim is to find a policy parameter that maximizes the expected value of a value function, and this parameter is updated from the subject's experience. Under the Markov assumption for the environment, the optimal policy algorithm is obtained by computing the gradient of the expected value of the value function. Theoretical analysis and simulation results show that the decision behavior produced by this algorithm satisfies the matching law. The matching law is met when a subject tries to maximize the expected value of the value function under the simple assumption that past choice behaviors affect neither the expected value of the value function nor the current policy. This reveals the relationship between matching behavior and the optimal policy search algorithm, and suggests that matching behavior is a suboptimal decision behavior.
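To make the idea concrete, below is a minimal sketch (not the paper's actual algorithm) of a policy-gradient update on a two-alternative choice task with an assumed concurrent variable-interval reward schedule. The update treats the reward as depending only on the current choice, i.e. it ignores how past choices shape future rewards, which corresponds to the abstract's simplifying assumption; its stationary point equalizes the return per choice across options, which is equivalent to the matching law. All parameter names and the schedule itself are illustrative assumptions.

```python
# Minimal sketch: REINFORCE-style policy-gradient choice on a concurrent
# variable-interval schedule, illustrating drift toward matching behavior.
import numpy as np

rng = np.random.default_rng(0)

def p_choose_0(theta):
    """Probability of choosing option 0 under a 2-action softmax policy."""
    return 1.0 / (1.0 + np.exp(-theta))

# Concurrent variable-interval (VI) schedules (assumed environment):
# each option is baited with probability lam[i] per trial, and a baited
# reward stays available until that option is next chosen.
lam = np.array([0.2, 0.05])
baited = np.array([False, False])

theta, alpha = 0.0, 0.05          # policy parameter and learning rate
choices = np.zeros(2)
rewards = np.zeros(2)

for t in range(200_000):
    baited |= rng.random(2) < lam  # baiting is independent of past choices

    p0 = p_choose_0(theta)
    a = 0 if rng.random() < p0 else 1
    r = 1.0 if baited[a] else 0.0
    baited[a] = False

    # Gradient-ascent update on the expected immediate reward:
    # d/dtheta log pi(a) = (1 - p0) for a = 0, and -p0 for a = 1.
    grad_logp = (1.0 - p0) if a == 0 else -p0
    theta += alpha * r * grad_logp

    choices[a] += 1
    rewards[a] += r

print("choice fraction (option 0):", choices[0] / choices.sum())
print("reward fraction (option 0):", rewards[0] / rewards.sum())
# Under this schedule the two fractions end up close to each other,
# i.e. the simulated subject exhibits matching behavior.
```

The fixed point of this update requires the expected reward per choice to be equal across the two options; dividing through by the total reward shows that the fraction of choices allocated to an option then equals the fraction of rewards obtained from it, which is the matching law. Because the VI schedule makes rewards depend on choice history, this point generally differs from the true reward-maximizing policy, consistent with the abstract's conclusion that matching is suboptimal.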