Search criteria: "Subject term = sequential decision problems"
13 records; showing 1-10

Relative entropy in sequential decision problems
JOURNAL OF MATHEMATICAL ECONOMICS, 2000, Vol. 33, No. 4, pp. 425-439
Authors: Lehrer, E; Smorodinsky, R
Affiliations: Tel Aviv Univ Raymond & Beverly Sackler Fac Exact Sci Sch Math IL-69978 Tel Aviv Israel; Technion Israel Inst Technol IL-32000 Haifa Israel
Consider an agent who faces a sequential decision problem. At each stage the agent takes an action and observes a stochastic outcome (e.g., daily prices, weather conditions, opponents' actions in a repeated game, ...

Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2013, Vol. 156, No. 2, pp. 380-416
Authors: Gaggero, Mauro; Gnecco, Giorgio; Sanguineti, Marcello
Affiliations: Natl Res Council Italy Inst Intelligent Syst Automat Genoa Italy; Univ Genoa DIBRIS Genoa Italy
Value-function approximation is investigated for the solution via Dynamic Programming (DP) of continuous-state sequential N-stage decision problems, in which the reward to be maximized has an additive structure over a...

The decision tree polytope and its application to sequential decision problems
Journal of Multi-Criteria Decision Analysis, 1999, Vol. 7, No. 6
Author: Art Warburton
Affiliation: Faculty of Business Administration Simon Fraser University Burnaby Canada
This paper describes a new mathematical programming approach to sequential decision problems that have an underlying decision tree structure. The approach, based upon a characterization of strategies as extreme points...

Novel pricing strategies for revenue maximization and demand learning using an exploration-exploitation framework
SOFT COMPUTING, 2021, Vol. 25, No. 17, pp. 11711-11733
Authors: Elreedy, Dina; Atiya, Amir F.; Shaheen, Samir I.
Affiliation: Cairo Univ Comp Engn Dept Giza 12613 Egypt
The price demand relation is a fundamental concept that models how price affects the sale of a product. It is critical to have an accurate estimate of its parameters, as it will impact the company's revenue. The l...

Simultaneously Learning and Optimizing Using Controlled Variance Pricing
MANAGEMENT SCIENCE, 2014, Vol. 60, No. 3, pp. 770-783
Authors: den Boer, Arnoud V.; Zwart, Bert
Affiliations: Eindhoven Univ Technol NL-5600 MB Eindhoven Netherlands; Univ Amsterdam NL-1098 XH Amsterdam Netherlands; Ctr Wiskunde & Informat NL-1098 XG Amsterdam Netherlands; Vrije Univ Amsterdam Dept Math NL-1081 HV Amsterdam Netherlands
Price experimentation is an important tool for firms to find the optimal selling price of their products. It should be conducted properly, since experimenting with selling prices can be costly. A firm, therefore, need...

DEGENERACY IN INFINITE HORIZON OPTIMIZATION
MATHEMATICAL PROGRAMMING, 1989, Vol. 43, No. 3, pp. 305-316
Authors: RYAN, SM; BEAN, JC
Affiliation: UNIV MICHIGAN, DEPT IND & OPERAT ENGN, ANN ARBOR, MI 48109
We consider sequential decision problems over an infinite horizon. The forecast or solution horizon approach to solving such problems requires that the optimal initial decision be unique. We show that multiple optimal...

Algorithms for Multi-criteria Optimization in Possibilistic Decision Trees
14th European Conference on Symbolic and Quantitative Approaches to Reasoning with Uncertainty (ECSQARU)
Authors: Ben Amor, Nahla; Essghaier, Fatma; Fargier, Helene
Affiliations: LARODEC Le Bardo Tunisia; IRIT Toulouse France
This paper raises the question of solving multi-criteria sequential decision problems under uncertainty. It proposes to extend to possibilistic decision trees the decision rules presented in [1] for non sequential pro...

Marginal Contribution Stochastic Games for Dynamic Resource Allocation
17th International Conference on Principles and Practice of Multi-Agent Systems (PRIMA)
Authors: Chapman, Archie C.; Varakantham, Pradeep
Affiliations: Univ Sydney Sch Elect & Informat Engn Sydney NSW 2006 Australia; Singapore Management Univ Sch Informat Syst Singapore Singapore
We develop a new formalism for solving team Markov decision processes (MDPs), called marginal-contribution stochastic games (MCSGs). In MCSGs, each agent's utility for a state transition is given by its marginal c...

The Evolution of Strategies for Multiagent Environments
Adaptive Behavior, 1992, Vol. 1, No. 1, pp. 65-90
Author: Grefenstette, John J.
Affiliation: Navy Center for Applied Research in Artificial Intelligence Naval Research Laboratory Washington DC 20375-5000 United States
SAMUEL is an experimental learning system that uses genetic algorithms and other learning methods to evolve reactive decision rules from simulations of multiagent environments. The basic approach is to explore a range...

Efficient Learning and Planning Within the Dyna Framework
Adaptive Behavior, 1993, Vol. 1, No. 4, pp. 437-454
Authors: Jing, Peng; Williams, Ronald J.
Affiliations: Northeastern University United States; College of Computer Science Northeastern University Boston MA 02115 United States
Sutton’s Dyna framework provides a novel and computationally appealing way to integrate learning, planning, and reacting in autonomous agents. Examined here is a class of strategies designed to enhance the learning a...