咨询与建议

限定检索结果

文献类型

  • 3 篇 期刊文献
  • 3 篇 会议

馆藏范围

  • 6 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 3 篇 理学
    • 2 篇 生物学
    • 1 篇 数学
    • 1 篇 统计学(可授理学、...
  • 3 篇 工学
    • 2 篇 计算机科学与技术...
    • 1 篇 电气工程
    • 1 篇 信息与通信工程
  • 2 篇 农学
  • 1 篇 经济学
    • 1 篇 应用经济学

主题

  • 6 篇 ucb algorithm
  • 2 篇 confidence inter...
  • 2 篇 multi-armed band...
  • 1 篇 wet clutch
  • 1 篇 industrial contr...
  • 1 篇 risk aversion
  • 1 篇 the principle of...
  • 1 篇 security in mach...
  • 1 篇 exploration-expl...
  • 1 篇 hypercubes
  • 1 篇 hypercube
  • 1 篇 optimism
  • 1 篇 algorithms
  • 1 篇 electronic mail
  • 1 篇 pareto ucb1 algo...
  • 1 篇 cloning vectors
  • 1 篇 real-world stoch...
  • 1 篇 pcb
  • 1 篇 vectors
  • 1 篇 signal integrity

机构

  • 1 篇 tokyo denki univ...
  • 1 篇 univ psl diens b...
  • 1 篇 univ elect sci &...
  • 1 篇 tokyo denki univ...
  • 1 篇 university of ca...
  • 1 篇 tohoku univ elec...
  • 1 篇 tokyo denki univ...
  • 1 篇 univ clermont au...
  • 1 篇 tokyo denki univ...
  • 1 篇 univ grenoble al...
  • 1 篇 vrije univ bruss...

作者

  • 2 篇 kamiura moto
  • 1 篇 lafourcade pasca...
  • 1 篇 sano kohei
  • 1 篇 chen siyu
  • 1 篇 nowe ann
  • 1 篇 soare marta
  • 1 篇 ciucanu radu
  • 1 篇 drugan madalina ...
  • 1 篇 manderick bernar...
  • 1 篇 ochi kento
  • 1 篇 yijia zeng
  • 1 篇 lombard-platet m...
  • 1 篇 wei shuwu
  • 1 篇 zhang tingrui
  • 1 篇 chen jienan

语言

  • 6 篇 英文
检索条件"主题词=UCB algorithm"
6 条 记 录,以下是1-10 订阅
Involvement of the variance in the ucb algorithm regarding risk aversion and the regret bound
Involvement of the variance in the UCB algorithm regarding r...
收藏 引用
作者: Yijia Zeng University of California San DiegoMath Department
The classical Upper Confidence Bound(ucb) algorithm implemented overestimates of the true mean of reward distributions based on the sample mean and the number of times such arms were chosen to decide the best *** this... 详细信息
来源: 评论
Secure protocols for cumulative reward maximization in stochastic multi-armed bandits
收藏 引用
JOURNAL OF COMPUTER SECURITY 2023年 第1期31卷 1-27页
作者: Ciucanu, Radu Lafourcade, Pascal Lombard-Platet, Marius Soare, Marta Univ Grenoble Alpes France Mails Raduciucanu Univ Grenoble Alpesfr LIG Grenoble France Univ Clermont Auvergne LIMOS Clermont Ferrand France Univ PSL DIENS Be Studys France
We consider the problem of cumulative reward maximization in multi-armed bandits. We address the security concerns that occur when data and computations are outsourced to an honest-but-curious cloud i.e., that execute... 详细信息
来源: 评论
A Fast Signal Integrity Design Model of Printed Circuit Board based on Monte-Carlo Tree  13
A Fast Signal Integrity Design Model of Printed Circuit Boar...
收藏 引用
13th IEEE International Conference on ASIC
作者: Zhang, Tingrui Chen, Siyu Wei, Shuwu Chen, Jienan Univ Elect Sci & Technol China Chengdu 611731 Sichuan Peoples R China
In this paper, we discuss the signal integrity of the printed circuit board(PCB) and put forward a new method of designing PCB parameters based on monte-carlo tree search(MCTS). First, effects and measurements of the ... 详细信息
来源: 评论
Optimism in the face of uncertainty supported by a statistically-designed multi-armed bandit algorithm
收藏 引用
BIOSYSTEMS 2017年 160卷 25-32页
作者: Kamiura, Moto Sano, Kohei Tokyo Denki Univ Grad Sch Sci & Engn Tokyo Japan Tokyo Denki Univ Sch Sci & Engn Tokyo Japan
The principle of optimism in the face of uncertainty is known as a heuristic in sequential decision-making problems. Overtaking method based on this principle is an effective algorithm to solve multi-armed bandit prob... 详细信息
来源: 评论
Overtaking method based on sand-sifter mechanism: Why do optimistic value functions find optimal solutions in multi-armed bandit problems?
收藏 引用
BIOSYSTEMS 2015年 135卷 55-65页
作者: Ochi, Kento Kamiura, Moto Tokyo Denki Univ Grad Sch Sci & Engn Hiki Saitama Japan Tokyo Denki Univ Sch Sci & Engn Hiki Saitama Japan Tohoku Univ Elect Commun Res Inst Sendai Miyagi 980 Japan
A multi-armed bandit problem is a search problem on which a learning agent must select the optimal arm among multiple slot machines generating random rewards. ucb algorithm is one of the most popular methods to solve ... 详细信息
来源: 评论
Pareto Upper Confidence Bounds algorithms: an empirical study
Pareto Upper Confidence Bounds algorithms: an empirical stud...
收藏 引用
IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL)
作者: Drugan, Madalina M. Nowe, Ann Manderick, Bernard Vrije Univ Brussel Artificial Intelligence Lab Ixelles Belgium
Many real-world stochastic environments are inherently multi-objective environments with conflicting objectives. The multi-objective multi-armed bandits (MOMAB) are extensions of the classical, i.e. single objective, ... 详细信息
来源: 评论