咨询与建议

限定检索结果

文献类型

  • 28 篇 期刊文献
  • 10 篇 会议

馆藏范围

  • 38 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 32 篇 工学
    • 23 篇 计算机科学与技术...
    • 16 篇 电气工程
    • 9 篇 控制科学与工程
    • 4 篇 软件工程
    • 2 篇 仪器科学与技术
    • 2 篇 电子科学与技术(可...
    • 1 篇 力学(可授工学、理...
    • 1 篇 机械工程
    • 1 篇 信息与通信工程
  • 8 篇 理学
    • 7 篇 数学
    • 1 篇 地球物理学
    • 1 篇 统计学(可授理学、...
  • 6 篇 管理学
    • 6 篇 管理科学与工程(可...
  • 2 篇 医学
    • 2 篇 基础医学(可授医学...
    • 2 篇 临床医学
  • 1 篇 经济学
    • 1 篇 理论经济学

主题

  • 38 篇 nonlinear functi...
  • 5 篇 reinforcement le...
  • 5 篇 neural networks
  • 3 篇 function approxi...
  • 3 篇 actor-critic
  • 3 篇 radial basis fun...
  • 2 篇 nonlinear functi...
  • 2 篇 regression analy...
  • 2 篇 incremental lear...
  • 2 篇 economic growth.
  • 2 篇 classification
  • 2 篇 policy gradient
  • 2 篇 artificial neura...
  • 2 篇 production funct...
  • 1 篇 thermal stabilit...
  • 1 篇 mahalanobis dist...
  • 1 篇 piecewise-linear...
  • 1 篇 hinging hyperpla...
  • 1 篇 correlation-awar...
  • 1 篇 representation l...

机构

  • 2 篇 shahid beheshti ...
  • 2 篇 henan univ sci &...
  • 1 篇 faculty of engin...
  • 1 篇 google res mount...
  • 1 篇 southwest petr u...
  • 1 篇 amirkabir univ t...
  • 1 篇 fujian normal un...
  • 1 篇 yonsei univ dept...
  • 1 篇 austrian acad sc...
  • 1 篇 natl i lan univ ...
  • 1 篇 cent queensland ...
  • 1 篇 getac technol co...
  • 1 篇 department of st...
  • 1 篇 city univ hong k...
  • 1 篇 college of autom...
  • 1 篇 qnap syst inc ta...
  • 1 篇 nanjing univ sci...
  • 1 篇 naist informat s...
  • 1 篇 avic xi''an figh...
  • 1 篇 heriot watt univ...

作者

  • 3 篇 salimi-badr armi...
  • 2 篇 zhang tong
  • 2 篇 ebadzadeh mohamm...
  • 2 篇 zhu junlong
  • 2 篇 zheng ruijuan
  • 1 篇 murata n
  • 1 篇 yu yu-hsiang
  • 1 篇 hao hu
  • 1 篇 jihong shen
  • 1 篇 wang xueqi
  • 1 篇 venayagamoorthy ...
  • 1 篇 tarela jm
  • 1 篇 erlina tati
  • 1 篇 kirby michael j.
  • 1 篇 park jb
  • 1 篇 breiman l
  • 1 篇 chen c. l. phili...
  • 1 篇 iannella n
  • 1 篇 yan shengnan
  • 1 篇 zhao xuhui

语言

  • 32 篇 英文
  • 5 篇 其他
  • 1 篇 中文
检索条件"主题词=Nonlinear Function Approximation"
38 条 记 录,以下是1-10 订阅
排序:
VOQL: Towards Optimal Regret in Model-free RL with nonlinear function approximation  36
VOQL: Towards Optimal Regret in Model-free RL with Nonlinear...
收藏 引用
36th Annual Conference on Learning Theory (COLT)
作者: Agarwal, Alekh Jin, Yujia Zhang, Tong Google Res Mountain View CA 94043 USA Stanford Univ Stanford CA 94305 USA
We study time-inhomogeneous episodic reinforcement learning (RL) under general function approximation and sparse rewards. We design a new algorithm, Variance-weighted Optimistic Q-Learning (VOQL), based on Q-learning ... 详细信息
来源: 评论
Forward Actor-Critic for nonlinear function approximation in Reinforcement Learning  16
Forward Actor-Critic for Nonlinear Function Approximation in...
收藏 引用
16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
作者: Veeriah, Vivek van Seijen, Harm Sutton, Richard S. Univ Alberta Dept Comp Sci Edmonton AB Canada Univ Alberta Edmonton AB Canada
Multi-step methods are important in reinforcement learning (RL). Eligibility traces, the usual way of handling them, works well with linear function approximators. Recently, van Seijen (2016) had introduced a delayed ... 详细信息
来源: 评论
Forward Actor-Critic for nonlinear function approximation in Reinforcement Learning  17
Forward Actor-Critic for Nonlinear Function Approximation in...
收藏 引用
International Conference on Autonomous Agents and Multiagent Systems
作者: Vivek Veeriah Harm van Seijen Richard S. Sutton Dept. of Computing Science University of Alberta
Multi-step methods are important in reinforcement learning (RL). Eligibility traces, the usual way of handling them, works well with linear function approximators. Recently, van Seijen (2016) had introduced a delayed ... 详细信息
来源: 评论
Service placement strategies in mobile edge computing based on an improved genetic algorithm
收藏 引用
PERVASIVE AND MOBILE COMPUTING 2024年 105卷
作者: Zheng, Ruijuan Xu, Junwei Wang, Xueqi Liu, Muhua Zhu, Junlong Henan Univ Sci & Technol Sch Informat Engn Luoyang 471023 Henan Peoples R China
In mobile edge computing (MEC), quality of service (QoS) is closely related to optimizing service placement strategies, which is crucial to providing efficient services that meet user needs. However, due to the mobili... 详细信息
来源: 评论
Adaptive temporal-difference learning via deep neural network function approximation: a non-asymptotic analysis
收藏 引用
COMPLEX & INTELLIGENT SYSTEMS 2025年 第2期11卷 1-19页
作者: Wang, Guoyong Fu, Tiange Zheng, Ruijuan Zhao, Xuhui Zhu, Junlong Zhang, Mingchuan Luoyang Inst Sci & Technol Sch Informat Engn Luoyang 471023 Peoples R China Longmen Lab Luoyang 471023 Peoples R China Henan Univ Sci & Technol Sch Informat Engn Luoyang 471023 Peoples R China
Although deep reinforcement learning has achieved notable practical achievements, its theoretical foundations have been scarcely explored until recent times. Nonetheless, the rate of convergence for current neural tem... 详细信息
来源: 评论
A data-driven implicit deep adaptive neuro-fuzzy inference system capable of manifold learning for function approximation
收藏 引用
APPLIED SOFT COMPUTING 2024年 155卷
作者: Salimi-Badr, Armin Shahid Beheshti Univ Fac Comp Sci & Engn Tehran Iran
Fuzzy Neural Networks (FNN) have the ability of decision-making based on constructing semi-ellipsoidal clusters in the input space as the antecedent parts of their fuzzy rules. To determine the output value for each i... 详细信息
来源: 评论
A novel learning algorithm based on computing the rules' desired outputs of a TSK fuzzy neural network with non-separable fuzzy rules
收藏 引用
NEUROCOMPUTING 2022年 第0期470卷 139-153页
作者: Salimi-Badr, Armin Ebadzadeh, Mohammad Mehdi Shahid Beheshti Univ Fac Comp Sci & Engn Tehran Iran Amirkabir Univ Technol Dept Comp Engn Tehran Iran
In this paper, a novel learning approach to train fuzzy neural networks' parameters based on calculating the desired outputs of their rules, is proposed. We describe the desired outputs of fuzzy rules as values th... 详细信息
来源: 评论
Stacked Broad Learning System: From Incremental Flatted Structure to Deep Model
收藏 引用
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2021年 第1期51卷 209-222页
作者: Liu, Zhulin Chen, C. L. Philip Feng, Shuang Feng, Qiying Zhang, Tong South China Univ Technol Sch Comp Sci & Engn Guangzhou 510641 Guangdong Peoples R China Univ Macau Fac Sci & Technol Macau Peoples R China Beijing Normal Univ Sch Appl Math Zhuhai 519087 Peoples R China
The broad learning system (BLS) has been proved to be effective and efficient lately. In this article, several deep variants of BLS are reviewed, and a new adaptive incremental structure, Stacked BLS, is proposed. The... 详细信息
来源: 评论
Rollout algorithm for light-weight physical-layer authentication in cognitive radio networks
收藏 引用
IET COMMUNICATIONS 2020年 第18期14卷 3128-3134页
作者: Yan, Shengnan Wang, Xiaoding Xu, Li Fujian Normal Univ Coll Math & Informat Fuzhou 350117 Fujian Peoples R China Fujian Normal Univ Key Lab Network Secur & Cryptol Fuzhou 350117 Fujian Peoples R China
Cognitive radio networks (CRNs) are vulnerable to spoofing attacks due to their wireless and cognitive nature. Since the traditional cryptographic authentication can hardly prevent such attacks in CRNs, the physical-l... 详细信息
来源: 评论
The True Online Continuous Learning Automation (TOCLA) in a continuous control benchmarking of actor-critic algorithms
The True Online Continuous Learning Automation (TOCLA) in a ...
收藏 引用
IEEE Symposium Series on Computational Intelligence (IEEE SSCI)
作者: Frost, Gordon Vallejo, Marta Heriot Watt Univ Sch Engn & Phys Sci Edinburgh Midlothian Scotland
Reinforcement learning problems are often discretised, use linear function approximation, or perform batch updates. However, many applications that can benefit from reinforcement learning contain continuous variables ... 详细信息
来源: 评论