咨询与建议

限定检索结果

文献类型

  • 7 篇 会议
  • 2 篇 期刊文献

馆藏范围

  • 9 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 7 篇 工学
    • 6 篇 计算机科学与技术...
    • 5 篇 电气工程
    • 2 篇 控制科学与工程
    • 1 篇 测绘科学与技术
  • 1 篇 理学
    • 1 篇 数学

主题

  • 9 篇 kanerva coding
  • 6 篇 reinforcement le...
  • 4 篇 function approxi...
  • 3 篇 tcp congestion c...
  • 3 篇 dynamic generali...
  • 2 篇 q-learning
  • 2 篇 state abstractio...
  • 1 篇 decision-making
  • 1 篇 routing
  • 1 篇 rough sets
  • 1 篇 predator-prey pu...
  • 1 篇 iot
  • 1 篇 mobile wireless ...
  • 1 篇 memory optimizat...
  • 1 篇 pursuit
  • 1 篇 tcp
  • 1 篇 robocup
  • 1 篇 generalisation

机构

  • 3 篇 northeastern uni...
  • 1 篇 univ essex dept ...
  • 1 篇 bangalore inst t...
  • 1 篇 northeastern uni...
  • 1 篇 bangalore inst t...
  • 1 篇 bangalore inst t...
  • 1 篇 northeastern uni...
  • 1 篇 northeastern uni...
  • 1 篇 soochow univ sch...

作者

  • 5 篇 meleis waleed
  • 5 篇 li wei
  • 3 篇 zhou fan
  • 2 篇 chowdhury kaushi...
  • 1 篇 waleed m. meleis
  • 1 篇 wu cheng
  • 1 篇 hu hs
  • 1 篇 fan zhou
  • 1 篇 wei li
  • 1 篇 waleed meleis
  • 1 篇 chowdhury kaushi...
  • 1 篇 kaushik chowdhur...
  • 1 篇 kostiadis k
  • 1 篇 sudha g.
  • 1 篇 nagaraj r.
  • 1 篇 cheng wu
  • 1 篇 vidyadhar varshi...

语言

  • 9 篇 英文
检索条件"主题词=Kanerva coding"
9 条 记 录,以下是1-10 订阅
排序:
Similarity-Aware kanerva coding for On-Line Reinforcement Learning  2018
Similarity-Aware Kanerva Coding for On-Line Reinforcement Le...
收藏 引用
2nd International Conference on Vision, Image and Signal Processing (ICVISP)
作者: Li, Wei Meleis, Waleed Northeastern Univ 360 Huntington Ave Boston MA 02115 USA
A major challenge in reinforcement learning (RL) is use of a tabular representation to represent learned policies with a large number of states or state-action pairs. Function approximation is a promising tool to over... 详细信息
来源: 评论
Dynamic Generalization kanerva coding in Reinforcement Learning for TCP Congestion Control Design  16
Dynamic Generalization Kanerva Coding in Reinforcement Learn...
收藏 引用
16th International Conference on Autonomous Agents and Multiagent Systems (AAMAS)
作者: Li, Wei Zhou, Fan Meleis, Waleed Chowdhury, Kaushik Northeastern Univ Dept Elect & Comp Engn Boston MA 02115 USA
Traditional reinforcement learning (RL) techniques often encounter limitations when solving large or continuous state-action spaces. Training times needed to explore the very large space are impractically long, and it... 详细信息
来源: 评论
Dynamic Generalization kanerva coding in Reinforcement Learning for TCP Congestion Control Design  17
Dynamic Generalization Kanerva Coding in Reinforcement Learn...
收藏 引用
Proceedings of the 16th Conference on Autonomous Agents and MultiAgent Systems
作者: Wei Li Fan Zhou Waleed Meleis Kaushik Chowdhury Northeastern University Boston MA USA
Traditional reinforcement learning (RL) techniques often encounter limitations when solving large or continuous state-action spaces. Training times needed to explore the very large space are impractically long, and it... 详细信息
来源: 评论
Sparse Distributed Memory Approach for Reinforcement Learning Driven Efficient Routing in Mobile Wireless Network System
收藏 引用
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS 2021年 第11期12卷 144-152页
作者: Vidyadhar, Varshini Nagaraj, R. Sudha, G. Bangalore Inst Technol Dept Comp Sci & Engn Bangalore Karnataka India Bangalore Inst Technol Dept Informat Sci & Engn Bangalore Karnataka India Bangalore Inst Technol Dept Elect & Elect Bangalore Karnataka India
In recent years, researchers have explored the applicability of Q-learning, a model-free reinforcement learning technology towards designing QoS-aware, resource-efficiency, and reliable routing techniquesin a dynamica... 详细信息
来源: 评论
QTCP: Adaptive Congestion Control with Reinforcement Learning
收藏 引用
IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING 2019年 第3期6卷 445-458页
作者: Li, Wei Zhou, Fan Chowdhury, Kaushik Roy Meleis, Waleed Northeastern Univ Dept Elect & Comp Engn Boston MA 02115 USA
Next generation network access technologies and Internet applications have increased the challenge of providing satisfactory quality of experience for users with traditional congestion control protocols. Efforts on op... 详细信息
来源: 评论
Learning-based and Data-driven TCP Design for Memory-constrained IoT  12
Learning-based and Data-driven TCP Design for Memory-constra...
收藏 引用
12th IEEE Annual International Conference on Distributed Computing in Sensor Systems (DCOSS)
作者: Li, Wei Zhou, Fan Meleis, Waleed Chowdhury, Kaushik Northeastern Univ Dept Elect & Comp Engn Boston MA USA
Advances in wireless technology have resulted in pervasive deployment of devices of a high variability in form factors, memory and computational ability. The need for maintaining continuous connections that deliver da... 详细信息
来源: 评论
Rough Sets-based Prototype Optimization in kanerva-based Function Approximation
Rough Sets-based Prototype Optimization in Kanerva-based Fun...
收藏 引用
IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)
作者: Wu, Cheng Li, Wei Meleis, Waleed Soochow Univ Sch Urban Rail Transportat Suzhou Peoples R China Northeastern Univ Dept Elect & Comp Engn Boston MA 02115 USA
Problems involving multi-agent systems can be complex and involve huge state-action spaces, making such problems difficult to solve. Function approximation schemes such as kanerva coding with dynamic, frequency-based ... 详细信息
来源: 评论
Adaptive kanerva-based function approximation for multi-agent systems  08
Adaptive Kanerva-based function approximation for multi-agen...
收藏 引用
Proceedings of the 7th international joint conference on Autonomous agents and multiagent systems - Volume 3
作者: Cheng Wu Waleed M. Meleis
In this paper, we show how adaptive prototype optimization can be used to improve the performance of function approximation based on kanerva coding when solving largescale instances of classic multi-agent problems. We... 详细信息
来源: 评论
KaBaGe-RL: kanerva-based generalisation and reinforcement learning for possession football
KaBaGe-RL: Kanerva-based generalisation and reinforcement le...
收藏 引用
IEEE Conference on Intelligent Robots and Systems (IROS 2001)
作者: Kostiadis, K Hu, HS Univ Essex Dept Comp Sci Colchester CO4 3SQ Essex England
The complexity of most modem systems prohibits a hand-coded approach to decision making. In addition, many problems have continuous or large discrete state spaces;some have large or continuous action spaces. The probl... 详细信息
来源: 评论