咨询与建议

限定检索结果

文献类型

  • 126 篇 期刊文献
  • 71 篇 会议

馆藏范围

  • 197 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 179 篇 工学
    • 91 篇 计算机科学与技术...
    • 77 篇 电气工程
    • 41 篇 控制科学与工程
    • 37 篇 信息与通信工程
    • 18 篇 石油与天然气工程
    • 12 篇 软件工程
    • 11 篇 机械工程
    • 10 篇 仪器科学与技术
    • 9 篇 动力工程及工程热...
    • 8 篇 电子科学与技术(可...
    • 4 篇 材料科学与工程(可...
    • 4 篇 交通运输工程
    • 4 篇 船舶与海洋工程
    • 3 篇 土木工程
    • 3 篇 环境科学与工程(可...
    • 2 篇 建筑学
    • 2 篇 水利工程
    • 2 篇 航空宇航科学与技...
    • 1 篇 光学工程
  • 36 篇 管理学
    • 34 篇 管理科学与工程(可...
    • 5 篇 工商管理
  • 23 篇 理学
    • 12 篇 数学
    • 5 篇 物理学
    • 5 篇 系统科学
    • 3 篇 化学
    • 2 篇 生物学
    • 1 篇 海洋科学
  • 5 篇 经济学
    • 4 篇 应用经济学
    • 2 篇 理论经济学
  • 2 篇 教育学
    • 2 篇 教育学
  • 1 篇 农学

主题

  • 197 篇 q-learning algor...
  • 51 篇 reinforcement le...
  • 11 篇 optimization
  • 10 篇 path planning
  • 10 篇 learning (artifi...
  • 9 篇 markov decision ...
  • 8 篇 q-learning
  • 7 篇 quality of servi...
  • 6 篇 heuristic algori...
  • 5 篇 convergence
  • 5 篇 mobile robot
  • 5 篇 resource allocat...
  • 5 篇 machine learning
  • 5 篇 dynamic scheduli...
  • 4 篇 internet of thin...
  • 4 篇 task analysis
  • 4 篇 automatic genera...
  • 4 篇 radio networks
  • 4 篇 jamming attack
  • 4 篇 cognitive radio ...

机构

  • 3 篇 natl taiwan univ...
  • 3 篇 mississippi stat...
  • 2 篇 hong kong polyte...
  • 2 篇 s china univ tec...
  • 2 篇 northeastern uni...
  • 2 篇 aristotle univ t...
  • 2 篇 nanjing tech uni...
  • 2 篇 northwestern pol...
  • 2 篇 univ sains malay...
  • 2 篇 nanyang technol ...
  • 2 篇 kun shan univ te...
  • 2 篇 mil acad tunisia...
  • 2 篇 nagoya univ dept...
  • 2 篇 hong kong polyte...
  • 2 篇 china commun inf...
  • 2 篇 jiangsu normal u...
  • 1 篇 nanjing univ pos...
  • 1 篇 beijing inst tec...
  • 1 篇 hainan inst zhej...
  • 1 篇 hangzhou dianzi ...

作者

  • 3 篇 scheers bart
  • 3 篇 stebel krzysztof
  • 3 篇 suandi shahrel a...
  • 3 篇 samma hussein
  • 3 篇 mohamad-saleh ju...
  • 3 篇 slimeni feten
  • 3 篇 chen jiann-liang
  • 3 篇 chtourou zied
  • 3 篇 le nir vincent
  • 2 篇 li ji
  • 2 篇 wang xingwei
  • 2 篇 xu yan
  • 2 篇 liu dexing
  • 2 篇 musial jakub
  • 2 篇 xu zhao
  • 2 篇 yang songpo
  • 2 篇 czeczot jacek
  • 2 篇 attia rabah
  • 2 篇 lu en
  • 2 篇 noori amin

语言

  • 191 篇 英文
  • 4 篇 其他
  • 2 篇 中文
  • 1 篇 德文
检索条件"主题词=Q-learning algorithm"
197 条 记 录,以下是111-120 订阅
排序:
A Reinforcement learning Method for Constraint-Satisfied Services Composition
收藏 引用
IEEE TRANSACTIONS ON SERVICES COMPUTING 2020年 第5期13卷 786-800页
作者: Ren, Lifang Wang, Wenjian Xu, Hang Shanxi Univ Sch Comp & Informat Technol Taiyuan 030006 Peoples R China Shanxi Univ Finance & Econ Sch Appl Math Taiyuan 030006 Peoples R China
With increasing adoption and presence of Web services, service composition becomes an effective way to construct software applications. Composite services need to satisfy both the functional and the non-functional req... 详细信息
来源: 评论
PID Controller Autotuning Design by a Deterministic q-SLP algorithm
收藏 引用
IEEE ACCESS 2020年 8卷 50010-50021页
作者: Pongfai, Jirapun Su, Xiaojie Zhang, Huiyan Assawinchaichote, Wudhichai King Mongkuts Univ Technol Thonburi Dept Elect & Telecommun Engn Fac Engn Bangkok 10140 Thailand Chongqing Univ Coll Automat Chongqing 400044 Peoples R China Chongqing Technol & Business Univ Natl Res Base Intelligent Mfg Serv Chongqing 400067 Peoples R China
The proportional integral and derivative (PID) controller is extensively applied in many applications. However, three parameters must be properly adjusted to ensure effective performance of the control system: the pro... 详细信息
来源: 评论
TABU SEARCH GUIDED BY REINFORCEMENT learning FOR THE MAX-MEAN DISPERSION PROBLEM
收藏 引用
JOURNAL OF INDUSTRIAL AND MANAGEMENT OPTIMIZATION 2021年 第6期17卷 3223-3246页
作者: Nijimbere, Dieudonne Zhao, Songzheng Gu, Xunhao Esangbedo, Moses Olabhele Dominique, Nyiribakwe Northwestern Polytech Univ Sch Management Xian 710072 Peoples R China Xidian Univ Dept Comp Sci & Technol Xian 710071 Peoples R China
We present an effective hybrid metaheuristic of integrating reinforcement learning with a tabu-search (RLTS) algorithm for solving the max- mean dispersion problem. The innovative element is to design using a knowledg... 详细信息
来源: 评论
learning-based secure communication against active eavesdropper in dynamic environment
收藏 引用
IET COMMUNICATIONS 2019年 第15期13卷 2235-2242页
作者: He, Dongxuan Wang, Hua Zhou, He Beijing Inst Technol Sch Informat & Elect Beijing Peoples R China
In this study, the authors propose a learning-based approach to improve the security of the authors' considered communication system in a dynamic environment, where a source transmits information to a legitimate r... 详细信息
来源: 评论
Embedding a priori knowledge in reinforcement learning
收藏 引用
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS 1998年 第1期21卷 51-71页
作者: Ribeiro, CHC Univ London Imperial Coll Sci Technol & Med Dept Elect & Elect Engn London SW7 2BT England
In the last years, temporal differences methods have been put forward as convenient tools for reinforcement learning. Techniques based on temporal differences, however, suffer from a serious drawback: as stochastic ad... 详细信息
来源: 评论
A smart home management system with hierarchical behavior suggestion and recovery mechanism
收藏 引用
COMPUTER STANDARDS & INTERFACES 2015年 41卷 98-111页
作者: Shen, Victor R. L. Yang, Cheng-Ying Chen, Chien Hung Natl Taipei Univ Coll Elect Engn & Comp Sci Dept Comp Sci & Informat Engn New Taipei City 237 Taiwan Univ Taipei Dept Comp Sci Taipei 100 Taiwan Natl Taipei Univ Grad Inst Elect Engn Coll Elect Engn & Comp Sci New Taipei City 237 Taiwan
We propose the hierarchical behavior suggestion system and recovery mechanism for the smart home management platform, including location layer, action layer, and home appliance layer. The smart home management system ... 详细信息
来源: 评论
Tool Path Optimization for Complex Cavity Milling Based on Reinforcement learning Approach
收藏 引用
IEEE ACCESS 2023年 11卷 66793-66807页
作者: Wan, Yi Xu, Wei Zuo, Tian-Yu Nanjing Xiaozhuang Univ Sch Environm Sci Nanjing 211171 Peoples R China Sanjiang Univ Sch Mech & Elect Engn Nanjing 210012 Peoples R China Nanjing Univ Informat Sci & Technol Sch Automat Nanjing 210044 Peoples R China
In the machining of parts, tool paths for complex cavity milling often have different generation options, as opposed to simple machining features. The different tool path generation options influence the machining tim... 详细信息
来源: 评论
learning-Based Modeling and Optimization for Real-Time System Availability
收藏 引用
IEEE TRANSACTIONS ON COMPUTERS 2021年 第4期70卷 581-594页
作者: Li, Liying Zhou, Junlong Wei, Tongquan Chen, Mingsong Hu, Xiaobo Sharon East China Normal Univ Sch Comp Sci & Technol Engn Res Ctr Software Hardware Codesign Technol & Shanghai 200062 Peoples R China Nanjing Univ Sci & Technol Sch Comp Sci & Engn Nanjing 210094 Peoples R China Univ Notre Dame Dept Comp Sci & Engn Notre Dame IN 46656 USA
As the density of integrated circuits continues to increase, the possibility that real-time systems suffer from soft and hard errors rises significantly, resulting in a degraded availability of system. In this article... 详细信息
来源: 评论
A reinforcement learning approach using Markov decision processes for battery energy storage control within a smart contract framework
收藏 引用
JOURNAL OF ENERGY STORAGE 2024年 86卷
作者: Jonban, Mansour Selseleh Romeral, Luis Marzband, Mousa Abusorrah, Abdullah Univ Politecn Cataluna Elect Engn Dept MCIA Ctr Terrassa Spain King Abdulaziz Univ Ctr Res Excellence Renewable Energy & Power Syst Jeddah 21589 Saudi Arabia King Abdulaziz Univ Fac Engn KA Care Energy Res & Innovat Ctr Dept Elect & Comp EngnRenewable Energy & Power Sy Jeddah 21589 Saudi Arabia
With the increasing penetration of renewable energy sources (RESs), the necessity for employing smart methods to control and manage energy has become undeniable. This study introduces a real -time energy management sy... 详细信息
来源: 评论
Simulation Model for the AGC System of Isolated Microgrid Based on q-learning Method
Simulation Model for the AGC System of Isolated Microgrid Ba...
收藏 引用
IEEE Data Driven Control and learning Systems Conference
作者: Penghu Wang Hao Tang Kai Lv School of Electrical Engineering and Automation Hefei University of Technology
The automatic generation control (AGC) in isolated microgrid with multiple distributed energy resources is concerned in this study. First, the load frequency control (LFC) model of an isolated microgrid, which contain... 详细信息
来源: 评论