咨询与建议

限定检索结果

文献类型

  • 128 篇 期刊文献
  • 71 篇 会议

馆藏范围

  • 199 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 181 篇 工学
    • 92 篇 计算机科学与技术...
    • 78 篇 电气工程
    • 41 篇 控制科学与工程
    • 39 篇 信息与通信工程
    • 18 篇 石油与天然气工程
    • 12 篇 软件工程
    • 11 篇 机械工程
    • 10 篇 仪器科学与技术
    • 9 篇 动力工程及工程热...
    • 9 篇 电子科学与技术(可...
    • 5 篇 材料科学与工程(可...
    • 4 篇 交通运输工程
    • 4 篇 船舶与海洋工程
    • 3 篇 土木工程
    • 3 篇 环境科学与工程(可...
    • 2 篇 建筑学
    • 2 篇 水利工程
    • 2 篇 航空宇航科学与技...
    • 1 篇 光学工程
  • 36 篇 管理学
    • 34 篇 管理科学与工程(可...
    • 5 篇 工商管理
  • 23 篇 理学
    • 12 篇 数学
    • 5 篇 物理学
    • 5 篇 系统科学
    • 3 篇 化学
    • 2 篇 生物学
    • 1 篇 海洋科学
  • 5 篇 经济学
    • 4 篇 应用经济学
    • 2 篇 理论经济学
  • 2 篇 教育学
    • 2 篇 教育学
  • 1 篇 农学

主题

  • 199 篇 q-learning algor...
  • 51 篇 reinforcement le...
  • 11 篇 path planning
  • 11 篇 optimization
  • 10 篇 learning (artifi...
  • 9 篇 markov decision ...
  • 8 篇 q-learning
  • 7 篇 quality of servi...
  • 6 篇 heuristic algori...
  • 5 篇 convergence
  • 5 篇 mobile robot
  • 5 篇 resource allocat...
  • 5 篇 machine learning
  • 5 篇 dynamic scheduli...
  • 4 篇 internet of thin...
  • 4 篇 task analysis
  • 4 篇 automatic genera...
  • 4 篇 radio networks
  • 4 篇 jamming attack
  • 4 篇 cognitive radio ...

机构

  • 3 篇 natl taiwan univ...
  • 3 篇 mississippi stat...
  • 2 篇 hong kong polyte...
  • 2 篇 s china univ tec...
  • 2 篇 northeastern uni...
  • 2 篇 aristotle univ t...
  • 2 篇 nanjing tech uni...
  • 2 篇 northwestern pol...
  • 2 篇 univ sains malay...
  • 2 篇 nanyang technol ...
  • 2 篇 kun shan univ te...
  • 2 篇 mil acad tunisia...
  • 2 篇 nagoya univ dept...
  • 2 篇 hong kong polyte...
  • 2 篇 china commun inf...
  • 2 篇 jiangsu normal u...
  • 1 篇 nanjing univ pos...
  • 1 篇 beijing inst tec...
  • 1 篇 hainan inst zhej...
  • 1 篇 hangzhou dianzi ...

作者

  • 3 篇 scheers bart
  • 3 篇 stebel krzysztof
  • 3 篇 suandi shahrel a...
  • 3 篇 samma hussein
  • 3 篇 mohamad-saleh ju...
  • 3 篇 slimeni feten
  • 3 篇 chen jiann-liang
  • 3 篇 chtourou zied
  • 3 篇 le nir vincent
  • 2 篇 li ji
  • 2 篇 wang xingwei
  • 2 篇 xu yan
  • 2 篇 liu dexing
  • 2 篇 musial jakub
  • 2 篇 xu zhao
  • 2 篇 yang songpo
  • 2 篇 czeczot jacek
  • 2 篇 attia rabah
  • 2 篇 lu en
  • 2 篇 noori amin

语言

  • 192 篇 英文
  • 4 篇 其他
  • 2 篇 中文
检索条件"主题词=Q-Learning Algorithm"
199 条 记 录,以下是31-40 订阅
排序:
Model-free extended q-learning method for H∞, output tracking control of networked control systems with network delays and packet loss
收藏 引用
NEUROCOMPUTING 2025年 634卷
作者: Hao, Longyan Wang, Chaoli Liang, Dong Li, Shihua Univ Shanghai Sci & Technol Dept Control Sci & Engn Shanghai 200093 Peoples R China Southeast Univ Sch Automat Nanjing 211189 Peoples R China
In this paper, the extended q-learning method is used to study the HPo output tracking control (HOTC) problem of networked control systems with state delay and data loss. Compared with the existing results, the networ... 详细信息
来源: 评论
The coevolution of cooperation: Integrating q-learning and occasional social interactions in evolutionary games
收藏 引用
CHAOS SOLITONS & FRACTALS 2025年 194卷
作者: Lin, Jiaying Long, Pinduo Liang, Jinfeng Dai, qionglin Li, Haihong Yang, Junzhong Beijing Univ Posts & Telecommun Sch Sci Beijing 100876 Peoples R China Beijing Normal Univ Sch Syst Sci Beijing 100875 Peoples R China Beijing Univ Posts & Telecommun Key Lab Math & Informat Networks Minist Educ Beijing Peoples R China
This study explores the emergence and maintenance of cooperation in evolutionary game theory by incorporating occasional social interactions into q-learning algorithms. We model the dynamics on a square lattice, where... 详细信息
来源: 评论
Event-Triggered Data-Driven Control of Nonlinear Systems via q-learning
收藏 引用
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS 2025年 第2期55卷 1069-1077页
作者: Shen, Mouquan Wang, Xianming Zhu, Song Huang, Tingwen Wang, qing-Guo Nanjing Tech Univ Coll Elect Engn & Control Sci Nanjing 211816 Peoples R China Nanjing Tech Univ Sch Mech & Power Engn Nanjing 211816 Peoples R China China Univ Min & Technol Sch Math Xuzhou 221116 Peoples R China Shenzhen Univ Adv Technol Fac Comp Sci & Control Engn Shenzhen 518055 Peoples R China Beijing Normal Univ Inst Artificial Intelligence & Future Networks Zhuhai 519087 Peoples R China BNU HKBU United Int Coll Guangdong Key Lab AI & Multimodal Data Proc Zhuhai 519087 Peoples R China Univ Johannesburg Inst Intelligent Syst Fac Engn & Built Environm ZA-2006 Johannesburg South Africa
This article aims to study event-triggered data-driven control of nonlinear systems via q-learning. An input-output mapping is described by a pseudo-partial derivatives form. A q-learning-based optimization criterion ... 详细信息
来源: 评论
Optimizing Subchannel Assignment and Power Allocation for Network Slicing in High-Density NOMA Networks: A q-learning Approach
收藏 引用
IEEE ACCESS 2025年 13卷 24323-24335页
作者: Solaiman, Suhare Taif Univ Coll Comp & Informat Technol Dept Comp Sci Taif 21944 Saudi Arabia
The growing number of connected devices in high-density environments poses serious challenges for accommodating and managing these devices across different network slicing services, such as ultra-reliable low-latency ... 详细信息
来源: 评论
learning to select operators in meta-heuristics: An integration of q-learning into the iterated greedy algorithm for the permutation flowshop scheduling problem
收藏 引用
EUROPEAN JOURNAL OF OPERATIONAL RESEARCH 2023年 第3期304卷 1296-1330页
作者: Karimi-Mamaghan, Maryam Mohammadi, Mehrdad Pasdeloup, Bastien Meyer, Patrick IMT Atlantique Lab STICC UMR CNRS 6285 F-29238 Brest France
This paper aims at integrating machine learning techniques into meta-heuristics for solving combinato-rial optimization problems. Specifically, our study develops a novel efficient iterated greedy algorithm based on r... 详细信息
来源: 评论
Relay selection algorithm based on social network combined with q-learning for vehicle D2D communication
收藏 引用
IET COMMUNICATIONS 2019年 第20期13卷 3582-3587页
作者: qian, Hongzhi Yu, Jinming Hua, Licheng Donghua Univ Coll Informat Sci & Technol Shanghai 201620 Peoples R China Ningbo Univ Fac Mech Engn & Mech Ningbo 315211 Zhejiang Peoples R China
A relay selection algorithm was proposed to improve a communication rate of D2D (device-to-device) users in-vehicle networking communication systems based on social network combined with q-learning. The scheme was div... 详细信息
来源: 评论
q-learning whale optimization algorithm for test suite generation with constraints support
收藏 引用
NEURAL COMPUTING & APPLICATIONS 2023年 第34期35卷 24069-24090页
作者: Hassan, Ali Abdullah Abdullah, Salwani Zamli, Kamal Z. Razali, Rozilawati Univ Kebangsaan Malaysia Fac Informat Sci & Technol Bangi 43600 Selangor Malaysia Univ Malaysia Pahang Al Sultan Abdullah Fac Comp Pekan 26600 Pahang Malaysia Univ Airlangga Fac Sci & Technol Campus JI Dr H Soekamo C Surabaya 60115 Indonesia
This paper introduces a new variant of a metaheuristic algorithm based on the whale optimization algorithm (WOA), the q-learning algorithm and the Exponential Monte Carlo Acceptance Probability called (qWOA-EMC). Unli... 详细信息
来源: 评论
q-learning-based simulated annealing algorithm for constrained engineering design problems
收藏 引用
NEURAL COMPUTING & APPLICATIONS 2020年 第9期32卷 5147-5161页
作者: Samma, Hussein Mohamad-Saleh, Junita Suandi, Shahrel Azmin Lahasan, Badr Univ Sains Malaysia Sch Elect & Elect Engn Intelligent Biometr Grp Engn Campus Nibong Tebal 14300 Penang Malaysia Univ Aden Fac Educ Shabwa Dept Comp Programming Aden Yemen
Simulated annealing (SA) was recognized as an effective local search optimizer, and it showed a great success in many real-world optimization problems. However, it has slow convergence rate and its performance is wide... 详细信息
来源: 评论
The q-learning obstacle avoidance algorithm based on EKF-SLAM for NAO autonomous walking under unknown environments
收藏 引用
ROBOTICS AND AUTONOMOUS SYSTEMS 2015年 72卷 29-36页
作者: Wen, Shuhuan Chen, Xiao Ma, Chunli Lam, H. K. Hua, Shaoyang Yanshan Univ Key Lab Ind Comp Control Engn Hebei Prov Qinhuangdao Peoples R China Kings Coll London Dept Informat London WC2R 2LS England
The two important problems of SLAM and Path planning are often addressed independently. However, both are essential to achieve successfully autonomous navigation. In this paper, we aim to integrate the two attributes ... 详细信息
来源: 评论
Inverse q-learning Optimal Control for Takagi-Sugeno Fuzzy Systems
收藏 引用
IEEE Transactions on Fuzzy Systems 2025年
作者: Song, Wenting Ning, Jun Tong, Shaocheng Liaoning University of Technology College of Science Jinzhou121000 China Dalian Maritime University Navigation College Dalian116026 China
Inverse reinforcement learning optimal control is under the framework of learner-expert, the learner system can learn expert system's trajectory and optimal control policy via a reinforcement learning algorithm an... 详细信息
来源: 评论