咨询与建议

限定检索结果

文献类型

  • 126 篇 期刊文献
  • 71 篇 会议

馆藏范围

  • 197 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 179 篇 工学
    • 91 篇 计算机科学与技术...
    • 77 篇 电气工程
    • 41 篇 控制科学与工程
    • 37 篇 信息与通信工程
    • 18 篇 石油与天然气工程
    • 12 篇 软件工程
    • 11 篇 机械工程
    • 10 篇 仪器科学与技术
    • 9 篇 动力工程及工程热...
    • 8 篇 电子科学与技术(可...
    • 4 篇 材料科学与工程(可...
    • 4 篇 交通运输工程
    • 4 篇 船舶与海洋工程
    • 3 篇 土木工程
    • 3 篇 环境科学与工程(可...
    • 2 篇 建筑学
    • 2 篇 水利工程
    • 2 篇 航空宇航科学与技...
    • 1 篇 光学工程
  • 36 篇 管理学
    • 34 篇 管理科学与工程(可...
    • 5 篇 工商管理
  • 23 篇 理学
    • 12 篇 数学
    • 5 篇 物理学
    • 5 篇 系统科学
    • 3 篇 化学
    • 2 篇 生物学
    • 1 篇 海洋科学
  • 5 篇 经济学
    • 4 篇 应用经济学
    • 2 篇 理论经济学
  • 2 篇 教育学
    • 2 篇 教育学
  • 1 篇 农学

主题

  • 197 篇 q-learning algor...
  • 51 篇 reinforcement le...
  • 11 篇 optimization
  • 10 篇 path planning
  • 10 篇 learning (artifi...
  • 9 篇 markov decision ...
  • 8 篇 q-learning
  • 7 篇 quality of servi...
  • 6 篇 heuristic algori...
  • 5 篇 convergence
  • 5 篇 mobile robot
  • 5 篇 resource allocat...
  • 5 篇 machine learning
  • 5 篇 dynamic scheduli...
  • 4 篇 internet of thin...
  • 4 篇 task analysis
  • 4 篇 automatic genera...
  • 4 篇 radio networks
  • 4 篇 jamming attack
  • 4 篇 cognitive radio ...

机构

  • 3 篇 natl taiwan univ...
  • 3 篇 mississippi stat...
  • 2 篇 hong kong polyte...
  • 2 篇 s china univ tec...
  • 2 篇 northeastern uni...
  • 2 篇 aristotle univ t...
  • 2 篇 nanjing tech uni...
  • 2 篇 northwestern pol...
  • 2 篇 univ sains malay...
  • 2 篇 nanyang technol ...
  • 2 篇 kun shan univ te...
  • 2 篇 mil acad tunisia...
  • 2 篇 nagoya univ dept...
  • 2 篇 hong kong polyte...
  • 2 篇 china commun inf...
  • 2 篇 jiangsu normal u...
  • 1 篇 nanjing univ pos...
  • 1 篇 beijing inst tec...
  • 1 篇 hainan inst zhej...
  • 1 篇 hangzhou dianzi ...

作者

  • 3 篇 scheers bart
  • 3 篇 stebel krzysztof
  • 3 篇 suandi shahrel a...
  • 3 篇 samma hussein
  • 3 篇 mohamad-saleh ju...
  • 3 篇 slimeni feten
  • 3 篇 chen jiann-liang
  • 3 篇 chtourou zied
  • 3 篇 le nir vincent
  • 2 篇 li ji
  • 2 篇 wang xingwei
  • 2 篇 xu yan
  • 2 篇 liu dexing
  • 2 篇 musial jakub
  • 2 篇 xu zhao
  • 2 篇 yang songpo
  • 2 篇 czeczot jacek
  • 2 篇 attia rabah
  • 2 篇 lu en
  • 2 篇 noori amin

语言

  • 191 篇 英文
  • 4 篇 其他
  • 2 篇 中文
  • 1 篇 德文
检索条件"主题词=Q-Learning algorithm"
197 条 记 录,以下是101-110 订阅
排序:
Reinforcement learning based mainline dynamic speed limit adjustment of expressway off-ramp upstream under connected and autonomous vehicles environment
收藏 引用
IET INTELLIGENT TRANSPORT SYSTEMS 2022年 第12期16卷 1809-1819页
作者: Xiao, Daiquan Kang, Shengyang Xu, Xuecai Shen, Zhenwu Huazhong Univ Sci & Technol Sch Civil & Hydraul Engn Wuhan 430074 Peoples R China Shenzhen Urban Transport Planning Ctr Co Ltd Shenzhen Peoples R China Wuhan Huake Quanda Transport Planning & Design Co Wuhan Peoples R China
With the rapid progress of urbanization and continuous increasing of automobiles, expressway on- and off-ramp area becomes the bottleneck, and recurrent congestion occurs frequently. In order to solve the problem of t... 详细信息
来源: 评论
Adaptive hysteresis compensation control of a macro-fiber composite bimorph by improved reinforcement learning
收藏 引用
JOURNAL OF INTELLIGENT MATERIAL SYSTEMS AND STRUCTURES 2024年 第19期35卷 1471-1482页
作者: Li, Xingqiu Hu, Kaiming Li, Hua Wang, Ban Xu, Suan He, Yuchen China Jiliang Univ Sch Mech & Elect Engn 258 Xueyuan St Hangzhou 310018 Zhejiang Peoples R China Zhejiang Univ Sch Aeronaut & Astronaut Hangzhou Zhejiang Peoples R China Hangzhou City Univ Dept Mech Engn Hangzhou Zhejiang Peoples R China
The hysteresis characteristics related to the frequency and amplitude of the control signal seriously affect the precision of the displacement tracking control of the macro-fiber composite (MFC) bimorph. The tradition... 详细信息
来源: 评论
Maximum entropy-based optimal threshold selection using deterministic reinforcement learning with controlled randomization
收藏 引用
SIGNAL PROCESSING 2002年 第7期82卷 993-1006页
作者: Yin, PY Ming Chuan Univ Dept Informat Management Tao Yuan 333 Taiwan
Traditional maximum entropy-based thresholding methods are very popular and efficient in the case of bilevel thresholding. But they are very computationally expensive when extended to multilevel thresholding since the... 详细信息
来源: 评论
Code Dissemination of Long Chain Wireless Sensor Networks Based on q-learning
Code Dissemination of Long Chain Wireless Sensor Networks Ba...
收藏 引用
第四届材料科学应用与能源材料国际研讨会
作者: Ming Yue Wang Gui Geng Zeng College of Telecommunication & Information Engineering NJUPT
Long chain wireless sensor networks have been applied in a variety of applications, such as railway lines and power lines. However, there are few researches on code dissemination protocols under long chain topology. I... 详细信息
来源: 评论
Optimal Values Selection of q-learning Parameters in Stochastic Mazes
Optimal Values Selection of Q-learning Parameters in Stochas...
收藏 引用
作者: Xiaolin Zhou School of Mathematics and Information Sciences Guangzhou University
The model-free characteristic of the q-learning algorithm, without obtaining information about the environment and being available for agents to learn by themselves, enables q-learning to be widely applied to path pla... 详细信息
来源: 评论
Multi-objective traffic signal control model for traffic management
收藏 引用
TRANSPORTATION LETTERS-THE INTERNATIONAL JOURNAL OF TRANSPORTATION RESEARCH 2015年 第4期7卷 196-200页
作者: Long, q. Zhang, J. -F. Zhou, Z. -M. Hunan City Univ Sch Civil Engn Yiyang 413000 Hunan Peoples R China
Traffic signals are one of the main traffic management tools used to control traffic flow on the roads and should reflect traffic managers' intentions in different tasks. This paper showed a multi-objective optimi... 详细信息
来源: 评论
Interference Mitigation for Coexisting Wireless Body Area Networks: Distributed learning Solutions
收藏 引用
IEEE ACCESS 2020年 8卷 24209-24218页
作者: George, Emy Mariam Jacob, Lillykutty Natl Inst Technol Calicut Kozhikode 673601 India
When multiple wireless body area networks (WBANs) exist in close proximity to each other, the inter-user interference considerably degrades the signal to interference plus noise ratio of the packets arriving at each W... 详细信息
来源: 评论
A communication security anti-interference decision model using deep learning in intelligent industrial IoT environment
收藏 引用
SOFT COMPUTING 2022年 第16期26卷 7993-8002页
作者: Yan, Lichao Hu, Juan Wang, Yi Zheng, Ning Di, Jinhong Zhengzhou Univ Aeronaut Sch Intelligent Engn 15 Wenyuan West Rd Zhengzhou 450046 Henan Peoples R China
To traditional anti-jamming decision algorithm that cannot meet the security needs of smart city development, this paper proposes a communication security anti-interference decision algorithm using deep learning in an... 详细信息
来源: 评论
Research on path planning algorithm of mobile robot based on reinforcement learning
收藏 引用
SOFT COMPUTING 2022年 第18期26卷 8961-8970页
作者: Pan, Guoqian Xiang, Yong Wang, Xiaorui Yu, Zhongquan Zhou, Xinzhi Sichuan Univ Coll Elect & Informat Engn Chengdu Sichuan Peoples R China CAAC Res Inst 2 Chengdu Sichuan Peoples R China Civil Aviat Logist Technol Co Ltd Chengdu Sichuan Peoples R China
In order to solve the problems of low learning efficiency and slow convergence speed when mobile robot uses reinforcement learning method for path planning in complex environment, a reinforcement learning method based... 详细信息
来源: 评论
Distributed multi-agent scheme support for service continuity in IMS-4G-Cloud networks
收藏 引用
COMPUTERS & ELECTRICAL ENGINEERING 2015年 42卷 49-59页
作者: Hsieh, Han-Chuan Chen, Jiann-Liang Natl Taiwan Univ Sci & Technol Dept Elect Engn Taipei Taiwan
In this study, the quality of Service (qoS) needed to support service continuity in heterogeneous networks is achieved by a Distributed Multi-Agent Scheme (DMAS) based on cooperation concepts and an awareness algorith... 详细信息
来源: 评论