咨询与建议

限定检索结果

文献类型

  • 126 篇 期刊文献
  • 71 篇 会议

馆藏范围

  • 197 篇 电子文献
  • 0 种 纸本馆藏

日期分布

学科分类号

  • 179 篇 工学
    • 91 篇 计算机科学与技术...
    • 77 篇 电气工程
    • 41 篇 控制科学与工程
    • 37 篇 信息与通信工程
    • 18 篇 石油与天然气工程
    • 12 篇 软件工程
    • 11 篇 机械工程
    • 10 篇 仪器科学与技术
    • 9 篇 动力工程及工程热...
    • 8 篇 电子科学与技术(可...
    • 4 篇 材料科学与工程(可...
    • 4 篇 交通运输工程
    • 4 篇 船舶与海洋工程
    • 3 篇 土木工程
    • 3 篇 环境科学与工程(可...
    • 2 篇 建筑学
    • 2 篇 水利工程
    • 2 篇 航空宇航科学与技...
    • 1 篇 光学工程
  • 36 篇 管理学
    • 34 篇 管理科学与工程(可...
    • 5 篇 工商管理
  • 23 篇 理学
    • 12 篇 数学
    • 5 篇 物理学
    • 5 篇 系统科学
    • 3 篇 化学
    • 2 篇 生物学
    • 1 篇 海洋科学
  • 5 篇 经济学
    • 4 篇 应用经济学
    • 2 篇 理论经济学
  • 2 篇 教育学
    • 2 篇 教育学
  • 1 篇 农学

主题

  • 197 篇 q-learning algor...
  • 51 篇 reinforcement le...
  • 11 篇 optimization
  • 10 篇 path planning
  • 10 篇 learning (artifi...
  • 9 篇 markov decision ...
  • 8 篇 q-learning
  • 7 篇 quality of servi...
  • 6 篇 heuristic algori...
  • 5 篇 convergence
  • 5 篇 mobile robot
  • 5 篇 resource allocat...
  • 5 篇 machine learning
  • 5 篇 dynamic scheduli...
  • 4 篇 internet of thin...
  • 4 篇 task analysis
  • 4 篇 automatic genera...
  • 4 篇 radio networks
  • 4 篇 jamming attack
  • 4 篇 cognitive radio ...

机构

  • 3 篇 natl taiwan univ...
  • 3 篇 mississippi stat...
  • 2 篇 hong kong polyte...
  • 2 篇 s china univ tec...
  • 2 篇 northeastern uni...
  • 2 篇 aristotle univ t...
  • 2 篇 nanjing tech uni...
  • 2 篇 northwestern pol...
  • 2 篇 univ sains malay...
  • 2 篇 nanyang technol ...
  • 2 篇 kun shan univ te...
  • 2 篇 mil acad tunisia...
  • 2 篇 nagoya univ dept...
  • 2 篇 hong kong polyte...
  • 2 篇 china commun inf...
  • 2 篇 jiangsu normal u...
  • 1 篇 nanjing univ pos...
  • 1 篇 beijing inst tec...
  • 1 篇 hainan inst zhej...
  • 1 篇 hangzhou dianzi ...

作者

  • 3 篇 scheers bart
  • 3 篇 stebel krzysztof
  • 3 篇 suandi shahrel a...
  • 3 篇 samma hussein
  • 3 篇 mohamad-saleh ju...
  • 3 篇 slimeni feten
  • 3 篇 chen jiann-liang
  • 3 篇 chtourou zied
  • 3 篇 le nir vincent
  • 2 篇 li ji
  • 2 篇 wang xingwei
  • 2 篇 xu yan
  • 2 篇 liu dexing
  • 2 篇 musial jakub
  • 2 篇 xu zhao
  • 2 篇 yang songpo
  • 2 篇 czeczot jacek
  • 2 篇 attia rabah
  • 2 篇 lu en
  • 2 篇 noori amin

语言

  • 191 篇 英文
  • 4 篇 其他
  • 2 篇 中文
  • 1 篇 德文
检索条件"主题词=Q-learning algorithm"
197 条 记 录,以下是151-160 订阅
排序:
Dynamic Joint Decision on Price and Delivery Date in MTO Manufacturer Based on Agent
Dynamic Joint Decision on Price and Delivery Date in MTO Man...
收藏 引用
3rd International Conference on Energy, Environment and Sustainable Development (EESD 2013)
作者: Hao, Juan Yu, Jianjun Wu, Miancan South China Univ Technol Sch Business Adm Guangzhou Guangdong Peoples R China Guangdong Univ Foreign Studies Cisco Sch Informat Guangzhou Peoples R China
In order to maximize the total profit and improve the service level, based on the perspective of queuing theory, a new approach for dynamic joint decision on price and delivery date in Make-to-order (MTO) manufacturin... 详细信息
来源: 评论
A UAV Dynamic Path Planning algorithm  35
A UAV Dynamic Path Planning Algorithm
收藏 引用
35th Youth Academic Annual Conference of Chinese-Association-of-Automation (YAC)
作者: Hou, Xiaojian Liu, Fei Wang, Renjie Yu, Yao Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing 100083 Peoples R China Univ Sci & Technol Beijing Sch Automat & Elect Engn Beijing Engn Res Ctr Ind Spectrum Imaging Beijing 100083 Peoples R China
In this paper, we propose a UAV dynamic path planning algorithm to solve the path planning problem of a single UAV in a dynamic environment. The contributions of this paper mainly include the following two folds: (1) ... 详细信息
来源: 评论
Adaptive Inventory Control and Bullwhip Effect Analysis for Supply Chains with Non-stationary Demand  27
Adaptive Inventory Control and Bullwhip Effect Analysis for ...
收藏 引用
27th Chinese Control and Decision Conference (CCDC)
作者: Yang, Songpo Zhang, Jihui Qingdao Univ Inst Complex Sci Qingdao 266071 Peoples R China
In this paper, two adaptive inventory control models, i.e. centralized and decentralized respectively, for a multi-echelon multi-cycle supply chain consisting of one supplier and one retailer with non-stationary stoch... 详细信息
来源: 评论
Agent-Based Simulation of Power Markets under Uniform and Pay-as-Bid Pricing Rules using Reinforcement learning
Agent-Based Simulation of Power Markets under Uniform and Pa...
收藏 引用
IEEE/PES Power Systems Conference and Exposition
作者: Bakirtzis, Anastasios G. Tellidou, Athina C. Aristotle Univ Thessaloniki Dept Elect & Comp Engn Thessaloniki 54124 Greece
In this paper agent-based simulation is employed to study the power market operation under two alternative pricing systems: uniform and discriminatory (pay-as-bid). Power suppliers are modeled as adaptive agents capab... 详细信息
来源: 评论
Privacy-Cost Management in Smart Meters Using Deep Reinforcement learning  10
Privacy-Cost Management in Smart Meters Using Deep Reinforce...
收藏 引用
10th IEEE PES Innovative Smart Grid Technologies Europe (ISGT-Europe) - Smart Grids - Key Enablers of a Green Power System
作者: Shateri, Mohammadhadi Messina, Francisco Piantanida, Pablo Labeau, Fabrice McGill Univ Montreal PQ Canada Univ Paris Sud CNRS Cent Supelec Gif Sur Yvette France
Smart meters (SMs) play a pivotal rule in the smart grid by being able to report the electricity usage of consumers to the utility provider (UP) almost in real-time. However, this could leak sensitive information abou... 详细信息
来源: 评论
Hierarchical Multi-agent System in Traffic Network Signalization with Improved Genetic algorithm
Hierarchical Multi-agent System in Traffic Network Signaliza...
收藏 引用
IEEE International Conference on Artificial Intelligence in Engineering and Technology (IICAIET)
作者: Tan, Min Keng Chuo, Helen Sin Ee Chin, Renee Ka Yin Yeo, Kiam Beng Teo, Kenneth Tze Kin Univ Malaysia Sabah Fac Engn Modelling Simulat & Comp Lab Kota Kinabalu Malaysia Univ Malaysia Sabah Fac Med & Hlth Sci Kota Kinabalu Malaysia
Instead of using classical offline data-driven optimization technique in traffic network signal control, this work aims to explore the potential of implementing an online data-driven optimization technique. A dynamic ... 详细信息
来源: 评论
A Reinforcement learning Approach to Dynamic Optimization of Load Allocation in AGC System
A Reinforcement Learning Approach to Dynamic Optimization of...
收藏 引用
General Meeting of the IEEE-Power-and-Energy-Society
作者: Wang, Y. M. Liu, q. J. Yu, T. S China Univ Technol Elect Power Coll Guangzhou Guangdong Peoples R China
A Reinforcement learning (RL) method applied to the dynamic load allocation in AGC system is presented. The problem can be modeled as a Markov Decision Process (MDP). The q-learning algorithm as a model-free learning ... 详细信息
来源: 评论
qoS-Aware Heterogeneous Networking Using Distributed Multiagent Schemes
QoS-Aware Heterogeneous Networking Using Distributed Multiag...
收藏 引用
7th IEEE International Wireless Communications and Mobile Computing Conference (IWCMC)
作者: Chen, Jiann-Liang Larosa, Yanuarius Teofilus Deng, Der-Jiunn Yang, Pei-Jia Ma, Yi-Wei Natl Changhua Univ Educ Dept Comp Sci & Informat Engn Taipei Taiwan Natl Taiwan Univ Sci & Technol Dept Elect Engn Taipei Taiwan Natl Cheng Kung Univ Dept Engn Sci Tainan Taiwan
This study achieves quality-of-Service (qoS) management in heterogeneous networking using a distributed multiagent scheme (DMAS) based on the concept of cooperation and the awareness algorithm. The proposed scheme is ... 详细信息
来源: 评论
Reinforcement learning algorithms in Global Path Planning for Mobile Robot
Reinforcement Learning Algorithms in Global Path Planning fo...
收藏 引用
International Conference on Industrial Engineering, Applications and Manufacturing (ICIEAM)
作者: Sichkar, Valentyn N. ITMO Univ Dept Control Syst & Robot St Petersburg Russia
The paper is devoted to the research of two approaches for global path planning for mobile robots, based on q-learning and Sarsa algorithms. The study has been done with different adjustments of two algorithms that ma... 详细信息
来源: 评论
Cognitive Radio Jamming Mitigation using Markov Decision Process and Reinforcement learning
Cognitive Radio Jamming Mitigation using Markov Decision Pro...
收藏 引用
International Conference on Advanced Wireless Information and Communication Technologies (AWICT)
作者: Slimeni, Feten Scheers, Bart Chtourou, Zied Le Nir, Vincent Attia, Rabah Mil Acad Tunisia VRIT Lab Nabeul 8000 Tunisia Royal Mil Acad CISS Dept B-1000 Brussels Belgium EPT Univ Carthage SERCOM Lab Marsa 2078 Tunisia
The Cognitive radio technology is a promising solution to the imbalance between scarcity and under utilization of the spectrum. However, this technology is susceptible to both classical and advanced jamming attacks wh... 详细信息
来源: 评论