Refine search results

Document type

  • 6 records: conference papers
  • 5 records: journal articles
  • 1 record: thesis

Collection scope

  • 12 records: electronic documents
  • 0 titles: print holdings

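Subject classification

  • 10 records: Engineering
    • 8 records: Computer Science and Technology...
    • 7 records: Electrical Engineering
    • 4 records: Information and Communication Engineering
    • 2 records: Control Science and Engineering
    • 1 record: Power Engineering and Engineering Thermo...
    • 1 record: Electronic Science and Technology (may...
    • 1 record: Petroleum and Natural Gas Engineering
    • 1 record: Transportation Engineering
    • 1 record: Software Engineering
    • 1 record: Safety Science and Engineering
    • 1 record: Cyberspace Security
  • 3 records: Management
    • 3 records: Management Science and Engineering (may...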

Topic

  • 12 records: reinforcement le...
  • 3 records: learning (artifi...
  • 2 records: simulation
  • 2 records: heuristic algori...
  • 1 record: consumers
  • 1 record: function approxi...
  • 1 record: anomaly mining
  • 1 record: reinforcement le...
  • 1 record: power systems
  • 1 record: gym
  • 1 record: base stations
  • 1 record: decision support...
  • 1 record: deep learning
  • 1 record: cognition
  • 1 record: full-duplex comm...
  • 1 record: data objects
  • 1 record: intelligent agen...
  • 1 record: constraint gener...
  • 1 record: function approxi...
  • 1 record: signal to noise ...

Institution

  • 1 record: higher sch commu...
  • 1 record: georgia inst tec...
  • 1 record: inst markets tec...
  • 1 record: al zahra coll wo...
  • 1 record: university of ca...
  • 1 record: polytech inst po...
  • 1 record: united int univ ...
  • 1 record: shanghai polytec...
  • 1 record: facri sci & tech...
  • 1 record: nanjing univ aer...
  • 1 record: kocaeli univ bil...
  • 1 record: csg ehv power tr...
  • 1 record: mergers & acquis...
  • 1 record: univ pisa dipart...
  • 1 record: univ quebec mont...
  • 1 record: univ tsukuba fac...
  • 1 record: univ pisa dipart...
  • 1 record: univ tsukuba gra...

Author

  • 1 record: akter ari fa
  • 1 record: shatabda swakkha...
  • 1 record: calisir sinan
  • 1 record: billah h. m. mut...
  • 1 record: abdelkefi fatma
  • 1 record: liu rong
  • 1 record: ajib wessam
  • 1 record: luise marco
  • 1 record: tokadli gueliz
  • 1 record: vale zita
  • 1 record: morais hugo
  • 1 record: yicheng peng
  • 1 record: praca isabel
  • 1 record: mlika zoubeir
  • 1 record: chaieb cirine
  • 1 record: morita masahiko
  • 1 record: vo phuong minh
  • 1 record: hasan md tarek
  • 1 record: alkhambashi maji...
  • 1 record: bacci giacomo

Language

  • 12 records: English
Search criteria: "subject term = reinforcement learning algorithms"
12 records; showing 1-10
Model-Free Reinforcement Learning Algorithms: A Survey
27th Signal Processing and Communications Applications Conference (SIU)
Authors: Calisir, Sinan; Pehlivanoglu, Meltem Kurt
Affiliation: Kocaeli Univ, Bilgisayar Muhendisligi Bolumu, Kocaeli, Turkey
This paper aims to provide a comprehensive survey of the reinforcement learning algorithms given in the literature. Especially model-free reinforcement learning algorithms are given in details and the differences of t...
Research on big data anomaly mining method for power grid operation and maintenance based on reinforcement learning algorithm
9th International Forum on Electrical Engineering and Automation (IFEEA)
Author: Wen, Xing
Affiliation: CSG EHV Power Transmiss Co, Guangzhou, Guangdong, Peoples R China
As the scale of development of power grids continues to expand, the issue of their safe operation has received much attention. The problem of low accuracy in the face of network attacks exists in the big data anomaly ...
Adaptive Tabu Dropout for Regularization of Deep Neural Networks
29th International Conference on Neural Information Processing
Authors: Hasan, Md Tarek; Akter, Ari Fa; Shamael, Mohammad Nazmush; Hossain, Md Al Emran; Billah, H. M. Mutasim; Islam, Sumayra; Shatabda, Swakkhar
Affiliation: United Int Univ, Dept Comp Sci & Engn, Plot 2, Madani Ave, Dhaka 1212, Badda, Bangladesh
Dropout is an effective strategy for the regularization of deep neural networks. Applying tabu to the units that have been dropped in the recent epoch and retaining them for training ensures diversification in dropout...
Essays on Return Insurance and Antitrust Issues
Author: Vo, Phuong Minh
Affiliation: University of California, Irvine
Degree: Ph.D., Doctor of Philosophy
Chapter 1 introduces a continuous-time monopoly model that considers a return policy allowing consumers to return purchased products within a specified period. The model shows that an easy return policy, allowing for ...
Application of Robotic Arm Path Planning Based on TQC Algorithm
6th IEEE International Conference on Automation, Electronics and Electrical Engineering, AUTEEE 2023
Author: Gu, Jiahui
Affiliation: Shanghai Polytechnic University, School of Computer and Information Engineering, Shanghai, China
For the slow training of the Panda robotic arm grasping and placing task in a third-party environment in the Gym simulation environment, it is proposed to use the TQC algorithm for training. Compared with DDPG algorit...
Development and Design of an Intelligent Financial Asset Management System Based on Big Data Analysis and Kubernetes
Procedia Computer Science, 2024, vol. 243, pp. 482-489
Author: Yicheng Peng
Affiliation: Mergers & Acquisitions Practice, West Monroe Partners, New York 10019, NY, USA
The rise of deep learning in the financial field has led to the integration of artificial intelligence and investment, providing users with intelligent investment decisions. However, the data volume of financial marke...
On the Optimization of User Association and Resource Allocation in HetNets With mm-Wave Base Stations
IEEE SYSTEMS JOURNAL, 2020, vol. 14, no. 3, pp. 3957-3967
Authors: Chaieb, Cirine; Mlika, Zoubeir; Abdelkefi, Fatma; Ajib, Wessam
Affiliations: Univ Quebec Montreal, Dept Comp Sci, Montreal, PQ H3C 3P8, Canada; Higher Sch Commun Tunis, Dept Appl Math Signals & Commun, El Ghazala 2083, Ariana, Tunisia
This article investigates the problem of joint user association and resource allocation, defined by the number of allocated time-slots, in hybrid heterogeneous networks with the coexistence of sub-6-GHz base stations ...
RETRACTED: Research on breakthrough and innovation of UAV mission planning method based on cloud computing-based reinforcement learning algorithm (Retracted Article)
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, vol. 37, no. 3, pp. 3285-3292
Authors: Liu, Rong; Liang, Jin; Alkhambashi, Majid
Affiliations: Nanjing Univ Aeronaut & Astronaut, UAV Res Inst, Middle & Small Size UAV Adv Tech Key Lab, Minist Ind & Informat Technol, Nanjing, Jiangsu, Peoples R China; FACRI, Sci & Technol Aircraft Control Lab, Xian, Shanxi, Peoples R China; Al Zahra Coll Women, Dept Informat Technol, Muscat, Oman
The UAV system has evolved in the direction of intelligence and autonomy. Mission planning is an important part of autonomous drone control. The issue of route planning and task assignment in drone mission planning is...
Energy-Efficient Power Control for Multiple-Relay Cooperative Networks Using Q-learning
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2015, vol. 14, no. 3, pp. 1567-1580
Authors: Shams, Farshad; Bacci, Giacomo; Luise, Marco
Affiliations: Inst Markets Technol IMT Inst Adv Studies, Dept Comp Sci & Engn, I-55100 Lucca, Italy; Univ Pisa, Dipartimento Ingn Informaz, I-56121 Pisa, Italy; Univ Pisa, Dipartimento Ingn Informaz, I-56122 Pisa, Italy
In this paper, we investigate the power control problem in a cooperative network with multiple wireless transmitters, multiple amplify-and-forward relays, and one destination. The relay communication can be either ful...
Option and Constraint Generation using Work Domain Analysis
IEEE International Conference on Systems, Man, and Cybernetics (SMC)
Authors: Tokadli, Gueliz; Feigh, Karen M.
Affiliation: Georgia Inst Technol, Sch Aerosp Engn, Atlanta, GA 30332, USA
In this paper we investigate the use of Work Domain Analysis (WDA), a technique from the field of cognitive engineering, to inform the creation of options and constraints for reinforcement learning (RL) algorithms. Th...