×
验证码:
换一张
忘记密码?
记住我
CORC
首页
科研机构
检索
知识图谱
申请加入
托管服务
登录
注册
在结果中检索
科研机构
自动化研究所 [133]
沈阳自动化研究所 [7]
清华大学 [2]
长春光学精密机械与物... [2]
数学与系统科学研究院 [2]
山东大学 [2]
更多...
内容类型
期刊论文 [133]
会议论文 [11]
学位论文 [8]
发表日期
2023 [5]
2022 [9]
2021 [11]
2020 [13]
2019 [9]
2018 [20]
更多...
×
知识图谱
CORC
开始提交
已提交作品
待认领作品
已认领作品
未提交全文
收藏管理
QQ客服
官方微博
反馈留言
浏览/检索结果:
共152条,第1-10条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
作者升序
作者降序
题名升序
题名降序
发表日期升序
发表日期降序
提交时间升序
提交时间降序
Peer Incentive Reinforcement Learning for Cooperative Multiagent Games
期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 15, 期号: 4, 页码: 623-636
作者:
Zhang, Tianle
;
Liu, Zhen
;
Pu, Zhiqiang
;
Yi, Jianqiang
收藏
  |  
浏览/下载:2/0
  |  
提交时间:2024/02/22
Cooperative multiagent games
intrinsic reward
multiagent reinforcement learning (MARL)
Starcraft II Micromanagement
CASOG: Conservative Actor–Critic With SmOoth Gradient for Skill Learning in Robot-Assisted Intervention
期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 页码: 10
作者:
Li, Hao
;
Zhou, Xiao-Hu
;
Xie, Xiao-Liang
;
Liu, Shi-Qi
;
Feng, Zhen-Qiu
收藏
  |  
浏览/下载:6/0
  |  
提交时间:2024/02/22
Deep neural network
offline reinforcement learning
robot-assisted intervention
vascular robotic system
Approximate Dynamic Programming for Event-Driven H-8 Constrained Control
期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 11
作者:
Yang, Xiong
;
Xu, Mengmeng
;
Wei, Qinglai
收藏
  |  
浏览/下载:0/0
  |  
提交时间:2023/11/17
Approximate dynamic programming (ADP)
event-driven control
H(8)control
input constraint
optimal control
基于深度强化学习的网约车调度算法研究
学位论文
2023
作者:
习金浩
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2023/06/08
Vehicle Repositioning
Deep Reinforcement Learning
Hierarchical Reinforcement Learning
Graph Neural Network
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems
会议论文
Queensland, Australia, June 18-23, 2023
作者:
Zhang TL(张天乐)
;
Liu Z(刘振)
;
Pu ZQ(蒲志强)
;
Yi JQ(易建强)
;
Ai XL(艾晓琳)
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2023/06/12
Discrete soft actor-critic with auto-encoder on vascular robotic system
期刊论文
ROBOTICA, 2022, 页码: 12
作者:
Li, Hao
;
Zhou, Xiao-Hu
;
Xie, Xiao-Liang
;
Liu, Shi-Qi
;
Gui, Mei-Jiang
收藏
  |  
浏览/下载:29/0
  |  
提交时间:2023/01/09
surgical robots
vascular robotic system
automation
reinforcement learning
deep neural network
Monte Carlo-based reinforcement learning control for unmanned aerial vehicle systems
期刊论文
NEUROCOMPUTING, 2022, 卷号: 507, 页码: 282-291
作者:
Wei, Qinglai
;
Yang, Zesheng
;
Su, Huaizhong
;
Wang, Lijian
收藏
  |  
浏览/下载:39/0
  |  
提交时间:2022/09/19
Reinforcement learning
Adaptive dynamic programming (ADP)
UAV control
Monte Carlo simulation
Neural networks
Model-Free Reinforcement Learning by Embedding an Auxiliary System for Optimal Control of Nonlinear Systems
期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 4, 页码: 1520-1534
作者:
Xu, Zhenhui
;
Shen, Tielong
;
Cheng, Daizhan
收藏
  |  
浏览/下载:15/0
  |  
提交时间:2022/06/21
Mathematical model
Trajectory
Heuristic algorithms
Optimal control
System dynamics
Artificial neural networks
Convergence
Approximate optimal control design
auxiliary trajectory
completely model-free
integral reinforcement learning (IRL)
Attention Enhanced Reinforcement Learning for Multi agent Cooperation
期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 15
作者:
Pu, Zhiqiang
;
Wang, Huimu
;
Liu, Zhen
;
Yi, Jianqiang
;
Wu, Shiguang
收藏
  |  
浏览/下载:23/0
  |  
提交时间:2022/06/06
Training
Reinforcement learning
Games
Scalability
Task analysis
Standards
Optimization
Attention mechanism
deep reinforcement learning (DRL)
graph convolutional networks
multi agent systems
Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game
期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 2, 页码: 879-892
作者:
Wei, Qinglai
;
Zhu, Liao
;
Song, Ruizhuo
;
Zhang, Pinjia
;
Liu, Derong
收藏
  |  
浏览/下载:38/0
  |  
提交时间:2022/03/17
Heuristic algorithms
Nonlinear systems
Optimal control
Mathematical model
Dynamic programming
Games
Adaptive systems
Adaptive dynamic programming (ADP)
globalized dual-heuristic dynamic programming (GDHP)
multiplayer nonzero-sum game (MP-NZSG)
neural network (NN)
©版权所有 ©2017 CSpace - Powered by
CSpace