×
验证码:
换一张
忘记密码?
记住我
CORC
首页
科研机构
检索
知识图谱
申请加入
托管服务
登录
注册
在结果中检索
科研机构
自动化研究所 [13]
合肥物质科学研究院 [1]
内容类型
期刊论文 [13]
会议论文 [1]
发表日期
2023 [14]
×
知识图谱
CORC
开始提交
已提交作品
待认领作品
已认领作品
未提交全文
收藏管理
QQ客服
官方微博
反馈留言
浏览/检索结果:
共14条,第1-10条
帮助
限定条件
发表日期:2023
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
发表日期升序
发表日期降序
提交时间升序
提交时间降序
题名升序
题名降序
作者升序
作者降序
Large sequence models for sequential decision-making: a survey
期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:
Wen, Muning
;
Lin, Runji
;
Wang, Hanjing
;
Yang, Yaodong
;
Wen, Ying
收藏
  |  
浏览/下载:6/0
  |  
提交时间:2023/11/17
sequential decision-making
sequence modeling
the Transformer
training system
Relay Hindsight Experience Replay: Self-guided continual reinforcement learning for sequential object manipulation tasks with sparse rewards
期刊论文
NEUROCOMPUTING, 2023, 卷号: 557
作者:
Luo, Yongle
;
Wang, Yuxin
;
Dong, Kun
;
Zhang, Qiang
;
Cheng, Erkang
收藏
  |  
浏览/下载:8/0
  |  
提交时间:2023/11/10
Deep reinforcement learning
Robotic manipulation
Continual learning
Hindsight experience replay
Sparse reward
Learning for Depth Control of a Robotic Penguin: A Data-Driven Model Predictive Control Approach
期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 卷号: 70, 期号: 11, 页码: 11422-11432
作者:
Pan, Jie
;
Zhang, Pengfei
;
Wang, Jian
;
Liu, Mingxin
;
Yu, Junzhi
收藏
  |  
浏览/下载:8/0
  |  
提交时间:2023/11/17
Data-driven model predictive control (MPC)
depth control
motion control
reinforcement learning (RL)
robotic penguin
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning
期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:
Chai, Jiajun
;
Zhu, Yuanheng
;
Zhao, Dongbin
收藏
  |  
浏览/下载:3/0
  |  
提交时间:2023/11/16
Large-scale multiagent
neighboring communication
reinforcement learning (RL)
variational information flow
Data Generation Feedback Relearning Control for Unmodeled Nonlinear Systems
期刊论文
IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2023, 页码: 12
作者:
Zhang, Yong
;
Mu, Chaoxu
;
Zhao, Dongbin
收藏
  |  
浏览/下载:7/0
  |  
提交时间:2023/11/16
Data models
Real-time systems
Heuristic algorithms
Mathematical models
Adaptation models
Approximation algorithms
Cost function
Data generation model
feedback relearning control
delayed neural network
reinforcement learning
unmodeled nonlinear system
A Data-Driven Iterative Learning Approach for Optimizing the Train Control Strategy
期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 卷号: 19, 期号: 7, 页码: 7885-7893
作者:
Su, Shuai
;
Zhu, Qingyang
;
Liu, Junqing
;
Tang, Tao
;
Wei, Qinglai
收藏
  |  
浏览/下载:2/0
  |  
提交时间:2023/11/17
Deep reinforcement learning (RL)
driving strategy
energy-efficient train control (EETC)
soft actorcritic (SAC)
Pseudo Value Network Distillation for High-Performance Exploration
会议论文
澳大利亚, 2023-06
作者:
Zhao EM(赵恩民)
;
Xing JL(兴军亮)
;
Li K(李凯)
;
Kang YX(康永欣)
;
Tao P(陶品)
收藏
  |  
浏览/下载:9/0
  |  
提交时间:2023/06/28
Efficient Accelerator/Network Co-Search with Circular Greedy Reinforcement Learning
期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, 2023, 页码: 1-5
作者:
Liu, Zejian
;
Li, Gang
;
Cheng, Jian
收藏
  |  
浏览/下载:10/0
  |  
提交时间:2023/06/19
Accelerator/Network Co-Search
Reinforcement Learning
Performance Estimation
Multi-objective Optimization
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning
期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 318-334
作者:
Wai-Chung Kwan, Hong-Ru Wang, Hui-Min Wang, Kam-Fai Wong
收藏
  |  
浏览/下载:5/0
  |  
提交时间:2023/05/29
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process
Privacy Preserving Demand Side Management Method via Multi-Agent Reinforcement Learning
期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 10, 页码: 1984-1999
作者:
Feiye Zhang
;
Qingyu Yang
;
Dou An
收藏
  |  
浏览/下载:8/0
  |  
提交时间:2023/09/07
Centralized training and decentralized execution
demand side management
multi-agent reinforcement learning
privacy preserving
©版权所有 ©2017 CSpace - Powered by
CSpace