CORC  > 厦门大学  > 信息技术-已发表论文
基于KL距离的交互式动态影响图近似算法; Approximate algorithm of interactive dynamic influence diagrams based on KL distance
田乐 ; 罗键 ; 曹浪财 ; 陈志平
2013-01-15
关键词多Agent决策 交互式动态影响图 行为等价 近似行为等价 Kullback-Leibler(KL)距离 multi-Agent decision interactive dynamic influence diagram(I-DID) behavioral equivalence approximate behavioral equivalence Kullback-Leibler(KL)distance
英文摘要交互式动态影响图(InTErACTIVE dynAMIC InfluEnCE dIAgrAMS,I-dIdS)状态空间太大,候选模型的数量随时间变化而呈指数倍增长。针对其备受计算量困扰的问题,提出一种利用近似行为等价原理与区别模型更新算法(dISCrIMInATIVE MOdEl uPdATES,dMu)相结合的近似算法。首先给出了基于kullbACk-lEIblEr(kl)距离模型行为等价和近似行为等价的定义,然后基于kl距离和候选模型的动作对候选模型聚类,自上而下合并策略树形成策略图,最后利用dMu算法进行求解。仿真结果表明,相对于传统的dMu算法,所提近似算法能显著降低候选模型的数量,提高I-dIdS的效率,对I-dIdS的理论及应用研究具有参考价值。; The model space of interactive dynamic influence diagrams(I-DIDs)is too large and the number of candidate models grows exponentially with the number of time steps.To deal with the high calculation cost issue,a method of solving I-DIDs approximately that combines approximate behavioral principle and discriminative model update algorithm(DMU)is proposed.First,a new definition of behavior equivalence and approximate behavior equivalence of models are presented.Then the candidate models based on the Kullback-Leibler(KL)distance and the action of candidate models are clustered.Afterwards,the top to bottom method is used to merge policy trees into policy graphs.Finally,I-DIDs are solved by using the approach of DMU.The simulation results show that the approximated algorithm can dramatically decrease the number of candidate model and improve the efficiency compared with the traditional DMU algorithm.This research work should be valuable in the research and application of I-DIDs.; 国家自然科学基金(60975052)资助课题
语种zh_CN
内容类型期刊论文
源URL[http://dspace.xmu.edu.cn/handle/2288/123009]  
专题信息技术-已发表论文
推荐引用方式
GB/T 7714
田乐,罗键,曹浪财,等. 基于KL距离的交互式动态影响图近似算法, Approximate algorithm of interactive dynamic influence diagrams based on KL distance[J],2013.
APA 田乐,罗键,曹浪财,&陈志平.(2013).基于KL距离的交互式动态影响图近似算法..
MLA 田乐,et al."基于KL距离的交互式动态影响图近似算法".(2013).
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace