We present multi-agent A* (MAA*), the first complete and optimal heuristic search algorithm for solving decentralized partially-observable Markov decision problems (DEC-POMDPs) with finite horizon. The algorithm is suitable for computing optimal plans for a cooperative group of agents that operate ...
基于点的值迭代算法在POMDP问题中的研究 部分可观测马尔可夫决策过程(Partially Observable Markov Decision Process,POMDP)是马尔可夫决策过程(Markov Decision Process,MDP)的扩展。在POMDP框架下,由于环境... 房俊恒 - 苏州大学 被引量: 0发表: 2016年