Accelerating Point-Based POMDP Algorithms through Successive Approximations of the Optimal Reachable Space
dc.contributor.author | HSU, David | en_US |
dc.contributor.author | LEE, Wee Sun | en_US |
dc.contributor.author | RONG, Nan | en_US |
dc.date.accessioned | 2007-06-01T09:21:41Z | en_US |
dc.date.accessioned | 2017-01-23T07:00:22Z | |
dc.date.available | 2007-06-01T09:21:41Z | en_US |
dc.date.available | 2017-01-23T07:00:22Z | |
dc.date.issued | 2007-04-29 | en_US |
dc.description.abstract | Point-based approximation algorithms have drastically im-proved the speed of POMDP planning. This paper presents a new point-based POMDP algorithm called SARSOP. Like earlier point-based algorithms, SARSOP performs value iter-ation at a set of sampled belief points; however, it focuses on sampling near the space reachable from an initial belief point under the optimal policy. Since neither the optimal policynor the optimal reachable space is known in advance, SARSOP builds successive approximations to it through sampling and pruning. In our experiments, the new algorithm solved dif-.cult POMDP problems with more than 10,000 states. Its running time is competitive with the fastest existing point-based algorithm on most problems andfasterby manytimes on some. Our approach is complementary to existing point-based algorithms and can be integrated with them to improve their performance. | en_US |
dc.format.extent | 902129 bytes | en_US |
dc.format.mimetype | application/pdf | en_US |
dc.identifier.uri | https://dl.comp.nus.edu.sg/xmlui/handle/1900.100/2552 | en_US |
dc.language.iso | en | en_US |
dc.relation.ispartofseries | TRA4/07 | en_US |
dc.title | Accelerating Point-Based POMDP Algorithms through Successive Approximations of the Optimal Reachable Space | en_US |
dc.type | Technical Report | en_US |