\section{Conclusion}

This paper studies the problem of computing near-optimal policies with both lower and upper bounds for POMDP MRPP. Utilizing ideas from heuristic trial-based belief exploration for discounted POMDPs,  we propose an incremental graph search algorithm that searches in the space of potentially optimal reachable beliefs. This work shows that trial-based belief exploration can be an effective methodology to obtain near-optimal policies with over- and under-approximations of reachability probabilities.