Randomization for Faster Exact Optimization of Discounted Markov Decision Processes

arXiv:2606.05110v1. Abstract: We provide faster deterministic and randomized algorithms for exactly solving discounted Markov Decision Processes (DMDPs). We obtain our results by efficiently reducing the computation of optimal values and policies in DMDPs to two easier tasks: policy evaluation and the computation of approximately optimal values. We provide both a straightforward deterministic reduction and a more efficient randomized variant; together with advances in approximately solving DMDPs, these reductions yield our results.
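
The two primitives named in the abstract, policy evaluation and approximately optimal values, can be illustrated on a toy problem. The sketch below is not the paper's reduction: it only shows the standard observation that a sufficiently accurate approximate value vector yields a greedy policy whose exact evaluation recovers the optimal values. The two-state DMDP, the names `approx_values`, `greedy_policy`, and `evaluate_policy`, and the accuracy `eps=0.1` are all assumptions made for illustration.

```python
# Hypothetical toy DMDP (not from the paper).
# P[s][a] is a list of (next_state, probability); R[s][a] is the reward.
P = [
    [[(0, 1.0)], [(1, 1.0)]],
    [[(0, 0.5), (1, 0.5)], [(1, 1.0)]],
]
R = [
    [0.0, 1.0],
    [2.0, 0.5],
]
GAMMA = 0.9


def q_value(P, R, gamma, v, s, a):
    """One-step lookahead value of taking action a in state s."""
    return R[s][a] + gamma * sum(p * v[t] for t, p in P[s][a])


def approx_values(P, R, gamma, eps):
    """Value iteration, stopped once the sup-norm error is at most eps.

    Uses the standard bound ||v_new - v*|| <= gamma / (1 - gamma) * ||v_new - v||.
    """
    v = [0.0] * len(P)
    while True:
        new_v = [
            max(q_value(P, R, gamma, v, s, a) for a in range(len(P[s])))
            for s in range(len(P))
        ]
        diff = max(abs(x - y) for x, y in zip(new_v, v))
        v = new_v
        if diff * gamma / (1.0 - gamma) <= eps:
            return v


def greedy_policy(P, R, gamma, v):
    """Pick, in each state, an action maximizing the one-step lookahead."""
    return [
        max(range(len(P[s])), key=lambda a: q_value(P, R, gamma, v, s, a))
        for s in range(len(P))
    ]


def evaluate_policy(P, R, gamma, policy):
    """Exact policy evaluation: solve (I - gamma * P_pi) v = r_pi."""
    n = len(P)
    # Augmented matrix [I - gamma * P_pi | r_pi].
    A = [[0.0] * (n + 1) for _ in range(n)]
    for s in range(n):
        a = policy[s]
        A[s][s] += 1.0
        for t, p in P[s][a]:
            A[s][t] -= gamma * p
        A[s][n] = R[s][a]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(A[r][col]))
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(n):
            if r != col:
                f = A[r][col] / A[col][col]
                for c in range(col, n + 1):
                    A[r][c] -= f * A[col][c]
    return [A[s][n] / A[s][s] for s in range(n)]


v_approx = approx_values(P, R, GAMMA, eps=0.1)   # coarse approximation
policy = greedy_policy(P, R, GAMMA, v_approx)    # candidate optimal policy
v_exact = evaluate_policy(P, R, GAMMA, policy)   # exact values of that policy
```

In this toy instance the action gaps are large relative to `eps`, so the greedy policy is optimal and `v_exact` satisfies the Bellman optimality equation up to floating-point error. In general the required accuracy depends on the instance, which is part of what makes an efficient exact reduction nontrivial.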
Originally published by arXiv CS