A Convergent Recursive Least Squares Approximate Policy Iteration Algorithm for Multi-Dimensional Markov Decision Process with Continuous State and Action Spaces

Jun Ma, Warren Buckler Powell

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Scopus citations

Fingerprint Dive into the research topics of 'A Convergent Recursive Least Squares Approximate Policy Iteration Algorithm for Multi-Dimensional Markov Decision Process with Continuous State and Action Spaces'. Together they form a unique fingerprint.