Optimal Estimation of Policy Gradient via Double Fitted Iteration

Chengzhuo Ni, Ruiqi Zhang, Xiang Ji, Xuezhou Zhang, Mengdi Wang

Research output: Contribution to journalConference articlepeer-review

Fingerprint

Dive into the research topics of 'Optimal Estimation of Policy Gradient via Double Fitted Iteration'. Together they form a unique fingerprint.

Mathematics

Engineering & Materials Science