Mathematics
Function Value
100%
Linear Convergence
100%
Regularization
50%
Numerical Experiment
50%
Global Solution
50%
Markov Decision Process
50%
Structural Property
50%
Engineering
Reinforcement Learning
100%
Value Function
50%
Applicability
25%
Maximization
25%
Optimization Technique
25%
Structural Property
25%
Numerical Experiment
25%
Regularization
25%
Target Value
25%
Markov Decision Process
25%
Computer Science
Reinforcement Learning
100%
Function Value
50%
Optimization Technique
25%
Algorithm Converges
25%
Structural Property
25%
Regularization Term
25%
Markov Decision Process
25%
Learning Rate
25%
Large-Scale Optimization
25%
Optimization Policy
25%
Keyphrases
Policy Mirror Descent
100%
Safety Resources
25%
Cognizance
25%
Convergence Feature
25%