Reinforcement Learning MCQs
40. Which of the following describes a learning algorithm in which the same policy is both evaluated and improved?
A) Behavior policy
B) Target policy
C) On-policy
D) Off-policy
Answer: C) On-policy
Explanation:
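In on-policy learning, the policy that is evaluated and improved is the same policy that selects the agent's actions. SARSA is a standard example. By contrast, an off-policy method such as Q-learning learns about a target policy while the agent follows a separate behavior policy.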
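The difference can be seen in a single tabular update step. The following is a minimal sketch, not taken from the original question: the function names, the dictionary-based Q-table `q`, and the default values of `alpha` and `gamma` are all assumptions chosen for illustration. It contrasts the SARSA update (on-policy) with the Q-learning update (off-policy).

```python
def sarsa_update(q, s, a, r, s_next, a_next, alpha=0.5, gamma=0.9):
    """On-policy (SARSA): bootstrap from a_next, the action the
    current policy actually selected in the next state."""
    target = r + gamma * q[(s_next, a_next)]
    q[(s, a)] += alpha * (target - q[(s, a)])


def q_learning_update(q, s, a, r, s_next, n_actions, alpha=0.5, gamma=0.9):
    """Off-policy (Q-learning): bootstrap from the greedy action in the
    next state, regardless of what the behavior policy does next."""
    target = r + gamma * max(q[(s_next, b)] for b in range(n_actions))
    q[(s, a)] += alpha * (target - q[(s, a)])


# Hypothetical two-state, two-action Q-table (illustrative values only).
q_on = {(0, 0): 0.0, (0, 1): 0.0, (1, 0): 1.0, (1, 1): 2.0}
q_off = dict(q_on)

# Same transition for both: state 0, action 0, reward 1.0, next state 1.
# The policy then picks action 0 in state 1, which is not the greedy action.
sarsa_update(q_on, s=0, a=0, r=1.0, s_next=1, a_next=0)
q_learning_update(q_off, s=0, a=0, r=1.0, s_next=1, n_actions=2)
```

In this sketch, SARSA's target depends on the action the policy really took next, so it evaluates the same policy it is following. Q-learning's target uses the greedy action instead, so the two updates diverge whenever the behavior is exploratory.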