Reinforcement Learning MCQs
41. Which of the following describes a learning algorithm that evaluates and improves a policy different from the policy used for action selection?
- Behavior policy
- Target policy
- On-policy
- Off-policy
Answer: D) Off-policy
Explanation:
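An off-policy learning algorithm evaluates and improves one policy (the target policy) while a different policy (the behavior policy) selects the actions. An on-policy algorithm, by contrast, evaluates and improves the same policy that it uses for action selection.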
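The distinction can be illustrated with a minimal sketch of the two standard one-step update targets, Q-learning (off-policy) and SARSA (on-policy). The Q-table values, reward, discount factor, and function names below are hypothetical and chosen only for illustration.

```python
# Hypothetical Q-table: q[state][action], two actions per state.
q = {
    0: [0.0, 0.0],
    1: [1.0, 5.0],
}


def q_learning_target(q, reward, next_state, gamma):
    """Off-policy target: bootstrap from the greedy (target-policy) action,
    regardless of which action the behavior policy takes next."""
    return reward + gamma * max(q[next_state])


def sarsa_target(q, reward, next_state, next_action, gamma):
    """On-policy target: bootstrap from the action the acting policy
    actually selected in the next state."""
    return reward + gamma * q[next_state][next_action]


reward, gamma = 1.0, 0.9
next_state = 1
next_action = 0  # assume the behavior policy explored and picked action 0

off_policy_target = q_learning_target(q, reward, next_state, gamma)
on_policy_target = sarsa_target(q, reward, next_state, next_action, gamma)

print("Q-learning (off-policy) target:", off_policy_target)
print("SARSA (on-policy) target:", on_policy_target)
```

Here the off-policy target is the larger of the two because it uses the best action value in the next state, while the on-policy target uses the value of the exploratory action that was actually chosen.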