Home »
MCQs »
Reinforcement Learning MCQs
Among On-policy and off-policy, which of the following target policy is not equal to behavior policy?
42. Among On-policy and off-policy, which of the following target policy is not equal to behavior policy?
- On-policy
- Off-policy
Answer: B) Off-policy
Explanation:
In an off-policy learning algorithm target policy is not equal to behavior policy.