Home »
MCQs »
Reinforcement Learning MCQs
Which of the following algorithms will find the best course of action, based on the agent's current state, without using a model and off-policy reinforcement learning?
36. Which of the following algorithms will find the best course of action, based on the agent's current state, without using a model and off-policy reinforcement learning?
- Q-learning
- Markov property
- State action reward state action
- Deep Q neural network
Answer: A) Q-learning
Explanation:
A Q-learning algorithm will find the best course of action, based on the agent's current state, without using a model and off-policy reinforcement learning.