Which of the following algorithms will find the best course of action, based on the agent's current state, without using a model and off-policy reinforcement learning?

36. Which of the following algorithms will find the best course of action, based on the agent's current state, without using a model and off-policy reinforcement learning?

Q-learning
Markov property
State action reward state action
Deep Q neural network

Answer: A) Q-learning

Explanation:

A Q-learning algorithm will find the best course of action, based on the agent's current state, without using a model and off-policy reinforcement learning.

Comments and Discussions!

Load comments ↻

Recently Published MCQs

Which of the following algorithms will find the best course of action, based on the agent's current state, without using a model and off-policy reinforcement learning?

Top MCQs

Top Programs/Examples

Top Tutorials

About

Student's Section

Subscribe