A reinforcement learning algorithm that can learn a Markov decision process policy. SARSA agents interact with the environment and update their policy based on the actions an agent takes. SARSA is known as an on-policy learning algorithm.
Login to register for events. Don’t have an account? Just register for an event and an account will be created for you!