Chapter 21: Reinforcement Learning
Markov Decision Processes and Partial Observability intermediate
"Algorithm quality cannot rescue a poorly specified state/action/reward design." -- Chapter 21
"Algorithm quality cannot rescue a poorly specified state/action/reward design." -- Chapter 21
Register to Read
Sign up for a free account to access all 112 primer topics.
Create Free AccountAlready have an account? Sign in