Skip to content

Introduction to RL in Depth

goal

get deeper into the concepts. learn new math that are usfull for RL

overview

value iteration

policy iterration

liptishtness

contraction mapping

DDPG

natural gradient

policy gradient

kl

TRPO theory

SAC theory

conceteration bound

hofdding

regret bound

ucb

notaion

here is a the notation we will use thgoht this part:

prequestions

you may need to know the follwing concepts two better undestad this part: