Reinforcement learning theory

Posted: 2016-10-22, Modified: 2016-10-24

Tags: reinforcement learning

Figure out what’s provably known about RL!

Known model

Unknown model

Parametrized policy

Suppose the loss (negative payout) is convex in the policy parameters. But why would this ever be the case?

Or: the learner has to decide between several experts.
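For the experts setting, one standard candidate is the Hedge (multiplicative weights) algorithm. The notes do not say which algorithm is intended, so this is only a minimal sketch under assumptions: losses lie in [0, 1], each "expert" is a fixed policy whose per-round loss is fully observed, and the names `hedge`, `loss_rounds`, and `eta` are made up for illustration.

```python
import math


def hedge(loss_rounds, eta):
    """Run Hedge over a sequence of per-expert loss vectors.

    loss_rounds[t][i] is the loss of expert i in round t (assumed in [0, 1]).
    eta is the learning rate (assumed > 0).
    Returns (expected cumulative loss of the learner, final distribution).
    """
    n = len(loss_rounds[0])
    weights = [1.0] * n  # start from the uniform distribution
    total_loss = 0.0
    for losses in loss_rounds:
        z = sum(weights)
        probs = [w / z for w in weights]
        # Expected loss when an expert is sampled from probs.
        total_loss += sum(p * l for p, l in zip(probs, losses))
        # Exponentially down-weight experts in proportion to their loss.
        weights = [w * math.exp(-eta * l) for w, l in zip(weights, losses)]
    z = sum(weights)
    return total_loss, [w / z for w in weights]


# Toy run: expert 0 never incurs loss, experts 1 and 2 always do.
T, n = 50, 3
rounds = [[0.0, 1.0, 1.0] for _ in range(T)]
eta = math.sqrt(8 * math.log(n) / T)
learner_loss, final_probs = hedge(rounds, eta)
```

With losses in [0, 1], the regret against the best fixed expert is at most ln(n)/eta + eta*T/8, which is sqrt(T ln(n) / 2) for the learning rate chosen above. Note that this assumes full feedback; in RL proper only the chosen policy's payout is observed, which is a bandit-style setting and needs a different algorithm.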

References