ALGORITHMIC FRAMEWORK FOR MODEL-BASED DEEP REINFORCEMENT LEARNING WITH THEORETICAL GUARANTEES