On The Road From Model-Based Dynamical Programming To Model-Free Reinforcement Learning