[RL basics] Week 3. Q-learning