Reinforcement learning: Dynamic Programming, Monte Carlo, Temporal Difference, Deep Q-Learning, Experience Replay, Twin Delayed DDPG (TD3). More to come: Dreamer, etc.
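
As a rough illustration of one topic listed above, experience replay can be sketched as a fixed-size buffer that stores transitions and returns uniformly sampled minibatches. This is a minimal sketch and not necessarily how this repository implements it: the class name `ReplayBuffer`, its method names, and the transition layout `(state, action, reward, next_state, done)` are all assumptions made for the example.

```python
import random
from collections import deque


class ReplayBuffer:
    """Hypothetical fixed-size transition store (illustrative, not this repo's code)."""

    def __init__(self, capacity):
        # A deque with maxlen silently drops the oldest transition when full.
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        # Store one transition as a tuple.
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform sampling without replacement breaks the temporal
        # correlation between consecutive transitions.
        batch = random.sample(list(self.buffer), batch_size)
        # Transpose a list of transitions into per-field tuples:
        # (states, actions, rewards, next_states, dones).
        return tuple(zip(*batch))

    def __len__(self):
        return len(self.buffer)
```

A typical use would be to call `push` after every environment step and start calling `sample` once `len(buffer)` exceeds the batch size; an off-policy learner such as Deep Q-Learning or TD3 would then compute its update from the sampled batch.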