Reinforcement Learning: Cliff Walking

Overview

This repository implements two fundamental reinforcement learning algorithms, Q-learning and SARSA, and applies them to the Cliff Walking environment. The project explores how each algorithm learns to navigate the gridworld, avoid the cliff, and reach the goal while minimizing penalties.
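To make the environment concrete, here is a minimal sketch of a Cliff Walking step function. It is not taken from the notebook. The 4x12 grid, the -1 step reward, the -100 cliff penalty, and the reset to the start after a fall follow the common textbook setup and are assumptions here, as are the names `step`, `START`, `GOAL`, `CLIFF`, and `ACTIONS`.

```python
# Assumed layout (common textbook version): 4 rows x 12 columns,
# start at the bottom-left, goal at the bottom-right, cliff in between.
ROWS, COLS = 4, 12
START, GOAL = (3, 0), (3, 11)
CLIFF = {(3, c) for c in range(1, 11)}
ACTIONS = [(-1, 0), (0, 1), (1, 0), (0, -1)]  # up, right, down, left


def step(state, action):
    """Apply one move and return (next_state, reward, done)."""
    dr, dc = ACTIONS[action]
    # A move that would leave the grid keeps the agent at the border.
    row = min(max(state[0] + dr, 0), ROWS - 1)
    col = min(max(state[1] + dc, 0), COLS - 1)
    nxt = (row, col)
    if nxt in CLIFF:
        # Assumed penalty: -100 and the agent is sent back to the start.
        return START, -100, False
    return nxt, -1, nxt == GOAL


print(step(START, 0))  # move up from the start   -> ((2, 0), -1, False)
print(step(START, 1))  # step right into the cliff -> ((3, 0), -100, False)
```

Every ordinary move costs -1, so shorter routes earn more reward, while a single fall costs as much as 100 ordinary steps.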

Key Concepts

Parts:
- Q-learning: An off-policy algorithm that learns the optimal policy by updating each action value toward the reward plus the maximum estimated value of the next state.
- SARSA: An on-policy algorithm that updates its action values using the action it actually takes next, which can lead to safer but less aggressive strategies.
- Comparison: A detailed comparison of the paths chosen by each algorithm, highlighting differences in exploration and exploitation behaviors.
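The difference between the two algorithms comes down to one term in the update target. The sketch below shows both tabular updates side by side. The function names, the dictionary-of-lists Q-table, and the default `alpha` and `gamma` values are illustrative assumptions, not the notebook's code.

```python
def q_learning_update(Q, s, a, r, s_next, alpha=0.5, gamma=1.0):
    """Off-policy: bootstrap from the best action value in s_next."""
    target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (target - Q[s][a])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.5, gamma=1.0):
    """On-policy: bootstrap from the action actually chosen in s_next."""
    target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (target - Q[s][a])


# Tiny two-state example with two actions per state (made-up numbers).
Q1 = {"A": [0.0, 0.0], "B": [-1.0, -10.0]}
Q2 = {"A": [0.0, 0.0], "B": [-1.0, -10.0]}
q_learning_update(Q1, "A", 0, -1, "B")       # uses max(Q["B"]) = -1
sarsa_update(Q2, "A", 0, -1, "B", a_next=1)  # uses Q["B"][1] = -10
print(Q1["A"][0], Q2["A"][0])  # -1.0 -5.5
```

When the next action is an exploratory one with a poor value, SARSA folds that cost into its estimate and Q-learning ignores it. This is why SARSA tends to rate states near the cliff lower while it is still exploring.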

Tasks:
- Implement and evaluate the Q-learning algorithm.
- Implement and evaluate the SARSA algorithm.
- Compare and analyze the optimal policies derived from both algorithms.
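The three tasks above could be approached along the lines of the following self-contained sketch, which trains both algorithms with an epsilon-greedy policy and then follows each learned greedy policy from the start. The grid layout, rewards, hyperparameters (`episodes`, `alpha`, `gamma`, `eps`, `seed`), and all function names are assumptions chosen for illustration. The notebook may differ.

```python
import random

# Assumed textbook layout: 4x12 grid, cliff along the bottom row.
ROWS, COLS = 4, 12
START, GOAL = (3, 0), (3, 11)
CLIFF = {(3, c) for c in range(1, 11)}
ACTIONS = [(-1, 0), (0, 1), (1, 0), (0, -1)]  # up, right, down, left


def step(state, action):
    """Return (next_state, reward, done); falling sends the agent to START."""
    dr, dc = ACTIONS[action]
    nxt = (min(max(state[0] + dr, 0), ROWS - 1),
           min(max(state[1] + dc, 0), COLS - 1))
    if nxt in CLIFF:
        return START, -100, False
    return nxt, -1, nxt == GOAL


def greedy(Q, s):
    """Index of the highest-valued action in state s."""
    return max(range(4), key=lambda a: Q[s][a])


def choose(Q, s, eps, rng):
    """Epsilon-greedy action selection."""
    return rng.randrange(4) if rng.random() < eps else greedy(Q, s)


def train(method, episodes=500, alpha=0.5, gamma=1.0, eps=0.1, seed=0):
    """Train a tabular agent; method is 'sarsa' or 'q_learning'."""
    rng = random.Random(seed)
    Q = {(r, c): [0.0] * 4 for r in range(ROWS) for c in range(COLS)}
    for _ in range(episodes):
        s, done = START, False
        a = choose(Q, s, eps, rng)
        while not done:
            s2, r, done = step(s, a)
            if method == "sarsa":
                a2 = choose(Q, s2, eps, rng)     # next action is part of the target
                target = r + gamma * Q[s2][a2]
                Q[s][a] += alpha * (target - Q[s][a])
            else:
                target = r + gamma * max(Q[s2])  # greedy value, whatever is taken next
                Q[s][a] += alpha * (target - Q[s][a])
                a2 = choose(Q, s2, eps, rng)
            s, a = s2, a2
    return Q


def greedy_path(Q, max_steps=100):
    """Follow the greedy policy from START, capped at max_steps moves."""
    path, s = [START], START
    for _ in range(max_steps):
        s, _reward, done = step(s, greedy(Q, s))
        path.append(s)
        if done:
            break
    return path


for method in ("q_learning", "sarsa"):
    path = greedy_path(train(method))
    print(method, "steps:", len(path) - 1, "reached goal:", path[-1] == GOAL)
```

On this layout the shortest route takes 13 steps and runs along the cliff edge. In the textbook version of the experiment, Q-learning tends to learn that route while SARSA tends to prefer a longer path further from the cliff, but the exact paths printed here depend on the seed and hyperparameters.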

Files:
- README.md
- cliff_walking.gif
- reinforcement-learning-cliff-walking.ipynb
