This repository contains the implementation of my reinforcement learning final project for COMP579 at McGill University during the 2025 Winter Term. The project investigates the impact of temporal information on reinforcement learning algorithms by comparing how PPO, DQN, and A2C leverage stacked frames as input. The study focuses on the Breakout environment, analyzing training rewards, sample efficiency, and final performance across different configurations. Below is a description of the files and instructions on how to set up and run the project using Poetry.
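The frame-stacking mechanism at the heart of the study can be illustrated with a minimal standalone sketch (this is not the project's actual code — in practice an environment wrapper such as Gymnasium's frame-stack wrapper plays this role):

```python
from collections import deque

import numpy as np


class FrameStack:
    """Keep the k most recent frames and expose them as one observation,
    giving the agent short-term temporal information (e.g. the ball's
    velocity in Breakout, which a single frame cannot convey)."""

    def __init__(self, k):
        self.k = k
        self.frames = deque(maxlen=k)

    def reset(self, first_frame):
        # Fill the buffer with copies of the first frame so the
        # observation shape is constant from step 0.
        for _ in range(self.k):
            self.frames.append(first_frame)
        return self.observation()

    def step(self, frame):
        self.frames.append(frame)
        return self.observation()

    def observation(self):
        # Shape: (k, H, W) — channel-first stack of the last k frames.
        return np.stack(self.frames, axis=0)
```

Varying `k` (e.g. 1 vs. 4 stacked frames) is what lets the experiments isolate how much each algorithm benefits from temporal information.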
- `train_eval.py`: Contains the training and evaluation loop, with all customizable parameters defined inside.
- `callback.py`: Implements a custom callback function for logging during training.
- `visualization.py`: Provides functions to plot graphs and visualize experiment results.
- `custom_logger.py`: A logger utility that prints messages both to the console and to a file.
- `demo.py`: A demonstration script with human rendering of the agent's behavior.
This project uses Poetry for dependency management. Follow the steps below to set up the environment:
- Clone the repository:

  ```
  git clone git@github.com:Niamorine/RL_COMP579.git
  cd RL_COMP579
  ```

- Install Poetry if it is not already installed:

  ```
  pip install poetry
  ```

- Install the project dependencies:

  ```
  poetry install
  ```
You can change the training and evaluation parameters at the bottom of `train_eval.py`.
To run the training and evaluation loop:

```
poetry run python train_eval.py
```

To generate plots of experiment results:

```
poetry run python visualization.py
```

To run the demo with human rendering of the agent (first update the model class and model path in the script to match yours):

```
poetry run python demo.py
```

Logs are managed by the `custom_logger.py` utility, which writes messages to both the console and a log file for easier debugging and tracking.
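A console-plus-file logger like the one `custom_logger.py` provides can be sketched with the standard `logging` module (function name and log format here are assumptions; the repo's implementation may differ):

```python
import logging
import sys


def make_logger(name="rl_project", logfile="train.log"):
    """Create a logger that mirrors every message to stdout and to a file,
    so training progress survives a closed terminal session."""
    logger = logging.getLogger(name)
    logger.setLevel(logging.INFO)
    logger.handlers.clear()  # avoid duplicate handlers if called twice

    fmt = logging.Formatter("%(asctime)s %(levelname)s %(message)s")
    for handler in (logging.StreamHandler(sys.stdout),
                    logging.FileHandler(logfile)):
        handler.setFormatter(fmt)
        logger.addHandler(handler)
    return logger
```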
You can customize training parameters directly in `train_eval.py` and modify logging behavior in `callback.py` and `custom_logger.py`.
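The logging callback in `callback.py` likely follows the Stable-Baselines3 `BaseCallback` pattern; a dependency-free sketch of that pattern looks like this (class and method names are assumptions, not the project's actual code):

```python
class RewardLoggingCallback:
    """Track episode returns during training and print a running
    average every `log_every` environment steps."""

    def __init__(self, log_every=1000):
        self.log_every = log_every
        self.num_steps = 0
        self.episode_rewards = []
        self._current_return = 0.0

    def on_step(self, reward, done):
        """Called once per environment step with the latest transition."""
        self.num_steps += 1
        self._current_return += reward
        if done:
            self.episode_rewards.append(self._current_return)
            self._current_return = 0.0
        if self.num_steps % self.log_every == 0 and self.episode_rewards:
            recent = self.episode_rewards[-10:]
            print(f"step={self.num_steps} "
                  f"mean_return(last10)={sum(recent) / len(recent):.1f}")
        return True  # returning False stops training in the SB3 convention
```

In Stable-Baselines3 itself you would subclass `BaseCallback` and override `_on_step`, reading transitions from `self.locals` instead of receiving them as arguments.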