CS486 Final Project - Cubefield Q-Learning Analysis

Welcome to our CS486 final project. In this project, we implemented an AI to play the game Cubefield, in order to analyze the effectiveness of different behaviour policies for the Q-learning algorithm. All experimentation was conducted within a simplified 2-D version of the Cubefield game. We coded the game with Python (version 3) using the Pygame module, and natively implemented the sensors and policies. As a result, other than the Pygame library, there are no additional libraries that need to be installed.

Screenshot and Features

The following is a screenshot of our game. As you can see, each square is either grey (representing an obstacle), or white (representing empty space). As the squares scroll downwards, the player has the option of moving left, right, or staying in the same column in order to avoid colliding with incoming obstacles. To aid visual feedback and help us better trace the agent's travelled path, we added room below the player where green and red dots show the last 20 moves of the player. A red dot is used to represent a collision with a block, otherwise a green dot is used. These red and green dots are purely graphical and have no impact on the game's mechanics.

In addition to the option of playing the game, our team has implemented the different behaviour policies and presented the AI's gameplay simultaneously on 7 screens, as shown below:

Commands

To start the game play, you can run the command:

python3 main.py

Controls

Esc - Quit
P - Pause
Up Arrow - Speed up game
Down Arrow - Slow down game
Left Arrow - Move left (player movement only)
Right Arrow - Move right (player movement only)
PrtScr - Save screenshot

You should see Pygame start up and showcase the screenshots as shown above.

Note: This game will not run if the build system is Python 2. Python version 3.6 or higher is required since we used random.choices as part of our implementation of the softmax policy.

Required Python3 Libraries

Pygame (tested with version 1.9.6)

Name	Name	Last commit message	Last commit date
Latest commit AllanChew Minor fixes to UI Dec 12, 2020 32c8dd3 · Dec 12, 2020 History 47 Commits
.gitignore	.gitignore	Added save_csv and positive reward function	Nov 14, 2020
README.md	README.md	Updated readme and screenshot	Dec 12, 2020
cubefield-screenshot.png	cubefield-screenshot.png	added cubefield-screenshot for readme	Nov 17, 2020
cubefield.py	cubefield.py	Resized window + added speed indicator	Dec 12, 2020
drawer.py	drawer.py	Minor fixes to UI	Dec 12, 2020
generator.py	generator.py	added a label to the UI to display total number of collisions	Nov 9, 2020
main.py	main.py	added a label to the UI to display total number of collisions	Nov 9, 2020
player.py	player.py	Added save_csv and positive reward function	Nov 14, 2020
playerhistory.py	playerhistory.py	Added support for multiple players and drawing them	Oct 20, 2020
playerstrategy.py	playerstrategy.py	Minor fixes to UI	Dec 12, 2020
prototype.py	prototype.py	Cloned repo from git.uwaterloo.ca	Oct 11, 2020
sensor.py	sensor.py	Fixed interaction bug, updated sensors and UI, added pausing	Nov 1, 2020
seven-agents-cubefield.png	seven-agents-cubefield.png	Minor fixes to UI	Dec 12, 2020
squish_csv.py	squish_csv.py	Removed unused import: csv	Dec 12, 2020
ui.py	ui.py	Minor fixes to UI	Dec 12, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CS486 Final Project - Cubefield Q-Learning Analysis

Screenshot and Features

Commands

Controls

Required Python3 Libraries

About

Releases

Packages

Contributors 3

Languages

AllanChew/Cubefield-Q-learning

Folders and files

Latest commit

History

Repository files navigation

CS486 Final Project - Cubefield Q-Learning Analysis

Screenshot and Features

Commands

Controls

Required Python3 Libraries

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages