2048 DQN

Deep Q-learning agent playing 2048

(Image: example game result)

Prerequisites

  • Python 3.6

It's recommended to use conda:

conda create --name tf-cpu tensorflow python=3.6

or the GPU version:

conda create --name tf-gpu tensorflow-gpu
conda install --file requirements.txt
pip3 install recordclass

You may also use pip directly:

pip3 install -r requirements.txt
pip3 install recordclass

On Linux, you may have to install python3-tk manually for the tkinter library
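
On Debian-based distributions, for example:

sudo apt-get install python3-tk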

Usage

Use the training_cli.py script to:

  • train a random or a specific agent configuration
  • render the game while training
  • find the agent with the maximum average score
  • watch the best agent play

python3 training_cli.py --help

Description

Implementation of a deep Q-learning agent for playing the game 2048
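
For reference, here is a minimal sketch of such a Q-network in tf.keras, matching the 256_256_256, linear-output, Adamax/mae configuration described under "Meaning of directory names" below; the hidden-layer activation (relu) and the function name are assumptions, not the repository's exact code:

import tensorflow as tf

def build_q_network():
    # Sketch only: three fully connected layers of 256 units and a linear
    # output head with one Q-value per move (up, down, left, right).
    # Input is the flattened 4x4 board.
    model = tf.keras.Sequential([
        tf.keras.Input(shape=(16,)),
        tf.keras.layers.Dense(256, activation="relu"),  # relu is an assumption
        tf.keras.layers.Dense(256, activation="relu"),
        tf.keras.layers.Dense(256, activation="relu"),
        tf.keras.layers.Dense(4, activation="linear"),
    ])
    # Adamax optimizer and mae loss, as in the example configuration name
    model.compile(optimizer="adamax", loss="mae")
    return model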

Results

Each agent is evaluated by playing 25000 game steps

The best agent reaches a tile sum of up to 2804 and a maximum tile of 2048, with an average board sum of ~1100

The random agent's maximum tile is 256, with an average board sum of 250

Meaning of directory names

ill0_em015_Adamax_mae_256_256_256_linear_batch_8_tUpdF100_learnF30_div_by_max_ddq_1_trm_epsC10000_RMDC1000000_dry1

ill0 - illegal moves are not allowed
em015 - minimum epsilon = 0.15
Adamax - neural network optimizer
256_256_256 - sizes of the fully connected hidden layers
linear - output activation function
batch_8 - number of replay memory samples per training
tUpdF100 - update the target network (double Q-learning) every 100 trainings
learnF30 - train every 30 game steps
div_by_max - state mapping function; this one divides all tiles by the current maximum
ddq_1 - use double Q-learning
trm - reward function type, time reward minus (see helper_functions.py for details)
epsC10000 - epsilon constant
RMDC1000000 - replay memory type, Replay Memory Dynamic Crucial with 1e6 max capacity
dry1 - dry training mode: in each state, try all moves and remember their outcomes
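
For illustration, minimal sketches of three of these components; the function names, the epsilon schedule, and the discount factor are assumptions, and the repository's exact formulas live in its code (e.g. helper_functions.py):

import numpy as np

def div_by_max(board):
    # div_by_max state mapping: divide all tiles by the current maximum tile
    m = board.max()
    return board / m if m > 0 else board

def epsilon(step, eps_min=0.15, eps_const=10_000):
    # One plausible decay built from em (minimum eps) and epsC (epsilon
    # constant); the actual schedule in the repository may differ
    return max(eps_min, eps_const / (eps_const + step))

def double_q_target(reward, next_state, online_net, target_net, gamma=0.99):
    # Double Q-learning (ddq_1): the online network picks the best next move,
    # the target network (refreshed every tUpdF trainings) evaluates it.
    # online_net and target_net map a state to an array of 4 Q-values.
    best_move = int(np.argmax(online_net(next_state)))
    return reward + gamma * float(target_net(next_state)[best_move])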