Reinforcement Learning for Optimal Stopping Problem

This repository is for CP3106 at the National University of Singapore, which uses Rainbow DQN, a simple reinforcement learning algorithm, to solve an Optimal Stopping problem.

Project Background

Some work have demonstrated that Dollar Cost Averaging (DCA) is a useful investing strategy that can help deal with uncertain markets by making purchases automatic. DCA involves investing the same amount of money in a target security at regular intervals over a certain period of time, regardless of price, as shown below.

The primary objective of this project is to formalize finding a day with a lower investment cost during each investment cycle as an Optimal Stopping problem. We design a custom OpenAI Gym style Reinforcement Learning environment and propose a solution based on Rainbow-DQN.

Repository Structure:

Crypto Data:
- Scripts for obtaining up-to-date cryptocurrency data and create a temporary file for experiments.
- Instructions:
  1. Place the ChromeDriver in the same directory.
  2. Execute the following to fetch crypto data from CoinMarketCap:
```
python .\CoinData.py --sd 20221110 --ed 20221211 --item 100 --save ./Data/
```
  3. Generate the temporary .pkl file to expedite back-testing:
```
python .\GetTempData.py --sd 20221110 --ed 20221211 --index 10 --save_name Data
```
- The CryptoData.zip encompasses cryptocurrency data from 2013.04.28 to 2022.11.29 in .csv format.
Back-testing:
- Scripts to derive back-testing results using the DCA method.
- Execution:
```
python Back_Testing_with_DCA.py
```
  Ensure Data.pkl exists in the ./Data folder before running.
- Major functions:
  - ShowEffectiveness: Constructs the starting date list and triggers the strategy function.
  - AtomStrategy: Purchases a single cryptocurrency.
  - IndexStrategy: Buys multiple cryptocurrencies based on market capitalization.
RL Environment:
- Contains an OpenAI Gym-based reinforcement learning environment tailored for the optimal stopping problem.
- Environment registration instructions for CryptoEnv can be found here.
Rainbow DQN:
- Includes scripts for the Rainbow DQN agent's training, evaluation, and result visualization.
- Main script: rainbow.py
- Prior to execution:
  - Register the gym environment CryptoEnv-v0.
  - Install required packages: pip install -r requirements.txt.
- Execute with:
```
python rainbow.py --ExpID BTC_Exp_1 --frames 30000 --name 0 --wnd 30 --cycle 9 --memory_size 10000 --batch_size 128 --target_update 100 --gamma 0.95 --v_min 0 --v_max 20 --atom_size 51 --n_step 3 --data_-path 'path/to/price/data' --mode 0
```
  You can obtain detailed descriptions and default values of these command-line parameters by running the following command: python rainbow.py -h. Ensure to adjust these parameters based on your specific needs and dataset.
- After training, logs are available in ./logs. For visualization:
```
rl_plotter --show --save --avg_group --shaded_std --style default --title "Episode score v.s timesteps" --legend_outside --no_legend_group_num --resample 4096
```
Presentation and Report:
- Includes the presentation slides and the project's final report.

How to Register the Environment `CryptoEnv`:

First, install gym: pip install gym.
Determine the gym installation path:
- If uncertain, re-run pip install gym.
Navigate to ./gym/envs/.
Create a new directory named user.
Transfer CryptoEnv.py to the user directory.

In user directory, set up __init__.py, include:

from gym.envs.user.CryptoEnv import CryptoEnv

Return to the parent directory: ./gym/envs/.

Modify __init__.py to include:

register(
	id='CryptoEnv-v0',
	entry_point='gym.envs.user:CryptoEnv',
)

The expected directory hierarchy is:

|-- path to gym
|    |-- envs
|    |   |-- user
|    |   |   |-- CryptoEnv.py
|    |   |   |-- __init__.py
|    |   |-- __init__.py

Name		Name	Last commit message	Last commit date
Latest commit History 42 Commits
Back-testing		Back-testing
CryptoData		CryptoData
Presentation		Presentation
RL_Environment		RL_Environment
Rainbow_DQN		Rainbow_DQN
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning for Optimal Stopping Problem

Project Background

Repository Structure:

How to Register the Environment `CryptoEnv`:

References:

About

Releases

Packages

Languages

wshanmu/CP3106-RL-for-Optimal-Stopping

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning for Optimal Stopping Problem

Project Background

Repository Structure:

How to Register the Environment CryptoEnv:

References:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

How to Register the Environment `CryptoEnv`:

Packages