From 5b375d80e13923596d0bc902a761dc78e6437ef2 Mon Sep 17 00:00:00 2001
From: Raidas Grisk <30408368+RaidasGrisk@users.noreply.github.com>
Date: Mon, 22 Jan 2018 23:00:57 +0200
Subject: [PATCH] Update README.md
---
README.md | 6 +++---
1 file changed, 3 insertions(+), 3 deletions(-)
diff --git a/README.md b/README.md
index e0805a8..12b71fd 100644
--- a/README.md
+++ b/README.md
@@ -68,7 +68,7 @@ Instead of predicting price direction, allocate the funds to buy, sell, do not e
-**Concusions after a few tries:**
+**Concusions v1:**
- Optimizing by minimizing cost cross-entropy with target labels works (i.e. predicting price direction). Optimizing by maximizing average return without target labels does not work (i.e. predicting optimal positions allocation). Because of unstable / uneven gradients maybe..?
- LSTM and CNN models suffer from overfit problem (and underfit as well) that is hard to deal with. So I'll have to filter out least important features if I want to make it work.
- Learning rate does make a big difference. Training logistic regression with really small lr converges much better. It's probably a good idea to decrease lr again after a number of iterations.
@@ -89,13 +89,13 @@ This time the idea was to:
** Plot example of a few features after normalization
**

-After trying multiple ways of combining the features polynomials and PCA, it seems that this approach did not increase the accuracy of the model. Just for feature reference I unclude best ROC scores I was able to reach using this approach.
+After trying multiple ways of combining the features polynomials and PCA, it seems that this approach did not increase the accuracy of the model. Just for future reference I unclude best ROC scores I was able to reach using this approach.
** Receiver operating curve
**

-**Final concusion for now:**
+**Conclusions v2:**
1. Given only price and volume data, predicting price direction is not really accurate.
2. For predictions to be reasonable more features are needed. For instance sentiment data, other macroeconomic data or whatever.