Skip to content

Commit bd2834e

Browse files
committed
update README
1 parent 2eb7eeb commit bd2834e

File tree

1 file changed

+5
-0
lines changed

1 file changed

+5
-0
lines changed

README.md

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -36,6 +36,11 @@ Run `tensorboard --logdir ./`.
3636

3737
Implementation notes:
3838
- WordPiece tokenizer
39+
- Cosine decay learning rate schedule
40+
- Clip gradients by global norm
41+
- Gaussian Error Linear Unit (GELU) activation
42+
- Adam with β1 = 0.9 and β2 = 0.95
43+
3944

4045
## Licence
4146
MIT

0 commit comments

Comments
 (0)