Skip to content

Commit

Permalink
Update CHANGELOGs
Browse files Browse the repository at this point in the history
  • Loading branch information
n1t0 committed Mar 2, 2020
1 parent f10f77d commit 4deeb95
Show file tree
Hide file tree
Showing 2 changed files with 7 additions and 1 deletion.
5 changes: 4 additions & 1 deletion bindings/python/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,9 @@
# v0.6.0 (not published yet)

Fixes:
## Changes:
- Big improvements in speed for BPE (Both training and tokenization) ([#165](https://github.com/huggingface/tokenizers/pull/165))

## Fixes:
- Some default tokens were missing from `BertWordPieceTokenizer` (cf [#160](https://github.com/huggingface/tokenizers/issues/160))
- There was a bug in ByteLevel PreTokenizer that caused offsets to be wrong if a char got split up
in multiple bytes. (cf [#156](https://github.com/huggingface/tokenizers/pull/156))
Expand Down
3 changes: 3 additions & 0 deletions tokenizers/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,5 +1,8 @@
# v0.8.0 (not released yet)

## Changes:
- Big improvements in speed for BPE (Both training and tokenization) ([#165](https://github.com/huggingface/tokenizers/pull/165))

## Fixes:
- Do not open all files directly while training ([#163](https://github.com/huggingface/tokenizers/issues/163))
- There was a bug in ByteLevel PreTokenizer that caused offsets to be wrong if a char got split up
Expand Down

0 comments on commit 4deeb95

Please sign in to comment.