a bug in test dataset splitting 

I noticed that there is bug in the preprocessing code for 20ng(scripts/data_20ng.py)
https://github.com/adjidieng/ETM/blob/52b090b5b2fd6fcecc6d0b2c55d03a2d893b729d/scripts/data_20ng.py#L88

missing the `idx_permute ` index convert