Toxicity-Classification

This is the finetuning code for toxicity classification of text from university domain. I finetuned 3 model:

Note that using VNCoreNLP to segment the data will yield better results with PhoBERT since this is how the model was pre-trained.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Phobert		Phobert
Phobert_segment		Phobert_segment
Visobert		Visobert
README.md		README.md
VNCoreNLP.ipynb		VNCoreNLP.ipynb
classification.ipynb		classification.ipynb
greedy_input_reduction.ipynb		greedy_input_reduction.ipynb
merge_data.ipynb		merge_data.ipynb
phosegment.py		phosegment.py

Provide feedback