Text Classification (Sentiment Analysis)

Datasets

Three datasets with different levels of label granularity are used, covering both English and Chinese.

MR: An English Movie Review dataset with a one-sentence review per movie. The classification includes two classes: positive and negative.
COVID_Chinese: A Chinese dataset consisting of Weibo posts published during COVID-19, from January 1st to February 20th. It is a multimodal dataset with text, pictures and videos, but only the text was used in this project. The classification includes three classes: positive, neutral and negative.
SST-5: An English fine-grained sentiment classification dataset from the Stanford Sentiment Treebank. Data is provided at the phrase level, so the sentences obtained after data transformation were used for training and testing. The classification includes five classes: very negative, negative, neutral, positive and very positive.
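
The SST-5 entry above mentions a data transformation step. As a rough illustration only, the sketch below shows one part of such a step: bucketing a sentiment score in [0, 1] into the five classes. It assumes the cutoffs commonly documented for the Stanford Sentiment Treebank (0.2, 0.4, 0.6, 0.8). The function name `score_to_sst5_label` and the constant `SST5_LABELS` are hypothetical, and the preprocessing actually used in this repository may differ.

```python
# Hypothetical helper, not taken from this repository.
# Class names in the order listed in the SST-5 description.
SST5_LABELS = ["very negative", "negative", "neutral", "positive", "very positive"]

# Assumed upper bounds of the five buckets:
# [0, 0.2], (0.2, 0.4], (0.4, 0.6], (0.6, 0.8], (0.8, 1.0]
SST5_UPPER_BOUNDS = (0.2, 0.4, 0.6, 0.8, 1.0)


def score_to_sst5_label(score: float) -> int:
    """Map a sentiment score in [0, 1] to a class index from 0 to 4."""
    if not 0.0 <= score <= 1.0:
        raise ValueError(f"score must be in [0, 1], got {score}")
    for index, upper in enumerate(SST5_UPPER_BOUNDS):
        if score <= upper:
            return index
    return len(SST5_UPPER_BOUNDS) - 1  # unreachable for valid input


if __name__ == "__main__":
    for value in (0.05, 0.5, 0.95):
        print(value, SST5_LABELS[score_to_sst5_label(value)])
```

Keeping the labels as integer indices, with the names in a separate list, makes the same function usable for both training targets and human-readable reports.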
