RL-ChatMate

******* 🚧⚠️ PROJECT IS UNDER CONSTRUCTION ⚠️🚧 *********

RL ChatMate is an open-source project aimed at improving chatbot interactions using state-of-the-art reinforcement learning techniques. By fine-tuning language models with human feedback, the main goal is to enhance response quality, coherence, and engagement.

The project is organized into the following sections:

README.md: Read about the project and its main goal. About tokenization: a brief description of the tokenization process. Dataset tokenization: Chi-Square Test: The execution of the chi-square test and hypothesis validation.

The dataset

For this project I will be using the CommonsenseQA dataset.

CommonsenseQA is a new multiple-choice question answering dataset that requires different types of commonsense knowledge to predict the correct answers . It contains 12,102 questions with one correct answer and four distractor answers. The dataset is provided in two major training/validation/testing set splits: "Random split" which is the main evaluation split, and "Question token split", see paper for details.

Originals autors: Alon Talmor, Jonathan Herzig, Nicholas Lourie, Jonathan Berant

From: https://arxiv.org/abs/1811.00937 arXiv:1811.00937v2 [cs.CL]

Official website:

https://www.tau-nlp.sites.tau.ac.il/commonsenseqa

Name		Name	Last commit message	Last commit date
Latest commit History 37 Commits
.ipynb_checkpoints		.ipynb_checkpoints
CommonsenseQA_DataSet.jsonl		CommonsenseQA_DataSet.jsonl
LICENSE		LICENSE
My used PROMPTS.md		My used PROMPTS.md
README.md		README.md
dataset_tokenization.ipynb		dataset_tokenization.ipynb
model_training.ipynb		model_training.ipynb
tokenized_data.csv		tokenized_data.csv
tokenized_data.json		tokenized_data.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL-ChatMate

The dataset

About

Releases

Packages

Languages

License

Q-Aware-Labs/RL-ChatMate

Folders and files

Latest commit

History

Repository files navigation

RL-ChatMate

The dataset

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages