This repository contains the code and data for the paper "Steering Conversational Large Language Models for Long Emotional Support Conversations".

The LoRA weights of our fine-tuned llama3-8b-instruct model are hosted on Hugging Face here.
You can use the cli_chat.py script to chat with the model. At each turn you are asked to choose a strategy to continue the conversation (or to continue without any strategy):

```bash
python cli_chat.py --model_name_or_path navidmadani/esconv_sra_llama3_8b
```
Alternatively, run the script with the repository root on PYTHONPATH:

```bash
PYTHONPATH=.. python cli_chat.py --model_path navidmadani/esconv_sra_llama3_8b
```
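You can also load the model and generate continuations with your desired strategy programmatically. The following is a minimal sketch under a few assumptions: that the Hub repo is a PEFT LoRA adapter on top of meta-llama/Meta-Llama-3-8B-Instruct, and that the strategy is injected as plain text in the prompt (the system message below is hypothetical; see cli_chat.py for the exact prompt template we use).

```python
# Hedged sketch: load the LoRA adapter with PEFT and generate one turn.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import PeftModel

base_id = "meta-llama/Meta-Llama-3-8B-Instruct"
adapter_id = "navidmadani/esconv_sra_llama3_8b"

tokenizer = AutoTokenizer.from_pretrained(base_id)
base = AutoModelForCausalLM.from_pretrained(
    base_id, torch_dtype=torch.bfloat16, device_map="auto"
)
model = PeftModel.from_pretrained(base, adapter_id)

strategy = "Reflection of feelings"  # one of the ESConv support strategies
messages = [
    # Hypothetical prompt format; check cli_chat.py for the real template.
    {"role": "system", "content": f"Respond using the support strategy: {strategy}."},
    {"role": "user", "content": "I feel really overwhelmed at work lately."},
]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(input_ids, max_new_tokens=128, do_sample=True, temperature=0.7)
print(tokenizer.decode(output[0, input_ids.shape[-1]:], skip_special_tokens=True))
```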
The original ESConv dataset is available under the esconv/ directory. You can run process_esconv.sh to convert the data into the format we use in our experiments; it will create a JSON file called conversations.json inside the same folder. Run the script with the following command:

```bash
bash process_esconv.sh
```
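Once the file is created, you can take a quick look at the processed conversations. This is a minimal sketch; the exact schema is defined by the processing script, so inspect the keys rather than relying on any particular field names.

```python
# Minimal sketch: peek at the processed conversations.
# The exact schema is produced by process_esconv.sh and may differ from
# the original ESConv release; print the keys to see what is available.
import json

with open("esconv/conversations.json") as f:
    conversations = json.load(f)

print(f"{len(conversations)} conversations")
print(conversations[0].keys())
```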
Then you can postprocess the generated responses using the prompting/postprocess.py script. A sample of the generated data is available in the data/ directory. Each file contains one incomplete conversation and a few continuations using different strategies; we also provide the exact prompt that we used to generate each continuation.
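To browse the samples, something like the following should work. It is a sketch under the assumption that the files are JSON with fields for the conversation, the continuations, and the prompt; check one file to confirm the actual layout.

```python
# Hedged sketch: list the sample continuation files in data/ and their keys.
# The JSON layout is an assumption; inspect one file to confirm.
import json
from pathlib import Path

for path in sorted(Path("data").glob("*.json")):
    with open(path) as f:
        sample = json.load(f)
    print(path.name, list(sample.keys()))
```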
We are on a mission to complete this dataset and make it available to the public. We will update this section once each part of the dataset is ready. You can currently download the first version of the dataset from this link:
| Model Used | Number of Conversations | Number of Continuations | Download Link |
|---|---|---|---|
| LLama2-7b-chat | 1,297 | 41,994 | Download |
| LLama2-13b-chat | 1,297 | 41,822 | Download |
| LLama2-70b-chat | 1,297 | 24,760 | Download |
Our most recent dataset, generated with llama3-70b-instruct, is hosted on Hugging Face here.
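Once you have the dataset's Hub id from the link above, it can be loaded with the datasets library. The repo id below is a placeholder, not the real identifier.

```python
# Hedged sketch: load the llama3-70b-instruct-generated dataset from the Hub.
from datasets import load_dataset

# Placeholder repo id; substitute the one from the Hugging Face link above.
dataset = load_dataset("<hf-dataset-repo-id>")
print(dataset)
```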
All of the scripts and experiments for training our proposed models can be found in the training/ folder. You can use the prepare_classification_data.py and data_preparation.py files to preprocess the synthetic data for building the strategy classifier and fine-tuning the Llama models. Afterwards, lora_finetuning_llama.py and train_strategy_classifier.py can be used to train these models. A sample bash script to run the LoRA fine-tuning can be found in lora_finetuning_llama.sh.
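For orientation, the sketch below shows the general PEFT LoRA recipe that such a fine-tuning script follows. The rank, alpha, dropout, and target modules here are illustrative choices, not the values used in the paper; see lora_finetuning_llama.py and lora_finetuning_llama.sh for the actual setup.

```python
# Illustrative sketch of the general LoRA recipe with PEFT; the actual
# hyperparameters and data pipeline live in lora_finetuning_llama.py.
import torch
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B-Instruct",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
lora_config = LoraConfig(
    r=16,                               # illustrative rank, not the paper's value
    lora_alpha=32,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
# From here, tokenize the strategy-conditioned dialogues and train with
# your preferred trainer; see lora_finetuning_llama.sh for the real setup.
```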
You can compare two models with standard prompting on strategy adherence. Simply run the following command to load the two models and compare them side by side:

```bash
python head2head_livechat.py \
    --model_name_or_path_1 'meta-llama/Meta-Llama-3-8B-Instruct' \
    --model_name_or_path_2 'outputs/your_llama3_lora_finetuned_model'
```

You'll get a UI like the image below, where you can choose a strategy at each turn to respond with.
For our experiments we use the LLaMA-2 chat models with 4-bit quantization. You can follow the instructions in the following links to get access to the 7b, 13b, and 70b models on Hugging Face. All of the experiments are conducted using the transformers library, with bitsandbytes for quantizing the models. We run inference on a single A100 GPU with 80GB of memory.
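As a reference, a 4-bit model can be loaded as follows. This is a minimal sketch of the bitsandbytes setup; the exact quantization settings used in our scripts may differ.

```python
# Hedged sketch: load a LLaMA-2 chat model in 4-bit with bitsandbytes.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "meta-llama/Llama-2-13b-chat-hf"
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # illustrative choice
    bnb_4bit_compute_dtype=torch.bfloat16,  # illustrative choice
)
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)
```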
You can run the experiments in the paper using the following commands:

```bash
cd prompting
bash llama7b.sh
bash llama13b.sh
bash llama70b.sh
```
This will generate the sampled data collections for the experiments in the paper. The rest of the analysis is done in the prompting/strategy_following_comparison.ipynb notebook.
We also provide code to visualize the attention that each span of the prompt receives. You can run the following Flask app to visualize the attention weights:

```bash
cd attention_visualizer
python app.py --data_dir /path/to/generated/pickle/files/from/previous/step
```
Make sure you provide a directory with pickle files in the format produced in the previous section by prompting/multiple_strategy_continuation.py. You will see an HTML page similar to the one below, in which you can select a span and visualize the attention weights over the prompt text. The top 20 tokens that the model attends to are shown on the left side of the page:
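For context, attention weights of this kind can be captured with transformers by passing output_attentions=True. This sketch only illustrates the general mechanism; the exact pickle schema the visualizer expects is defined in prompting/multiple_strategy_continuation.py.

```python
# Hedged sketch: capture per-layer attention weights for a prompt.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "meta-llama/Llama-2-7b-chat-hf"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.bfloat16, device_map="auto"
)

inputs = tokenizer("I feel anxious about my exams.", return_tensors="pt").to(model.device)
with torch.no_grad():
    out = model(**inputs, output_attentions=True)

# out.attentions holds one tensor per layer, shaped (batch, heads, seq, seq).
last_layer = out.attentions[-1].mean(dim=1)  # average over attention heads
print(last_layer.shape)
```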