Hello, I would like to ask, when you are training the model, do you only use the first round of dialogue from the ultrachat_200k? #37

jackwwy · 2024-07-21T12:51:38Z

def load_and_process_data_ultrachat(dataset_name, split): try: dataset = load_dataset(dataset_name, split=split) reformatted_data = [{ 'generated': [message['messages'][0], {"role": "assistant", "content": ""}], 'real': [message['messages'][0], message['messages'][1]] } for message in dataset] return reformatted_data except Exception as e: logging.error(f"Error loading or processing dataset: {e}") return []

The text was updated successfully, but these errors were encountered:

junming-yang · 2024-11-04T02:40:52Z

Yes. Only the first round of real dialogue dataset is sampled from ultrachat 200k.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Hello, I would like to ask, when you are training the model, do you only use the first round of dialogue from the ultrachat_200k? #37

Hello, I would like to ask, when you are training the model, do you only use the first round of dialogue from the ultrachat_200k? #37

jackwwy commented Jul 21, 2024

junming-yang commented Nov 4, 2024

Hello, I would like to ask, when you are training the model, do you only use the first round of dialogue from the ultrachat_200k? #37

Hello, I would like to ask, when you are training the model, do you only use the first round of dialogue from the ultrachat_200k? #37

Comments

jackwwy commented Jul 21, 2024

junming-yang commented Nov 4, 2024