data split

Hi, after reading your code, can i check with you the following regarding how you split your dataset?
1. 5 samples are generated from prompt_gen_data to induce the instruction from the open-source LLM
2. 20 samples are generated from eval_data to evaluate the quality of the induced instruction during BO iterations
3. 100 samples are generated from test_data to evaluate the quality of the proposed instruction after BO iterations
Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

data split #4

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

data split #4

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions