Proposal: refactor probing pipeline args #93

oserikov · 2022-12-05T15:58:22Z

I propose to allow for model_config, model, and tokenizer to be optional arguments to the experiment class, rather than setting them post-factum. Like, you either simply pass the model's name, like here, or pass the whole config-model-tokenizer triplet like this :

    model_config = AutoConfig.from_pretrained(
        model_name,
        output_hidden_states=True, 
        output_attentions=True
    )
    model = AutoModelForCausalLM.from_pretrained(
        model_name,
        config=model_config,
        device_map="auto",
        torch_dtype=dtype,
        max_memory = get_max_memory_per_gpu_dict(dtype, model_name)
    )
    tokenizer = AutoTokenizer.from_pretrained(model_name, config = model_config)
    experiment = ProbingPipeline(
        config=config, model = model, tokenizer=tokenizer,
        device = device,
        metric_names = ["f1", "accuracy"],
        encoding_batch_size = encoding_batch_size,
        classifier_batch_size = classifier_batch_size
    )

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Proposal: refactor probing pipeline args #93

Proposal: refactor probing pipeline args #93

oserikov commented Dec 5, 2022 •

edited

Loading

Proposal: refactor probing pipeline args #93

Proposal: refactor probing pipeline args #93

Comments

oserikov commented Dec 5, 2022 • edited Loading

oserikov commented Dec 5, 2022 •

edited

Loading