
Confusion about training adapters sequentially in a single script #519

Closed

pugantsov opened this issue Mar 15, 2023 · 3 comments
Labels
question Further information is requested Stale

Comments

@pugantsov

pugantsov commented Mar 15, 2023

I am training multiple adapters, using BertModel as a base. I have a question about the following process and how it works with adapters:

(1) I instantiate a BertModel outside of the loop (mostly to save time).
(2) I then set a new adapter to train with the following code:

model.add_adapter(model_name, config=transformers.adapters.PfeifferConfig())
model.train_adapter(model_name)
model = model.to(device)

(3) At the end of the training loop, I disable and delete the adapter as follows:

model.set_active_adapters(None)
model.delete_adapter(model_name)
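
Concretely, each iteration of my loop looks roughly like this (task_names and train_one_task are stand-ins for my actual task list and per-task training code):

import torch
import transformers

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# (1) BertModel is instantiated once, outside the loop
model = transformers.BertModel.from_pretrained("bert-base-uncased")

for model_name in task_names:
    # (2) add and activate a fresh adapter for the current task
    model.add_adapter(model_name, config=transformers.adapters.PfeifferConfig())
    model.train_adapter(model_name)
    model = model.to(device)

    train_one_task(model, model_name)  # stand-in for the actual training loop

    # (3) disable and delete the adapter at the end of the task
    model.set_active_adapters(None)
    model.delete_adapter(model_name)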

Now, my question is: once this loop starts again and adds a fresh adapter:
(1) Does it train a brand-new, randomly initialised head that therefore only encodes knowledge of the task I am currently training?
(2) Or do I also have to call something like delete_head (or simply re-initialise the BertModel at the start of each iteration) so that there is no information leak between subsequent tasks?

@pugantsov pugantsov added the question Further information is requested label Mar 15, 2023
@hSterz
Member

hSterz commented Mar 16, 2023

Hey @pugantsov, to ensure that each adapter training starts with a fresh, randomly initialized adapter and a new head, you need to (see the sketch below):

  • use BertAdapterModel, which lets you attach a separate head for each adapter.
  • add an adapter (add_adapter) and a head (e.g. add_classification_head, depending on the task) with the same name at the beginning of each training run.
  • activate and train the adapter with train_adapter; if the head has the same name, it is automatically activated and set to training as well.
  • reset the optimizer and gradients at the beginning of each training run so that no optimizer state is carried over between adapters.

You don't have to delete the adapters that you have trained; you can keep them loaded but not activated in the model. If you do want to delete one, you have to delete the adapter and the head separately.
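Putting these steps together, a rough sketch could look like this (task_names, num_labels, and make_dataloader are placeholders for your own task list, label counts, and data loading):

import torch
from transformers.adapters import BertAdapterModel, PfeifferConfig

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

# BertAdapterModel supports a separate prediction head per adapter
model = BertAdapterModel.from_pretrained("bert-base-uncased")
model.to(device)

for task_name in task_names:
    # fresh, randomly initialized adapter and head for this task
    model.add_adapter(task_name, config=PfeifferConfig())
    model.add_classification_head(task_name, num_labels=num_labels[task_name])

    # activates adapter + head with the same name and freezes the base model
    model.train_adapter(task_name)

    # new optimizer per task, so no optimizer state leaks between tasks
    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

    for batch in make_dataloader(task_name):
        batch = {k: v.to(device) for k, v in batch.items()}
        optimizer.zero_grad()
        loss = model(**batch).loss  # batch must contain "labels" for the loss
        loss.backward()
        optimizer.step()

    # optional: deactivate, or delete adapter and head entirely
    model.set_active_adapters(None)
    model.delete_adapter(task_name)
    model.delete_head(task_name)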
I hope this helps.

@adapter-hub-bert
Member

This issue has been automatically marked as stale because it has been without activity for 90 days. This issue will be closed in 14 days unless you comment or remove the stale label.

@adapter-hub-bert
Member

This issue was closed because it was stale for 14 days without any activity.

@adapter-hub-bert adapter-hub-bert closed this as not planned on Jun 29, 2023