The model architecture is implemented in PyTorch; experiments are run with PyTorch Lightning and configured with Hydra. Data management uses Dask, hyperparameter optimization uses Optuna and Ray, and the Performer implementation is based on performer-pytorch with the fast-transformers CUDA builds.
This codebase is based on life2vec from the paper *Using Sequences of Life-events to Predict Human Lives*.
The `/conf` folder contains configs for the experiments:
- `/experiment` contains configuration for training.
- `/tasks` contains configuration for data augmentation.
- `/trainer` and `/datamodule` contain configuration for Lightning's `Trainer`.
- `/data_new` contains configuration for data loading and processing.
- `callbacks.yaml` contains configuration for Lightning's `Callbacks`.
- `prepare_data.yaml` can be used to run data preprocessing.
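As a rough sketch of how Hydra composes these config groups (the group names follow the folders above, but the keys and values here are illustrative assumptions, not the repo's actual config), an experiment file might look like:

```yaml
# conf/experiment/decode_only.yaml (hypothetical example)
# @package _global_
defaults:
  - override /datamodule: decode_only   # assumed group entry
  - override /trainer: default          # assumed group entry

trainer:
  max_epochs: 50        # illustrative value
datamodule:
  batch_size: 64        # can be overridden on the CLI, e.g. datamodule.batch_size=8
```

Selecting `experiment=decode_only` on the command line merges such a file on top of the base config.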
The `/src` folder contains the source code:
- `/src/dataloaders` contains scripts to preprocess, augment, and load data.
- `/src/models` contains the model's source code.
- `train.py`, `finetune.py`, `test.py`, and `tune.py` each run a particular stage of the training.
- `prepare_data.py` was used to run the data processing.
- `sample_idx.py` and `multiple_idx.py` generate sequences for individuals in the database, conditioned on some known years.
If using NVIDIA GPUs, we recommend building a container from the provided `Dockerfile`.
```bash
# build datasets
HYDRA_FULL_ERROR=1 python -m src.prepare_data experiment=decode_only
# run training
HYDRA_FULL_ERROR=1 python -m src.train experiment=decode_only
# run finetuning
HYDRA_FULL_ERROR=1 python -m src.finetune generate=decode_only
# run sequence generation (requires specifying parameters)
HYDRA_FULL_ERROR=1 python -m src.multiple_idx generate=decode_only datamodule.batch_size=8 generate.dataloader.file_name=...
```
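The `key=value` tokens in these commands are Hydra overrides that address nested config entries by dotted path. As a toy illustration of the convention (this is a simplified re-implementation for explanation, not Hydra's actual code):

```python
def apply_override(cfg: dict, override: str) -> None:
    """Apply a Hydra-style dotted override such as 'datamodule.batch_size=8'."""
    path, _, raw = override.partition("=")
    keys = path.split(".")
    node = cfg
    for key in keys[:-1]:
        node = node.setdefault(key, {})
    # Best-effort type coercion: integer-looking values become ints,
    # everything else stays a string.
    node[keys[-1]] = int(raw) if raw.lstrip("-").isdigit() else raw

cfg = {"datamodule": {"batch_size": 64}}
apply_override(cfg, "datamodule.batch_size=8")
print(cfg["datamodule"]["batch_size"])  # 8
```

In the real CLI, Hydra resolves these paths against the composed config, so `generate.dataloader.file_name=...` sets a value deep inside the `generate` group.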