Some kind of schema for all the data requirements to run the model? Whilst we wait for #347 to be available