This folder contains metadata files for the models submitting to the COVID-19 Forecast Hub. The specification for these files has been adapted to be consistent with model metadata guidelines in the hubverse documentation.
Each model is required to have metadata in yaml format.
These instructions provide detail about the data format as well as validation that you can do prior to a pull request with a metadata file.
This section describes each of the fields (keys) in the YAML document. Please order the variables in this order in your YAML metadata file.
The following metadata fields are mandatory.
The full name of your team. Must be fewer than 50 characters.
An abbreviated (<16 character) name for your team.
The full name of your model. Must be fewer than 50 characters.
An abbreviated (<16 character) name for your model.
A list of all individuals involved in producing the model. For each contributor, please provide a name, affiliation, and email address. Individually may optionally provide ORCID identifiers.
Use the following YAML syntax
model_contributors: [
{
"name": "Modeler Name 1",
"affiliation": "Institution Name 1",
"email": "modeler1@example.com",
"orcid": "1234-1234-1234-1234"
},
{
"name": "Modeler Name 2",
"affiliation": "Institution Name 2",
"email": "modeler2@example.com",
"orcid": "1234-1234-1234-1234"
}
]
All email addresses provided will be added to an email distribution list through which the Hub makes announcements to model contributors. You can unsubscribe from this list at any time.
One of the following [accepted licenses] by inputting license: <license code>
with one of the following codes. The license you pick will govern future use of the forecast data you contribute to the Hub.
CC-BY-4.0
: Creative Commons Attribution 4.0 InternationalCC0-1.0
: CC0 1.0 Universal (public domain declaration)CC-BY_SA-4.0
: Creative Commons Attribution-ShareAlike 4.0 InternationalPPDL
:ODC-by
: Open Data Commons Attribution LicenseODbL
: Open Data Commons Open Database LicenseOGL-3.0
: UK National Archives Open Government License 3.0
A team-specified boolean indicator (true
or false
) for whether the model should be considered eligible for inclusion in Hub ensembles and public visualizations. A team may specify up to two models as designated_model
s for inclusion. Models which have a designated_model value of false
will still be included in internal forecasting Hub evaluations, but not in published ensembles and visualizations.
List or description of the data sources used to inform the model, in particular any dataset used that are not the target dataset of epiweekly incident COVID-19 hospital admissions reported to NHSN.
A brief description of your forecasting methodology. Must be fewer than 200 characters.
A full description of your model methods. If the model is modified, you can use this field to provide a changelog, with dates and descriptions of implemented changes.
A boolean value (true
or false
) that indicates whether a model is an ensemble of any separate component models.
A boolean value (true
or false
) that indicates whether a model is an ensemble specifically of other models submited to the FluSight forecasting hub.
The following metadata fields are optional, but encouraged.
An identifier of the version of the model. We recommend semantic versioning style: X.Y
or X.Y.Z
, so 1.2
for version 1.2.
The url of a website with additional information about your model, such as detailed methods, visualizations, or interactive dashboards.
The URL of a Github (or similar) code repository containing model source code.
Citations for one or more publications, preprints, et cetera with additional model details. Example:
citation: "Gibson GC , Reich NG , Sheldon D. Real-time mechanistic bayesian forecasts of Covid-19 mortality. medRxiv. 2020. https://doi.org/10.1101/2020.12.22.20248736".
Any information about funding source(s) for the team or members of the team that would be relevant to include in resulting COVID-19 Forecast Hub publications. Example:
team_funding: "National Institutes of General Medical Sciences (R01GM123456). The content is solely the responsibility of the authors and does not necessarily represent the official views of NIGMS."
GitHub user ids of team members who would be responsible for submitting forecasts as a pull request to the CovidHub repository. Only the pull request from users specified here can get merged automatically after validation. Example:
designated_github_users: [
"dependabot",
"octocat"
]
or
designated_github_users: ["dependabot"]
Optionally, you may validate a model metadata file locally before submitting it to the hub in a pull request. Note that this is not required, since the validations will also run on the pull request, but it is encouraged. To run validations locally, follow these steps:
- Create a fork of the
covid-forecast-hub-2024
repository and then clone the fork to your computer. - Create a draft of the model metadata file for your model and place it in the
model-metadata
folder of this clone. - Install the hubValidations package for R by running the following command from within an R session:
install.packages("hubValidations", repos = c("https://hubverse-org.r-universe.dev", "https://cloud.r-project.org"))
- Validate your draft metadata file by running the following command in an R session:
hubValidations::validate_model_metadata(
hub_path="<path to your clone of the hub repository>",
file_path="<name of your metadata file>")
For example, if your working directory is the root of the hub repository, you can use a command similar to the following:
hubValidations::validate_model_metadata(hub_path=".", file_path="UMass-trends_ensemble.yml")
If all is well, you should see output similar to the following:
✔ model-metadata-schema.json: File exists at path hub-config/model-metadata-schema.json.
✔ UMass-trends_ensemble.yml: File exists at path model-metadata/UMass-trends_ensemble.yml.
✔ UMass-trends_ensemble.yml: Metadata file extension is "yml" or "yaml".
✔ UMass-trends_ensemble.yml: Metadata file directory name matches "model-metadata".
✔ UMass-trends_ensemble.yml: Metadata file contents are consistent with schema specifications.
✔ UMass-trends_ensemble.yml: Metadata file name matches the `model_id` specified within the metadata file.
If there are any errors, you will see a message describing the problem.