
Validation framework for model #17

Open
sgreenbury opened this issue Apr 19, 2024 · 9 comments
Labels
validation Model validation and consistency

Comments

@sgreenbury
Collaborator

Aim: define metrics to be used at different stages of the modelling to validate the model against data, e.g. flows from the QUANT model. See the section in the wiki.

@sgreenbury
Collaborator Author

sgreenbury commented Apr 26, 2024

Some measures to consider:

  • Modelled travel times compared to reported travel times in NTS
  • Modelled flows between MSOAs compared to observed flows used in QUANT
  • See here for the commuting flows census data. We can check that the SPC person-to-matched-workplace flows are consistent with this data.
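One common metric for comparing modelled flows against observed flows (e.g. QUANT or census OD matrices) is the Sorensen similarity index. As a minimal sketch, assuming two aligned MSOA-to-MSOA flow matrices (the function name and example matrices here are hypothetical):

```python
import numpy as np

def sorensen_similarity(observed: np.ndarray, modelled: np.ndarray) -> float:
    """Sorensen similarity index between two OD flow matrices.

    Returns a value in [0, 1]; 1 means the matrices are identical.
    """
    assert observed.shape == modelled.shape
    return 2.0 * np.minimum(observed, modelled).sum() / (observed.sum() + modelled.sum())

# Hypothetical 3x3 OD matrices for illustration
obs = np.array([[10, 5, 0], [3, 8, 2], [1, 0, 6]], dtype=float)
mod = np.array([[9, 6, 1], [2, 8, 3], [0, 1, 5]], dtype=float)
print(round(sorensen_similarity(obs, mod), 3))  # -> 0.886
```

The index is robust to the total volume of flows and is widely used for spatial interaction model validation, but it says nothing about which cells disagree, so it is best paired with a per-OD-pair error plot.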

@sgreenbury sgreenbury added the validation Model validation and consistency label Apr 26, 2024
@Hussein-Mahfouz
Collaborator

Guidelines from the DfT on activity- and agent-based models: TAG unit M5-4

@Hussein-Mahfouz
Collaborator

Putting this here for now, but it could be a separate issue on calibration later:

"We need data sets to be recognized as components on the same level as models. Such data components can then enter the integrated frameworks at various places, not only at the top, as input to drive the whole integrated model, and at the bottom, to compare with the output and to calibrate the model. Data components can be also used between components to test, adjust, and correct the data flows inside the integrated model. This can substantially increase the efficiency and accuracy of the integration process, and reduce the overall complexity of the calibration task for the whole integrated model." - ‘Integronsters’, integral and integrated modeling

A useful exercise could be to identify the datasets that could be used to calibrate at intermediate points of the pipeline.

@sgreenbury
Collaborator Author

sgreenbury commented May 9, 2024

Notes on tasks:

  • Retrieve data on MSOA to MSOA flows by mode
  • Function to measure the difference between NTS travel times (ground truth) and estimated travel times (from travel time matrices)
  • Calculate the aggregate MSOA-to-MSOA flows for an SPC-matched population
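The second task above could be a simple error-summary function. A minimal sketch, assuming per-trip travel times in minutes aligned by trip (the function name and example data are hypothetical, not from the repository):

```python
import numpy as np

def travel_time_errors(reported: np.ndarray, estimated: np.ndarray) -> dict:
    """Summary errors between reported (e.g. NTS) and estimated travel times.

    Both arrays hold per-trip travel times in minutes, aligned by trip.
    """
    diff = estimated - reported
    return {
        "mae": float(np.abs(diff).mean()),
        "rmse": float(np.sqrt((diff ** 2).mean())),
        "bias": float(diff.mean()),  # positive => estimates run long on average
    }

# Hypothetical example: four trips
reported = np.array([20.0, 35.0, 10.0, 50.0])
estimated = np.array([22.0, 30.0, 12.0, 55.0])
print(travel_time_errors(reported, estimated))
# -> {'mae': 3.5, 'rmse': 3.807..., 'bias': 1.0}
```

Reporting bias alongside MAE/RMSE matters here: travel time matrices can be systematically fast or slow by mode, and that signature disappears in absolute-error metrics alone.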

@sgreenbury
Collaborator Author

We also need to determine a set of metrics for measuring the quality of matching between the two datasets as part of task 1.

@BZ-BowenZhang
Collaborator

We also need to determine a set of metrics for measuring the quality of matching between the two datasets as part of task 1.

I completely agree with @sgreenbury, and I think calculating the metrics for comparison is easy. But once we have the metrics, how do we judge 'goodness'? In other words, how do we set a threshold as an acceptable standard? We do not have another candidate dataset to compare against, but perhaps we can compare with synthetic populations published in previous papers.
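One way to get a threshold without a second candidate dataset is a null-model baseline: compute the matching metric on the real pairing, then on many randomly shuffled pairings, and judge 'goodness' by how far the real value sits above the null distribution. A minimal sketch under that assumption (the function names and example data are hypothetical):

```python
import numpy as np

rng = np.random.default_rng(0)

def match_agreement(a: np.ndarray, b: np.ndarray) -> float:
    """Fraction of matched record pairs whose categorical attribute agrees."""
    return float((a == b).mean())

def permutation_baseline(a: np.ndarray, b: np.ndarray, n: int = 1000) -> np.ndarray:
    """Agreement scores when the pairing is randomly shuffled (null model)."""
    return np.array([match_agreement(a, rng.permutation(b)) for _ in range(n)])

# Hypothetical example: age bands of ten matched person pairs
a = np.array([0, 1, 2, 2, 1, 0, 3, 3, 2, 1])
b = np.array([0, 1, 2, 1, 1, 0, 3, 2, 2, 1])
observed = match_agreement(a, b)  # 0.8 for this toy data
null = permutation_baseline(a, b)
print(observed, float(null.mean()))  # the null mean sits well below 0.8
```

This does not replace external benchmarks (e.g. published synthetic populations), but it anchors the metric: a matching that barely beats random pairing is clearly not good enough, whatever absolute value it achieves.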

@sgreenbury
Collaborator Author

Adding a reference with validation methods from @stuartlynn

@BZ-BowenZhang
Collaborator

BZ-BowenZhang commented Aug 28, 2024

  • Add more markdown descriptions for the notebook Validation_SPC_with Cencus

@BZ-BowenZhang BZ-BowenZhang linked a pull request Sep 26, 2024 that will close this issue
@BZ-BowenZhang
Collaborator

As discussed on 20th Sep:

  • The Census (commuting OD) dataset will be used as the benchmark dataset to validate the performance of the different synthetic datasets (SPC and AcBM). The Census–SPC validation is complete and ready to merge into the main branch: PR 17 validation framework for model #50
  • As a next step, more validation work will be done once the whole Leeds population with activity chains is available, still comparing against the census data and against the SPC.
  • For the later London case, we may have more candidate datasets to compare/validate against, such as mobile phone data from Tao's group or travel flows from TfL.
