Adding Mean F1 Score Difference and Hitting Rate Metrics #39
base: dbe/add_hellinger_pmse
Conversation
…ixing a preprocessing bug and changing the name of the syntheval metric base class.
- class SynthEvalQualityMetric(MetricBase, ABC):
+ class SynthEvalMetric(MetricBase, ABC):
Name change to be more general as it now underpins more than just quality metrics (Hitting Rate, with others to follow).
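For context, a minimal sketch of where the renamed class sits in the hierarchy. Apart from the two class names shown in the diff above, everything here (method names, subclass names, docstrings) is illustrative, not the toolkit's actual code:

```python
from abc import ABC, abstractmethod

import pandas as pd


class MetricBase(ABC):
    """Illustrative stand-in for the toolkit's shared metric interface."""

    @abstractmethod
    def compute(self, real_data: pd.DataFrame, synthetic_data: pd.DataFrame) -> float: ...


class SynthEvalMetric(MetricBase, ABC):
    """Base for all SynthEval-backed metrics. The old name, SynthEvalQualityMetric,
    implied it covered quality metrics only."""


class MeanF1ScoreDifference(SynthEvalMetric):
    """A quality metric (added in this PR)."""

    def compute(self, real_data: pd.DataFrame, synthetic_data: pd.DataFrame) -> float:
        raise NotImplementedError


class HittingRate(SynthEvalMetric):
    """A privacy metric (added in this PR), hence the more general base class name."""

    def compute(self, real_data: pd.DataFrame, synthetic_data: pd.DataFrame) -> float:
        raise NotImplementedError
```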
Small changes :)
[Several resolved review threads (outdated) on src/midst_toolkit/evaluation/quality/mean_f1_score_difference.py]
A smaller rate is better.
NOTE: Categorical variables must be encoded in some way (ordinal or vector) for the evaluation to work. This
Should we perhaps implement the compute function in a way that takes care of the encoding for categorical variables, with do_preprocess applying only to the numerical ones?
I'm trying to allow a bit of flexibility in how users can process the datasets they send to this function, while also providing a "sensible" default (i.e. the one that SynthEval provides).
I don't want to force preprocessing on users who want to do it some other way. For example, perhaps imputation needs to be done, or underrepresented classes need to be collapsed before encoding. So this was the route I thought might work best. Happy to discuss it a bit more though!
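To make the design being discussed concrete, here is a rough sketch of the pattern. The `compute` signature and the `default_preprocess` helper are assumptions for illustration, not the toolkit's actual API:

```python
import pandas as pd
from sklearn.preprocessing import OrdinalEncoder


def default_preprocess(
    real: pd.DataFrame, synthetic: pd.DataFrame
) -> tuple[pd.DataFrame, pd.DataFrame]:
    # Stand-in for the "sensible" SynthEval-style default: jointly
    # ordinal-encode categorical columns so both frames share one encoding.
    categorical = real.select_dtypes(include=["object", "category"]).columns
    if len(categorical) > 0:
        encoder = OrdinalEncoder()
        encoder.fit(pd.concat([real[categorical], synthetic[categorical]]))
        real, synthetic = real.copy(), synthetic.copy()
        real[categorical] = encoder.transform(real[categorical])
        synthetic[categorical] = encoder.transform(synthetic[categorical])
    return real, synthetic


def compute(real: pd.DataFrame, synthetic: pd.DataFrame, do_preprocess: bool = True) -> float:
    # Hypothetical metric entry point: preprocess by default, but let users opt out.
    if do_preprocess:
        real, synthetic = default_preprocess(real, synthetic)
    return 0.0  # placeholder for the actual metric computation
```

A user who needs imputation, or who wants to collapse underrepresented classes before encoding, can preprocess the frames themselves and call `compute(..., do_preprocess=False)`.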
…ering.py (#8) Moving the clustering code into its own clustering.py module and adding docstrings. Also, moving some common parameter type definitions to a params.py module.
PR Type
Feature
Short Description
Clickup Ticket(s): https://app.clickup.com/t/868fk3gbd
This PR adds one metric each to the quality and privacy metrics in the library: Mean F1 Score Difference (quality) and Hitting Rate (privacy).
Two additional components have also been trivially changed: a preprocessing bug was fixed, and the SynthEval metric base class was renamed.
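For readers unfamiliar with the two metrics, here is a rough sketch of what each computes. This is a paraphrase of the general definitions (the range/30 tolerance is SynthEval's documented default for its hit rate); the toolkit's actual implementations and signatures may differ:

```python
import numpy as np
import pandas as pd
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import f1_score


def hitting_rate(real: pd.DataFrame, synthetic: pd.DataFrame) -> float:
    """Fraction of real records matched by at least one synthetic record.
    All columns are assumed numeric/encoded here; a match means every value
    is within 1/30th of that column's range (categoricals would need exact
    matching, omitted for brevity). A smaller rate is better for privacy."""
    tolerance = (real.max() - real.min()).to_numpy() / 30.0
    real_values, synth_values = real.to_numpy(), synthetic.to_numpy()
    hits = 0
    for row in real_values:
        close = np.abs(synth_values - row) <= tolerance
        if np.any(close.all(axis=1)):
            hits += 1
    return hits / len(real_values)


def mean_f1_score_difference(
    real_train: pd.DataFrame, synth_train: pd.DataFrame, holdout: pd.DataFrame, label: str
) -> float:
    """Train the same model on real vs. synthetic data, evaluate both on a
    real holdout, and report the F1 gap. A smaller difference means models
    trained on synthetic data perform comparably to models trained on real
    data. (One model here for brevity; in practice several are averaged.)"""
    model_real = RandomForestClassifier(random_state=0).fit(
        real_train.drop(columns=[label]), real_train[label]
    )
    model_synth = RandomForestClassifier(random_state=0).fit(
        synth_train.drop(columns=[label]), synth_train[label]
    )
    x_holdout, y_holdout = holdout.drop(columns=[label]), holdout[label]
    f1_real = f1_score(y_holdout, model_real.predict(x_holdout), average="macro")
    f1_synth = f1_score(y_holdout, model_synth.predict(x_holdout), average="macro")
    return abs(f1_real - f1_synth)
```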
Tests Added
Tests have been added for both metrics. Hitting Rate is very thoroughly tested, and Mean F1 Score Difference is run on a large set of data that requires training multiple models. I've sanity-checked the results, and they all make sense.