Refactor evaluation metrics to output Legolas table rows #45
Conversation
Great---this would be a huge improvement! As discussed in person, we'll try to get to this at some point in the not too distant future, and in the interim count not having this implementation yet as tech debt.
working on this now
@@ -1,10 +1,12 @@
 name = "Lighthouse"
 uuid = "ac2c24cd-07f0-4848-96b2-1b82c3ea0e59"
 authors = ["Beacon Biosignals, Inc."]
-version = "0.13.4"
+version = "0.14.0"
Bumping the version b/c we've changed the expected output for "failure" cases from missing to NaN for any metrics output types that show up in vectors (to aid in Arrow serialization).
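(For illustration, a minimal hedged sketch of that convention; the variable name and values below are made up, not taken from the PR.)

# Failures that live inside a vector-valued metric are encoded as NaN rather
# than missing, so the vector keeps a concrete Float64 element type and
# round-trips cleanly through Arrow:
per_class_kappas = [0.83, NaN, 0.91]  # NaN marks a class whose kappa couldn't be computed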
 TensorBoardLogger = "0.1"
-julia = "1.5"
+julia = "1.6"
Since we're making a breaking version bump, taking the opportunity to additionally bump the lowest supported Julia version.
src/Lighthouse.jl
@@ -18,6 +20,9 @@ export confusion_matrix, accuracy, binary_statistics, cohens_kappa, calibration_
 include("classifier.jl")
 export AbstractClassifier

+include("row.jl")
+# TODO: export EvaluationRow ?
Should we export EvaluationRow?
Suggested change:
-# TODO: export EvaluationRow ?
+export EvaluationRow
I think so, because it's needed to fix Arrow serialization issues - maybe we should add a comment about that too. Normally it's not important to pass through the Row type after reading from Arrow, but in this case it is, since we're using it to work around serialization issues.
Will do, although I'm not sure I understand the rationale?
So normally, if you do

table = Legolas.read(path_to_table)

then you get a table that serves your needs; you don't need to do any further processing to it (though you could pass it to a DataFrame for convenience, for example). But for a table of evaluation rows, your confusion matrices will be vectors in this case! So to get a "good" table, you should do

table = (EvaluationRow(row) for row in Tables.rows(Legolas.read(path_to_table)))

Then you've got a (row-oriented) table whose confusion matrix column has actual matrices. Then you can pass that to a DataFrame if you want.

Usually you don't need to pass through the Row type, but since we're using it to fix stuff, you do in this case.
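To make that concrete, here's a hedged sketch of the full round trip; path_to_table and eval_rows are placeholders, and the Legolas.write call is an assumption about the API rather than something shown in this thread:

using Legolas, Tables, DataFrames, Lighthouse

# write out a table of evaluation rows under the Legolas schema (assumed API)
Legolas.write(path_to_table, eval_rows, Legolas.Schema("lighthouse.evaluation@1"))

# reading it back yields a valid table, but matrix-valued columns
# (e.g. the confusion matrix) come back flattened to vectors...
raw = Legolas.read(path_to_table)

# ...so re-wrap each row as an EvaluationRow to restore actual matrices,
# then materialize a DataFrame if that's convenient:
df = DataFrame(EvaluationRow(row) for row in Tables.rows(raw))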
Ah, gotcha! Thanks for the explanation.
-(isnothing(votes) || size(votes, 2) < 2) && return nothing # no votes given or only one expert
+# no votes given or only one expert:
+(isnothing(votes) || size(votes, 2) < 2) &&
+    return (; per_class_IRA_kappas=missing, multiclass_IRA_kappas=missing)
In this case it's still okay to return missing (not NaN) b/c we aren't serializing these as part of a vector---the schema accounts for them.
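A hedged sketch of that distinction (this schema declaration is illustrative only, not the PR's actual lighthouse.evaluation@1 definition):

using Legolas

# Whole fields may legitimately be `missing`; the schema allows it directly.
# Values *inside* a vector, by contrast, use NaN so the vector stays Vector{Float64}.
const SketchRow = Legolas.@row("example.evaluation-sketch@1",
                               per_class_IRA_kappas::Union{Missing,Vector{Float64}},
                               multiclass_IRA_kappas::Union{Missing,Float64},
                               per_class_kappas::Vector{Float64})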
@@ -303,11 +326,9 @@ end

 # Test NaN spearman due to unranked input
 votes = [1; 2; 2]
 predicted_soft = [
...juliaformatter got hold of these matrices, and I didn't have the heart to undo the updates.
@ericphanson, I can't officially request your review here (since you were the originating author!), but want to take a review anyway?
I'm very excited for this! I think this will make things much nicer. (Cannot approve bc it's technically my own PR, but ✅ )
I wanted something like this for an internal project but ended up just processing the output myself. But maybe something like this would be useful here.
The idea is to return a Legolas.@row("lighthouse.evaluation@1", ...) row instead of a Dict from evaluation_metrics. Then if you run Lighthouse evaluation on e.g. several models with the same dataset or one model with several datasets, you can easily collect the results as rows in a structured table. See also #42 / #41 for other comparison stuff.
Status: I think the tests are failing and this started to feel like a waste of time so I switched focus, but could be picked up.
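A rough sketch of the workflow described above, assuming the refactor lands as proposed; the evaluate_model helper, its arguments, and the output path are hypothetical stand-ins for however you compute Lighthouse's metrics for one model/dataset pair:

using DataFrames, Legolas, Lighthouse

# one EvaluationRow per (model, dataset) run, via a hypothetical helper
rows = [evaluate_model(model, dataset) for model in models]

# rows share the lighthouse.evaluation@1 schema, so they stack into one table
df = DataFrame(rows)

# and can be serialized for later comparison across runs
Legolas.write("evaluations.arrow", df, Legolas.Schema("lighthouse.evaluation@1"))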