@wagnerlmichael wagnerlmichael commented Feb 4, 2026

Same as the res pipeline change

This PR reworks the performance report to work with our new sales val data model additions and adds two things to the report:

  • outlier numbers (raw and proportion) per year
  • outlier proportion maps per nbhd incorporated from our geography group testing
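For readers unfamiliar with the first item, here is a minimal sketch of how raw outlier counts and proportions per year could be computed with dplyr. The column names `year` and `sv_is_outlier` and the sample rows are assumptions for illustration only; the report's actual code is not shown in this thread.

```r
library(dplyr)

# Hypothetical sales data: one row per sale, with an assumed outlier flag
sales <- tibble(
  year = c(2023, 2023, 2023, 2024, 2024),
  sv_is_outlier = c(TRUE, FALSE, FALSE, TRUE, TRUE)
)

# Raw outlier count and outlier proportion for each year
outliers_by_year <- sales %>%
  group_by(year) %>%
  summarise(
    n_sales = n(),
    n_outliers = sum(sv_is_outlier),     # raw number of outliers
    prop_outliers = mean(sv_is_outlier), # share of sales flagged as outliers
    .groups = "drop"
  )
```

Taking the mean of a logical column gives the proportion directly, which avoids a separate division step.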

It also reorganizes the outlier reasons that are displayed.

Closes ccao-data/model-res-avm#426.

@wagnerlmichael wagnerlmichael changed the title from "Update outlier report for sales val changes" to "Update outlier section of performance report for sales val changes" Feb 4, 2026
"Algorithm: Outlier, High price",
str_detect(sv_outlier_reason, "Low price") ~
"Algorithm: Outlier, Low price",
TRUE ~ sv_outlier_reason
Member Author

Took a stab at removing the redundant combo of `str_detect` and `regex` here.
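As a rough illustration of why the combination is redundant: stringr treats a plain character pattern as a regular expression by default, so wrapping it in `regex()` only matters when options such as `ignore_case` are needed. The sketch below assumes hypothetical `sv_outlier_reason` values and an `outlier_label` output column; only the "Low price" and fallback branches appear in the diff above, and the "High price" condition is inferred from its label.

```r
library(dplyr)
library(stringr)

# Hypothetical reason strings; the real values are not shown in this thread
sales <- tibble(
  sv_outlier_reason = c(
    "High price (raw)",
    "Low price (raw & sqft)",
    "Non-livable area"
  )
)

recoded <- sales %>%
  mutate(
    outlier_label = case_when(
      # A bare string pattern is already interpreted as a regex
      str_detect(sv_outlier_reason, "High price") ~
        "Algorithm: Outlier, High price",
      str_detect(sv_outlier_reason, "Low price") ~
        "Algorithm: Outlier, Low price",
      # Any other reason passes through unchanged
      TRUE ~ sv_outlier_reason
    )
  )

# The explicit regex() wrapper gives the same matches as the bare pattern
same_result <- identical(
  str_detect(sales$sv_outlier_reason, "Low price"),
  str_detect(sales$sv_outlier_reason, regex("Low price"))
)
```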

make_triad_map("North")
```

:::
Member Author

Exact same map code as res

@wagnerlmichael wagnerlmichael marked this pull request as ready for review February 4, 2026 16:19
@jeancochrane jeancochrane changed the base branch from master to Update-pipeline-with-outlier-column-changes February 4, 2026 16:26
Member

@jeancochrane jeancochrane left a comment

Tested and confirmed that the report runs and renders these charts correctly!

Base automatically changed from Update-pipeline-with-outlier-column-changes to master February 4, 2026 19:36
Member

@jeancochrane jeancochrane left a comment

Looks great! I hope it's OK that I took the liberty to run the ingest, push to the DVC remote, and update the hashes in the lockfile here. That way when we merge this, everyone else can use `dvc pull` to grab the version of the training data that includes nonliveable spaces in `sv_outlier_reason`.

The error we've been seeing in rendering the performance report is definitely unrelated to this PR, so let's go ahead and merge this in the meantime.

wagnerlmichael and others added 2 commits February 6, 2026 12:55
Co-authored-by: Jean Cochrane <jeancochrane@users.noreply.github.com>
Co-authored-by: Jean Cochrane <jeancochrane@users.noreply.github.com>
* Update lockfiles to ggplot2 v4

* Upgrade plotly to 4.11.0 to see if it has better support for latest ggplot

* Make sure to ungroup outlier ratio comparison before computing range in `_outliers.qmd`
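To illustrate the last commit message: dplyr verbs run per group on a grouped data frame, so a range computed before `ungroup()` reflects each group rather than the whole table. This is a minimal sketch with hypothetical `year` and `ratio` columns, not the actual code in `_outliers.qmd`.

```r
library(dplyr)

# Hypothetical ratio comparison data, left grouped by an earlier step
ratios <- tibble(
  year = c(2023, 2023, 2024, 2024),
  ratio = c(0.8, 1.1, 0.9, 1.4)
) %>%
  group_by(year)

# While grouped, max() is evaluated separately within each year
grouped_max <- ratios %>%
  mutate(max_ratio = max(ratio)) %>%
  pull(max_ratio)

# After ungroup(), the range covers every row, e.g. for shared axis limits
overall_range <- ratios %>%
  ungroup() %>%
  summarise(lo = min(ratio), hi = max(ratio))
```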
Member

@jeancochrane jeancochrane left a comment

Great work, sorry this was so much trouble to test due to #133!

@wagnerlmichael wagnerlmichael merged commit 1c73d1d into master Feb 6, 2026
5 checks passed
@wagnerlmichael wagnerlmichael deleted the update-condo-report-for-sales-val-changes branch February 6, 2026 22:34
Development

Successfully merging this pull request may close these issues.

Rework outlier section of performance report to simplify and accommodate new spec
