Redaction questions #368
-
Anna has checked some output and come up with a couple of redaction queries:
Thanks |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
For histograms, you have a few options:
For any of these if the y-scale is large, then you won't notice the difference anyway in the image but it's necessary to avoid the problem with vector images containing the raw data. I'm not sure what you've done with hazard ratios, but if you've based them on KM estimates of survival (or equivalently the raw cumulative survival probability if there is no censoring) then you can round the survival estimate / probability first, where the rounding resolution is equivalent to the redaction threshold (as it done here: https://github.com/opensafely/comparative-ve-research/blob/71e4786b9b9919f3beb3a914055280614143fab0/analysis/descriptive/km.R#L171) and then recalculate the HRs from that (ie, This assumes we're being strict about the "redact 5 or less" rule. At some point we may move to a "principles-based" system where if you can justify non-disclosivity despite low counts then you're ok to release. But I'm not sure we're there yet! |
Beta Was this translation helpful? Give feedback.
For histograms, you have a few options:
For any of these if the y-scale is large, then you won't notice the difference anyway in the image but it's necessary to avoid the problem with vector images containing the raw data.
I'm not sure what you've done with hazard ratios, but if you've based them on KM esti…