Mathematical Inconsistency: Running Statistics notation and variance calculation #1626
debemdeboas
started this conversation in
General
Replies: 1 comment 1 reply
-
I may be wrong about this too. Statistics is not my strong suit. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
In the Running Statistics subsection of the documentation (From batch to online/stream), there appears to be a mathematical inconsistency in how variance is presented.
Current notation:
The text states that "$\sigma_t$ is the running variance at time $t$ ", but this is incorrect.$\frac{s_{t+1}}{n_{t+1}}$ where s is the running sum of squares represents neither the variance nor the standard deviation - it's simply the sum of squares divided by n.
The quantity
Proposed correction:
Beta Was this translation helpful? Give feedback.
All reactions