For static tools it would also be good to know how many false negatives where explicitly reported (as “no error”), and how many data races were reported as “may exist” or “definitely exists”. This makes quite a difference in ranking/triaging of bugs (I didn’t read all of the text, it might be mentioned somewhere).