-
When performing bootstrapping, the "optimal cutpoint" is different from the mean / median of all bootstrap samples. How is this to be interpreted? How is the "optimal cutpoint" determined during bootstrapping? opt_cut$optimal_cutpoint == opt_cut %>%
dplyr::select(boot) %>%
tidyr::unnest(cols = boot) %>%
dplyr::summarize(mean_oc = mean(optimal_cutpoint))
opt_cut$optimal_cutpoint == opt_cut %>%
dplyr::select(boot) %>%
tidyr::unnest(cols = boot) %>%
dplyr::summarize(mean_oc = median(optimal_cutpoint))
|
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
Hi, I can't see the code you used to create
This will calculate an optimal cutpoint for the Youden-Index in the full data set and also calculate optimal cutpoints for the Youden-Index in 1000 bootstrap samples. The cutpoint in the full data set is of course not necessarily identical to the mean or median of the cutpoints from the bootstrap. This type of "outer bootstrap" is useful for estimating the out-of-sample performance of the estimation method at hand, in this example empirically maximizing the Youden-Index. You get the distribution of the obtained values of the Youden-Index in the bootstrap samples, along with some other metrics (try If you want bootstrapped cutpoints, you should use
This will return an optimal cutpoint which is the mean of the optimal cutpoints from 500 bootstrap samples, maximizing the Youden-Index in every sample. If you would like to estimate the out-of-sample performance too, you can again set, e.g., Keep in mind, though, that this will create 1000 outer bootstrap samples which are then resampled 500 times each, so that this might take quite long to run. You can parallelize this if you set Let me know if you have other questions or if this does not answer your question. |
Beta Was this translation helpful? Give feedback.
Hi,
I can't see the code you used to create
opt_cut
, but I assume it is something like:This will calculate an optimal cutpoint for the Youden-Index in the full data set and also calculate optimal cutpoints for the Youden-Index in 1000 bootstrap samples. The cutpoint in the full data set is of course not necessarily identical to the mean or median of the cutpoints from the bootstrap.
This type of "outer bootstrap" is useful for estimating the out-of-sample performance of the estimation method at hand, in this example empirically maximizing the Youden-Index. You get the distribution…