Bulk deidentification workflow improvements #288

cooperlab · 2025-02-13T00:29:42Z

We often have a single folder storing a large volume of slides and we want to deidentify only some of them. Under the current workflow we have to generate a copy of these files in another folder, and apply the deidentification there. This adds additional steps for the user (copy and deletion of temporary file), and the temporary volume of storage required can be on the order of hundreds of TB, and copying this can be quite slow. It is possible to script the calling of imagedephi run on individual files, but a lot of work is required to cleanup and aggregate the output.

Desired solution - consume a text file containing the select input files+paths to deidentify. This is also helpful in other scenarios (nested folders for example)

Allowing in place deidentification of the temporary copies is another solution, but this can lead to PHI leakage if some files fail.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bulk deidentification workflow improvements #288

Bulk deidentification workflow improvements #288

cooperlab commented Feb 13, 2025

Bulk deidentification workflow improvements #288

Bulk deidentification workflow improvements #288

Comments

cooperlab commented Feb 13, 2025