Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bulk deidentification workflow improvements #288

Open
cooperlab opened this issue Feb 13, 2025 · 0 comments
Open

Bulk deidentification workflow improvements #288

cooperlab opened this issue Feb 13, 2025 · 0 comments

Comments

@cooperlab
Copy link

We often have a single folder storing a large volume of slides and we want to deidentify only some of them. Under the current workflow we have to generate a copy of these files in another folder, and apply the deidentification there. This adds additional steps for the user (copy and deletion of temporary file), and the temporary volume of storage required can be on the order of hundreds of TB, and copying this can be quite slow. It is possible to script the calling of imagedephi run on individual files, but a lot of work is required to cleanup and aggregate the output.

Desired solution - consume a text file containing the select input files+paths to deidentify. This is also helpful in other scenarios (nested folders for example)

Allowing in place deidentification of the temporary copies is another solution, but this can lead to PHI leakage if some files fail.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant