Skip to content
Nel Swanepoel edited this page Nov 18, 2022 · 5 revisions

The PIXL MVP forms part of The Tracer Bullet (TTB) MVP for FlowEHR, UCLH's cloud research & innovation infrastructure initiative.

Deliverables

Delivering the PIXL MVP requires that

  1. We pre-specify the cohort of patients who received chest X-rays (estimated ~300k images)
  2. We intelligently query the PACS/VNA for the chest X-rays from this cohort without causing operational systems to fall over
  3. We automatically de-identify DICOM elements with a simple whitelisting approach and removal of PII overlays
  4. We automatically push DICOM instances to a DICOM node in Azure via DICOMweb
  5. We extract EHR and free-text radiology reports for the specified cohort
  6. We de-identify free-text radiology reports with Presidio
  7. We de-identify PII EHR attributes using a blacklisting approach
  8. We link de-identified data securely
  9. We automatically push radiology reports & EHR data into Delta Lake on Azure
  10. We automatically ingest data from Delta Lake into the Feathr feature store
  11. We provide controlled access to the DICOM node and the Feathr feature store to an Azure TRE workspace
  12. We offer useful written guidance to the research team
  13. We provide workable policies and SOPs for managing image extraction via the PIXL pipeline

Out of Scope

  • Real-time data ingestion
  • DICOM pixel data de-identification
  • Identifiable patient data in Azure
  • Perfection

Target Delivery Date

mid-November 2022
mid December 2022

Clone this wiki locally