A project analyzing data from Immigration and Customs Enforcement's Alien Repatriation and Transfer System (ARTS) for the report Hidden in Plain Sight: ICE Air and the Machinery of Mass Deportation by the University of Washington Center for Human Rights.
Original Excel files as released by ICE can be found on UWCHR's Google Drive. These raw files are excluded from the repository due to their size. The most complete data were released in "Installment 2", containing ICE Air ARTS passenger and mission data for 2010-10-01 through 2019-05-03.
- Hidden in Plain Sight: ICE Air and the Machinery of Mass Deportation
- Hidden in Plain Sight: King County Collaboration with ICE Air Deportation Flights at Boeing Field
- Hidden in Plain Sight: ICE Air Data Appendix
This repo uses Git LFS.
This project uses "Principled Data Processing" techniques and tools developed by @HRDAG; see for example "The Task Is A Quantum of Workflow."
Tasks in this project are designed to be executed using the recursive make tool makr.
Datasets and high level resources:
installment1/- Dataset released to UWCHR via FOIA in December 2018. Contains ICE Air ARTS passenger data for 2010-10-01 through 2018-12-05.installment2/- Dataset released to UWCHR via FOIA in August 2019. Contains ICE Air ARTS passenger and mission data for 2010-10-01 through 2019-05-03. This is the most complete version of data obtained via FOIA by UWCHR.installment3/- Dataset released to UWCHR via FOIA in November 2020. Contains ICE Air ARTS passenger and mission data for 2018-10-01 through 2020-05-08. This version of data is more heavily redacted than previous installments obtained via FOIA by UWCHR.compare/- Project for comparing contents of ICE Air ARTS installments 1 and 2.kykm/- Small task for dataset of community observations of ICE Air flights at Yakima Air Terminal (KYMK).radarbox/- Task for data on Swift Air (SWQ) and World Atlatic (WAL) flights using commercial flight tracker records from Radarbox.comshare/- Various hand-written files and resources shared by multiple other tasks.docs/- HTML documentation published at https://uwchr.github.io/ice-air/
Project-level tasks, in order of workflow (not all tasks will be present in all projects):
import/- Convenience task for importing ICE Air ARTS dataset. Input files inimport/input/have been previously renamed to remove spaces in filenames, converted to CSV with pipe separator (|), and compressed using Gzip. Input files are symlinked toimport/output/and then toinput/of downstream task for transformation and analysis.- Original Excel files as released by ICE can be found on UWCHR's Google Drive. These raw files are excluded from the repository due to their size.
optimize/- Determines optimal Python/Pandas data types for each field in the original dataset and outputs this as a YAML dictionary used and modified in downstream tasks.clean/- Standardizes selected field values inclean/hand/clean.yaml; fixes missing and bad airport data; removes duplicate passenger records. Outputs full ICE Air ARTS datasets, after cleaning, as Gzipped CSV files.analyze/- Contains various exploratory Jupyter notebooks and R Markdown. These notebooks and their outputs are exploratory and do not necessarily reflect the findings of UWCHR's report.analyze/output/contains various versions of figures and data subsets; currently none of these are used in any downstream tasks.
write/- Writes out final reports to HTML using Pweave.- All analysis, figure generation, etc. takes place in
write/src/.
- All analysis, figure generation, etc. takes place in