Skip to content

Latest commit

 

History

History
19 lines (9 loc) · 932 Bytes

README.md

File metadata and controls

19 lines (9 loc) · 932 Bytes

cms-sqlite-loader

This repository contains python code to ETL the synthetic PUF from medicare found here:

This ETL code creates a 1-to-1 mapping between a CMS PUF artifact file (a CSV), and a SQLite .db file.

Motivation

A CMS Synthetic PUF Claims file has all the data needed to do analysis. However it comes in a slightly inconvenient format for querying at the cost for a universally known format: CSV.

Sqlite is a powerful embeddable database that allows complete access to the entirety of the SQL specification, and stores all its data in a single file that is 100% portable from sysyste to system. This last quality allows us to do the following:

CMS Synthetic PUF

Here our ETL code maps from one serialization (a CSV file) to another (a SQLite db file).