-
-
Notifications
You must be signed in to change notification settings - Fork 6
Home
Robb Shecter edited this page Mar 11, 2022
·
24 revisions
The key idea is to split parsing into two stages. This reduces the complexity of the code. The first stage (this repo) "simply" retrieves and converts an original source to JSON. The actual schema of the JSON output mirrors the original content. And so, each original source will have very different looking JSON with different attributes. However, as JSON (instead of PDF, HTML, etc.) they're all easily read by the next stage. This allows the second stage to focus on converting the source schema to a particular app's schema.
Current project: International Law in support of Ukraine