Skip to content

Latest commit

 

History

History
26 lines (18 loc) · 6.36 KB

project-home-2021-03-31.md

File metadata and controls

26 lines (18 loc) · 6.36 KB

2021-03-31

Date

31 Mar 2021

Participants

Discussion topics

Item Notes
Dolt
  • works as POC
  • requires scraper
  • Richard is waiting for a bug fix from dolt and will merge his branch after that
Scrapers
  • Former user (Deleted) add “convert data to a known type or create a new type” to scraper instructions

    • This is a good amount of work because all would need to be kept up to date
  • Former user (Deleted)We should use github to store the DDL scripts and reference it in dolthub, should make a folder for them

    • people would reference these → their columns would be kept up to date
  • Richard Ji basic Python style guide for scrapers

What data to collect

  • We need a decision / definition of “what we really want”
  • The decision from last meeting is something like

    • Accept all data that is up and out publicly
    • Tier and sort and prioritize data based on good or consistent formatting, but accept all legal public data. Omission is not a good start to our process if our goal is “a source of truth for police data.”
    • Only surface PII
    • Classification should not be a barrier—we only need to classify what is
  • Former user (Deleted) draft a policy → publish
Business / professional things
  • Eddie working with Denice Ross on New Jersey data that’s not public. Someone needs to explain to Eddie what’s going on with New Jersey data → he can use that as more specific rationale for more investigation
  • Eddie ensuring our taxes are paid.

Action items

  • Alec Akin to dig deeper on what blockers were for getting New Jersey data + communicate to Eddie
  • Richard Ji basic python style guide for beginner scrapers