R scripts for basic text analysis of US federal legislation: bills introduced in the US Senate or House of Representatives. TidyBill.R reads a bill in .pdf format and converts it to a 'tidy' .csv format in which each text paragraph is accompanied by an outline tag usable to reference the paragraph and an optional list of keywords. CompareBills.R uses these keywords to identify paragraphs in a second bill that may be related to each paragraph in a first bill. Lightly tested by processing two bills from the current session (2021-22).
-
Notifications
You must be signed in to change notification settings - Fork 1
R scripts for basic text analysis of US federal legislation: bills introduced in the US Senate or House of Representatives. TidyBill.R reads a bill in .pdf format and converts it to a 'tidy' .csv format in which each text paragraph is accompanied by an outline tag usable to reference the paragraph and an optional list of keywords. CompareBills.R…
License
orelyx/legistext
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
R scripts for basic text analysis of US federal legislation: bills introduced in the US Senate or House of Representatives. TidyBill.R reads a bill in .pdf format and converts it to a 'tidy' .csv format in which each text paragraph is accompanied by an outline tag usable to reference the paragraph and an optional list of keywords. CompareBills.R…
Topics
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published