orelyx/legistext

legistext

R scripts for basic text analysis of US federal legislation, specifically bills introduced in the US Senate or House of Representatives. TidyBill.R reads a bill in .pdf format and converts it to a 'tidy' .csv file in which each paragraph of text is accompanied by an outline tag that can be used to reference the paragraph and, optionally, a list of keywords. CompareBills.R uses these keywords to identify paragraphs in a second bill that may be related to each paragraph in a first bill. Both scripts have been only lightly tested, by processing two bills from the 2021-22 session of Congress.
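
To make the keyword-based comparison concrete, here is a minimal sketch in base R of how paragraphs in two tidy bills might be paired by shared keywords. It is not taken from CompareBills.R. The column names (`tag`, `text`, `keywords`), the semicolon-separated keyword format, the `min_shared` threshold, and the sample rows are all assumptions made for illustration.

```r
# Hypothetical tidy layout: one row per paragraph, with an outline tag,
# the paragraph text, and a semicolon-separated keyword list.
# The real column names produced by TidyBill.R may differ.
bill_a <- data.frame(
  tag      = c("1(a)", "1(b)", "2(a)"),
  text     = c("Grants for rural broadband.",
               "Definitions.",
               "Funding for highway repair."),
  keywords = c("broadband;rural;grant", "", "highway;repair;funding"),
  stringsAsFactors = FALSE
)
bill_b <- data.frame(
  tag      = c("101(a)", "102(a)"),
  text     = c("Highway and bridge repair funding.",
               "Rural broadband deployment grants."),
  keywords = c("highway;bridge;repair;funding",
               "broadband;rural;deployment;grant"),
  stringsAsFactors = FALSE
)

# Split one keyword string into a lower-cased vector with no blanks or repeats.
split_keywords <- function(x) {
  parts <- trimws(strsplit(x, ";", fixed = TRUE)[[1]])
  unique(tolower(parts[nzchar(parts)]))
}

# For each paragraph in `first`, list the paragraphs in `second` that share
# at least `min_shared` keywords with it (an assumed, adjustable threshold).
related_paragraphs <- function(first, second, min_shared = 2) {
  second_kw <- lapply(second$keywords, split_keywords)
  rows <- lapply(seq_len(nrow(first)), function(i) {
    kw <- split_keywords(first$keywords[i])
    shared <- vapply(second_kw,
                     function(k) length(intersect(kw, k)),
                     integer(1))
    hits <- which(shared >= min_shared)
    if (length(hits) == 0) return(NULL)
    data.frame(tag_first  = first$tag[i],
               tag_second = second$tag[hits],
               shared     = shared[hits],
               stringsAsFactors = FALSE)
  })
  # Drop paragraphs with no match, then stack the rest into one data frame.
  do.call(rbind, Filter(Negate(is.null), rows))
}

matches <- related_paragraphs(bill_a, bill_b)
print(matches)
```

Counting shared keywords is the simplest possible relatedness measure. A paragraph with no keywords, such as the hypothetical `1(b)` row, produces no matches, and raising `min_shared` trades recall for fewer spurious pairings.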
