Data Source: PoliInformatics Research Challenge from PoliInformatics.org

Collected, curated, and processed by John Wilkerson and Anne Washington

Transformed and formatted by Jason Chuang

Description, Fields, Files

README

Credits and file formats

README.txt

Report of the Senate Committee on Homeland Security and Governmental Affairs

Wall Street and the financial crisis: anatomy of a financial collapse

April 2011, 646 pages, parsed by page

doc_id
paragraph
page_number
paragraph_number
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)
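The page-parsed reports share the four fields listed above. As a minimal sketch of how such a spreadsheet might be read, the Python below groups paragraphs by page. It assumes that the CSV header row uses exactly these field names and that page_number and paragraph_number hold integers; neither assumption is confirmed by this listing. The sample rows are placeholders, not text from the report.

```python
import csv
import io
from collections import defaultdict

# Placeholder rows that only mimic the listed column layout (assumption);
# they are not taken from the actual report.
SAMPLE_CSV = """doc_id,paragraph,page_number,paragraph_number
p1-1,First placeholder paragraph.,1,1
p1-2,Second placeholder paragraph.,1,2
p2-1,Third placeholder paragraph.,2,1
"""


def paragraphs_by_page(csv_file):
    """Return {page_number: [paragraph, ...]} with paragraphs in reading order."""
    pages = defaultdict(list)
    for row in csv.DictReader(csv_file):
        # Assumes both numbering columns are integers.
        pages[int(row["page_number"])].append(
            (int(row["paragraph_number"]), row["paragraph"])
        )
    return {
        page: [text for _, text in sorted(items)]
        for page, items in sorted(pages.items())
    }


pages = paragraphs_by_page(io.StringIO(SAMPLE_CSV))
for page, paragraphs in pages.items():
    print(page, len(paragraphs))
```

To use it on a downloaded spreadsheet, pass an open file object (for example `open(path, newline="", encoding="utf-8")`, where the path and encoding are yours to supply) in place of the in-memory sample.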

Hearing Report of the House Committee on Financial Services

The stock market plunge: what happened and what is next?

May 2010, 306 pages, parsed by page

doc_id
paragraph
page_number
paragraph_number
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)

Final Report of the Financial Crisis Inquiry Commission (FCIC)

An independent commission created by Congress to investigate the causes of the crisis

662 pages, parsed by page

doc_id
paragraph
page_number
paragraph_number
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)

First Hearing of the Financial Crisis Inquiry Commission (FCIC)

An independent commission created by Congress to investigate the causes of the crisis

248 pages, parsed by page

doc_id
paragraph
page_number
paragraph_number
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)

Meeting transcripts of the Federal Open Market Committee (FOMC)
of the Federal Reserve (2005-08)

Meeting transcripts are embargoed for five years

Parsed by meeting, speaker, speech

doc_id
speech
date
discourse
speaker
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)
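Because the transcripts are parsed by meeting, speaker, and speech, one natural first step is to count how many speeches each speaker makes per meeting. The sketch below does this, assuming the CSV header matches the field names above and that each row is one speech. The date format and the values in the discourse column are not documented here, so the sample values are placeholders only.

```python
import csv
import io
from collections import Counter

# Placeholder rows following the listed fields; the date format and the
# discourse values are assumptions, not taken from the real transcripts.
SAMPLE_CSV = """doc_id,speech,date,discourse,speaker
d1,Placeholder remark one.,2005-02-02,1,SPEAKER A
d2,Placeholder remark two.,2005-02-02,2,SPEAKER B
d3,Placeholder remark three.,2005-02-02,3,SPEAKER A
"""


def speeches_per_speaker(csv_file):
    """Count rows (speeches) for each (date, speaker) pair."""
    counts = Counter()
    for row in csv.DictReader(csv_file):
        counts[(row["date"], row["speaker"])] += 1
    return counts


counts = speeches_per_speaker(io.StringIO(SAMPLE_CSV))
for (date, speaker), n in sorted(counts.items()):
    print(date, speaker, n)
```

The same pattern should carry over to the hearing datasets, with the caveat that their listed column names are capitalized (Speaker rather than speaker).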

Congressional Hearings:
Dodd-Frank Wall Street Reform and Consumer Protection Act

62 hearings, parsed by hearing, speaker and speech

doc_id
speech
Speaker
Type
Comments
FolderName
Index
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)

Congressional Hearings:
Semi-annual Hearings on Monetary Policy before Congress

House and Senate, 2005-2010

doc_id
speech
Speaker
Type
Comments
Committee
Event
Date
Index
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)

Congressional Hearings:
Other committees related to financial regulatory reform

House Agriculture, House Energy and Commerce, Senate Agriculture, and Senate Homeland Security and Governmental Affairs hearings

17 hearings, parsed by speaker

doc_id
speech
Speaker
Type
Comments
Committee
CommitteeID
Event
Index
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)

Congressional Hearings:
Troubled Asset Relief Program (TARP)

12 hearings, parsed by hearing, speaker and speech

doc_id
speech
Speaker
Type
Comments
FolderName
Index
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)

Complete Bills of the 110th and 111th Congresses

Every version of every bill, parsed by bill section and related to additional information about the bill

doc_id
text
bill_source
internal_bill_id
internal_bill_version
internal_section_id
congress_session
bill_type
bill_number
bill_version
bill_id
sec_count
issued_on
section_sequence
section_length
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)
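Since each row is one section of one bill version, the text of a version can be rebuilt by ordering its sections. The sketch below illustrates this against SQLite, assuming that section_sequence gives the order of sections within a version and that the pair (bill_id, bill_version) identifies a version. The table name `sections`, the reduced column set, and the sample rows are hypothetical; the actual SQLite schema is not described in this listing.

```python
import sqlite3

# In-memory stand-in for the distributed SQLite file. The table name
# "sections" and the subset of columns are hypothetical.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE sections ("
    "doc_id TEXT, text TEXT, bill_id TEXT, bill_version TEXT, "
    "section_sequence INTEGER)"
)
conn.executemany(
    "INSERT INTO sections VALUES (?, ?, ?, ?, ?)",
    [
        # Placeholder rows, deliberately inserted out of order.
        ("s2", "Placeholder section two.", "bill-x", "v1", 2),
        ("s1", "Placeholder section one.", "bill-x", "v1", 1),
        ("s3", "Placeholder section for another version.", "bill-x", "v2", 1),
    ],
)


def bill_version_text(conn, bill_id, bill_version):
    """Join the sections of one bill version in section_sequence order."""
    rows = conn.execute(
        "SELECT text FROM sections "
        "WHERE bill_id = ? AND bill_version = ? "
        "ORDER BY section_sequence",
        (bill_id, bill_version),
    ).fetchall()
    return "\n\n".join(text for (text,) in rows)


full_text = bill_version_text(conn, "bill-x", "v1")
print(full_text)
```

Comparing the output for two values of bill_version is one simple way to see how a bill changed between versions.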

Enacted Version of PL 110-343,
the Emergency Economic Stabilization Act of 2008

Contains the Troubled Asset Relief Program (TARP)

Parsed by law section

doc_id
text
bill_source
internal_bill_id
internal_bill_ver_id
internal_section_id
congress_session
bill_type
bill_number
bill_version
bill_identifier
sec_count
issued_on
section_sequence
section_length
Spreadsheet (CSV)
Compact data (CSV, SQLite, plain text)
Full data (CSV, SQLite, plain text, raw documents)