This app lets you manipulate, explore, and download the CongressData dataset. Also see the associated R package.
The dataset is comprised of member-years and combines over 1,200 variables from numerous sources. Individual variable coverage varies, but the dataset includes information regarding all congressional districts between 1789-2024. The dataset tracks characteristics of congressional districts, the members of congress themselves, and the behavior of those members in policymaking.
Use the Subset & Download tab to dynamically generate downloadable portions of the data based on several user inputs. You can also generate a codebook with descriptions and citations (bib, plain text) for each of the variables you download.
If you use the dataset, please cite the original data sources. You can use this web application or our R package to do so.
If you use CongressData or this app, in addition to individual data sources that you use, please cite:
Grossmann, Matt, Caleb Lucas, and Benjamin Yoel. Introducing CongressData and Correlates of State Policy. East Lansing, MI: Institute for Public Policy and Social Research (IPPSR), 2024.
*Use 'Related To' to search a variable's name, short/long descriptions from the codebook,
and its citation for non-exact matches of a search term. For example, searching 'tax' will
return variables with words like 'taxes' and 'taxable' in any of those columns.
Use the Variable Matches tab to see a list of each of the matched variables, along with a short description. The list of variable matches updates dynamically.
Use the Download Citations & Codebook tab to download the cites/codebook associated with the variables that result from your search.
Download a CSV with the variables from your search; their citations in both BibTeX and plain text;
and information regarding their coding with the button below.
Please verify the citations are correct, as some errors may exist. Not all citations are available.
If you use this app, in addition to individual data sources that you use, please cite:
Grossmann, M., Lucas, C., McCrain, J, & Ostrander, I. (2022). CongressData. East Lansing, MI: Institute for Public Policy and Social Research (IPPSR)
Interested in adding data to CongressData? Instructions below!
The CongressData Project aims to compile, disseminate, and encourage the use of data relevant to research of the US Congress and tracks member-level variables across the states across time. We have gathered more than 900 variables from various sources and assembled them into one large, useful dataset. We hope this project will become a 'one-stop shop' for academics, policy analysts, students, and researchers looking for variables germane to the study of the US Congress.
We encourage researchers and practitioners to submit relevant data for inclusion to the dataset. Sources and citations for each variable in congressData are provided with the dataset. Data deposits should include the files detailed below, along with the submitters contact information, at a minimum. The congressData team will review the submission and, if it conforms to the dataset's standards, it will be integrated during an update.
Data File
Preferably a .csv file, but other common file types such as .dta or .rda are acceptable.
The file should contain a table comprised of rows and columns with no labels.
Preferably in a member-year panel, where each row of the table represents information regarding a specific year for a single member of a US Congress. Any collection of members and years is acceptable.
The table should contain the following panel variables at a minimum:
a common member identification code
Congress number
district number
state
year
Codebook or ReadMe
Data deposits should also contain a document that will be used to update the CongressData codebook with information regarding the variables in the submitted file.
Preferably a .txt or .csv file.
The file should contain a listing of each variable in the data deposit with the following information:
short_desc: A short description of the variable (i.e. 1-2 sentences)
long_desc: A long description of the variable (if necessary)
sources: The source/s to create the variable. This information should include all sources used in the creation of the variable.