SAGE Stats brings together over 50,000 datasets about U.S. geographies from over 190 authoritative sources harmonized into consistently formatted statistical data series. It makes data points easy to find, easy to compare, and easy to download—for novice students and advanced researchers alike.
SAGE Stats contains over 6,500 data series—or single data measures over time about U.S. geographies—across 17 high-interest research areas. The data series span as far back as 1980, and are updated on a timely basis in accordance with the collection and publication schedules of original sources. Detailed source information, with links where applicable, are provided for every data series.
Do similarly populated areas have similar incarceration rates?
Most librarians and state researchers know that SAGE Stats began in print, with the CQ Press State Fact Finder series, a statistical annual first published in 1993. In response to overwhelming research demand and the need for digitized data, SAGE converted the backfile of CQ Press State Fact Finder titles into the SAGE Stats platform, adopting its topic areas but expanding upon them to create a three-tier hierarchical taxonomy. We introduced more data measures and more geographies, including counties, central cities, and metropolitan statistical areas.
Does religious adherence correlate with other social indicators, like income or employment?
Data Cleaning and Harmonization
Public government data, in addition to privately available data, vary widely in their provision, with common pitfalls including confusing query processes (like having to browse variable by variable or state by state), static data tables, PDFs, or difficult-to-open data files. Our authors and editors carefully review and convert datasets into the same machine-readable structure. We also diligently work with government sources, who don’t make their data available to the public, to acquire their raw data behind the scenes. We clean data from all file types, and when necessary, add standard Census place codes, which are more often than not lacking in the original sources. We also translate codebooks, entirely eliminating the need for them, but still linking to them via the original source if codes are desirable. Codes are usually some combination of letters and numbers that take the place of simple data measure names. Codebooks are frequently paired with government datasets, and can be especially difficult for lower-level students to grasp, as they require further study and understanding, and sometimes multiple downloads or flipping back and forth between codebooks and data sets, depending on the preference of the user or format of the site.
Many of the data series in SAGE Stats are available nowhere else because they were calculated from raw data, including lists and records, into measures that align with typical research demands. For instance, the Office of Fair Housing and Equal Opportunity publishes records at the case level—that is, a list of every FHEO case filed in a certain year. We have taken those record lists and converted them to multiple data series aimed at typical research avenues in housing.
For more details, visit our Methodology overview page here.
How has your hometown's general level of education changed over time?
SAGE Stats allows users to visualize data in the form of series over time—in tables, maps, line graphs, and scatter plots—and download that data either in small amounts or in bulk, using the Export Center. The visualization and comparison functionality is easy to use, meant to engage novice researchers and encourage them to draw unique insights among the data. Vastly different data can be easily compared, which enables students to draw interesting correlations for papers or class assignments. More advanced researchers are encouraged to use these functions as well, but can defer to the Export Center for bulk spreadsheets, ideal for use in third party statistical programs like SPSS.
In SAGE Stats, users can:
- View every data series in a standard table format, and filter the locations in that table based on other measures about them like population or median age.
- Visualize data series on a gradient map of the United States, and view that map change over time with a time bar.
- Compare data series within geographies and across geographies, in line charts and scatter plots.
- Download data tables in XLS or CSV and visualizations in JPG or PNG.
- View and print profile pages for every location in the product. Profile pages include key demographics at a glance and the ability to compare the location to others in the United States based on measures of your choosing.
For a guided video tour of the SAGE Stats platform, click here.
SAGE Stats can be made available to your library as a whole or as separate collections, depending on the statistical needs of your patrons. Below are the collections currently available as combined or separate entities within SAGE Stats. To try one or both of these collections with your library, click here.
||Data measures for all 50 states and the District of Columbia.
||Data measures for all counties, cities, and metropolitan statistical areas in the United States.