Our population statistics cover age, sex, race, Hispanic origin, migration, ancestry, language use, veterans, as well as population estimates and projections.
The U.S. Census Bureau is the official source of statistical data tracking the national economy.
Business is a large part of America's diverse economy. This section provides key information about businesses in your community.
This section provides information on a range of educational topics, from educational attainment and school enrollment to school districts, costs and financing.
Prepare for Emergencies, Natural and Man-made Disasters using U.S. Census Bureau's data and statistics
We measure the state of the nation's workforce, including employment and unemployment levels, weeks and hours worked, occupations, and commuting.
Our statistics highlight trends in household and family composition, describe characteristics of the residents of housing units, and show how they are related.
Health statistics on insurance coverage, disability, fertility and other health issues are increasingly important in measuring the nation’s overall well-being.
We measure the housing and construction industry, track homeownership rates, and produce statistics on the physical and financial characteristics of our homes.
Income is the gauge many use to determine the well-being of the U.S. population. Survey and census questions cover poverty, income, and wealth.
The U.S. Census Bureau is the official source for U.S. export and import statistics and regulations governing the reporting of exports from the U.S.
The U.S. Census Bureau provides data for the Federal, state and local governments as well as voting, redistricting, apportionment and congressional affairs.
Geography is central to the work of the Bureau, providing the framework for survey design, sample selection, data collection, tabulation, and dissemination.
Find resources on how to use geographic data and products with statistical data, educational blog postings, and presentations.
The Geographic Support System Initiative will integrate improved address coverage, spatial feature updates, and enhanced quality assessment and measurement.
Work with interactive mapping tools from across the Census Bureau.
Find geographic data and products such as Shapefiles, KMLs, TIGERweb, boundary files, geographic relationship files, and reference and thematic maps.
Metropolitan and micropolitan areas are geographic entities used by Federal statistical agencies in collecting, tabulating, and publishing Federal statistics.
Find information about specific partnership programs and learn more about our partnerships with other organizations.
Definitions of geographic terms, why geographic areas are defined, and how the Census Bureau defines geographic areas.
We conduct research on geographic topics such as how to define geographic areas and how geography changes over time.
Tells the stories behind the numbers in a new inviting way. It features stories on various topics such as housing, employment and population.
Official audio files from the Census Bureau, including "Profile America," a daily series of bite-sized statistics, placing current data in a historical context.
Stock photos that illustrate official Census Bureau operations and activities.
Read publications, briefs and reports from Census Bureau experts.
Watch Census Bureau vignettes, testimonials, and video files.
Explore Census data with infographics and visualizations covering a broad range of topics.
Read research analyses via working papers and reports from Census Bureau experts.
Find information using interactive applications to get statistics from multiple surveys.
Developer portal to access services and documentation for the Census Bureau's APIs.
Find a multitude of DVDs, CDs and publications in print by topic.
These external sites provide more data.
The Census Bureau reuses data from other agencies to cut the cost of data collection and to reduce the burden on people who respond to our censuses and surveys.
Download extraction tools to help you get the in-depth data you need.
Learn more about our data from this collection of e-tutorials, presentations, webinars and other training materials. Sign up for training sessions.
Explore Census data with infographics and visualizations covering a broad range of topics.
If you have received a survey, this site will help you verify that the survey came from us, understand and complete the form, and know how we protect your data.
The U.S. Census Bureau is researching modern and cost-efficient methods for the population to exercise its civic obligation to be counted in the 2020 Census.
The 2017 Census Test provided the opportunity to research new methods and advanced technologies to improve data collection during the 2020 Census.
The 2018 End-to-End Census Test will take place in Pierce County, Wash.; Providence County, R.I.; and the Bluefield-Beckley-Oak Hill, W.Va., area.
National and state population totals from the 2010 Census were released on December 21, 2010
The American Community Survey is the premier source for information about America's changing population, housing and workforce.
The AHS is sponsored by the Department of Housing and Urban Development (HUD) and conducted by the U.S. Census Bureau.
The Economic Census is the U.S. government's official five-year measure of American business and the economy.
Basic enumerations of population, housing units, group quarters and transitory locations conducted by the Census Bureau at the request of a governmental unit.
Collects data and measures change for many topics including: economic well-being, family dynamics, education, assets, health insurance, and childcare.
Connect with the U.S. Census Bureau using social media.
Find interesting and quirky statistics regarding national celebrations and major events.
Stats for Stories provides story ideas highlighting the Bureau's newsworthy statistics that relate to current events, observances, holidays, and anniversaries.
See what's coming up in releases and reports.
Press Kits package information about Census Bureau topics, including media contacts, press releases and fact sheets, story ideas and multimedia resources.
Information about the U.S. Census Bureau.
Information about what we do at the U.S. Census Bureau.
Our researchers explore innovative ways to conduct surveys, increase respondent participation, reduce costs, and improve accuracy.
Learn about other opportunities to collaborate with us.
Explore prospective positions available at the U.S. Census Bureau.
Information about the current field vacancies available at the U.S. Census Bureau Regional Offices.
The regional offices are responsible for all data collection, data dissemination, and geographic operations under a new service area boundary.
Explore the rich historical background of an organization with roots almost as old as the nation.
The Census Bureau reuses data from other agencies to cut the cost of data collection and to reduce the burden on people who respond to our censuses and surveys.
Data Editing and Imputation
This section describes the data editing and imputation procedures applied to data from the Survey of Income and Program Participation (SIPP) after completion of the interviews. Three different approaches are used for dealing with missing data in SIPP:
Weighting is discussed in Chapter 8 of the SIPP Users' Guide.
The section begins with a brief discussion of the types of missing data and the goals of imputation in SIPP. It then presents an overview of the editing and imputation procedures used to deal with missing and inconsistent data. Next, the chapter provides a detailed description of each of the major steps used by the Census Bureau when creating its internal files and the files that are released for public use. Prior to 1996 the development of cross-sectional wave files involved mainly cross-sectional editing and imputation. The longitudinal files involved longitudinal editing. Beginning with the 1996 Panel, the processing procedures for the wave files were replaced with methods that use prior wave information to inform the editing and imputation of a current wave (after Wave 1). The generic imputation technique, that is, the hot-deck method, is still used in the 1996+ Panels, but the donors are now chosen on the basis of similarities in reported prior wave information when that reported information exists.
As in all surveys, there are two general types of missing data in SIPP: unit nonresponse and item nonresponse. Unit nonresponse occurs in SIPP when one or more of the people residing at a sample address are not interviewed and no proxy interview is obtained. This can happen for a number of reasons, described in Chapter 2 of the SIPP Users' Guide. Most types of unit nonresponse are dealt with through weighting adjustments (see Chapters 2 and 8 of the SIPP Users' Guide). However, the data editing and statistical imputation procedures described in this chapter are used with one type of unit nonresponse: Type Z noninterviews, which occur when an interview is obtained from at least one household member but interviews are not obtained from one or more other sample persons in that household.1 Prior to the 1996 Panel and in some instances in the 1996 Panel, the method used to adjust for person-level noninterviews in the core wave files is known as Type Z imputation, which is discussed below.
Item nonresponse occurs when a respondent completes most of the questionnaire but does not answer one or more individual questions. Item nonresponse data in SIPP occur under the following circumstances:
Missing data cause a number of problems: analyses of data sets with missing data are more problematic than analyses of complete data sets; there is a lack of consistency among analyses because analysts compensate for missing data in different ways and their analyses may be based on different subsets of data; and, in the presence of nonresponse that is unlikely to be completely random, estimates of population parameters are biased.
Because missing data are always present to some degree, analyses of survey data must be based on assumptions about patterns of missing data. When missing data are not imputed or otherwise accounted for in the model being estimated, the implicit assumption is that data are missing at random after controlling for other variables in the model. The imputation procedures used for SIPP are based on the assumption that data are missing at random within subgroups of the population. The statistical goal of imputation is to reduce the bias of survey estimates. This goal is achieved to the extent that systematic patterns of item nonresponse are correctly identified and modeled. In SIPP, the statistical goals of imputation are general, rather than specific. Instead of addressing the estimation of specific parameters, SIPP procedures are designed to provide reasonable estimates for a variety of analytical purposes.
Data editing is generally preferred over statistical imputation, and it is used whenever a missing item can be logically inferred from other data that have been provided. When information exists on the same record from which missing information can logically be inferred, that information is used to replace the missing information. The advantage of data editing is that it avoids the increase in variance that occurs when missing items on one record are imputed with nonmissing responses from other records.
Users of SIPP data interested in assessing the influence of imputed data on their analyses should consider whether SIPP imputation procedures have properties that affect their specific analytical requirements. A general discussion of the treatment of missing data in sample surveys is given in Kalton and Kaspyrzyk (1986). Sedransk (1985), Little (1986), and Jinn and Sedransk (1987) discuss properties of commonly used imputation processes. An example of the impact of imputation procedures on the distributional characteristics of a low-income population is discussed in Doyle and Dalrymple (1987).
An evaluation of the effects of imputed data should include a review of rates of unit nonresponse and an assessment of the extent of item nonresponse. Unit nonresponse tends to increase over the life of a panel, as does the likelihood that nonresponse is not a random effect. And as the percentage of eligible sample members re-interviewed decreases, the pool from which donors3 are selected shrinks accordingly. This smaller pool of donors leads to an increased likelihood that individual donors will be used more than once, which in turn increases the variance of an estimate.
The effects of imputation will likely be small for items with low rates of missing data as long as rates of item nonresponse are not high among important subclasses. Lepkowski et al. (1987), using data from a large federal survey, provide a framework for evaluating the effect of imputed values on analyses. This framework can be readily adapted to SIPP analyses.
At the conclusion of each wave of interviewing, the data collected during that wave are processed, creating the core wave and topical module files.
Figure 4-1 illustrates the steps that generate the Census Bureau's internal core wave and full panel files.
There are six steps in the first phase of SIPP data processing:
These steps are repeated at the conclusion of each wave of interviews. Prior to the 1996 Panel, each wave was processed independently of other waves of data. Thus, when multiple core wave files are linked, apparent changes in a respondent's status could be due to different applications of data edits and imputations to the files being combined (file linkage is the subject of Chapter 13 of the SIPP Users' Guide). With the 1996 data, the hot-deck procedure was redesigned to rely on historical information reported in prior waves. In addition, other forms of longitudinal imputation, such as carryover methods, were adapted.
All of the editing and imputation procedures described in the preceding sections are part of the process of preparing the data for internal Census Bureau use. Before the files are released for public use, they undergo additional editing to protect the confidentiality of respondents. Two procedures are used: topcoding of selected variables (income, assets, and age) and suppression of geographic information. Because of these procedures, estimates based on data from the public use files will differ slightly from the Census Bureau's published estimates.
One piece of information that might reveal a respondent's identity is a very high income. For that reason, the Census Bureau topcodes income before making that information publicly available, recoding any income amounts over a certain maximum value to that maximum. In other words, income on the public use data files has a ceiling value. Although income is the primary variable that is topcoded, other variables that may disclose a respondent's identity, such as age, are also topcoded. A few variables, such as starting dates for employment, may be bottomcoded if they pose a disclosure risk.
Geographic information that can be used to directly identify survey respondents, such as an address, is removed from the public use files. In addition, states and metropolitan areas with populations less than 250,000 are not identified. Specific nonmetropolitan areas (such as counties outside of metropolitan areas) are never identified. In certain states, when the nonmetropolitan population is small enough to present a disclosure risk, a fraction of that state's metropolitan sample is recoded to nonmetropolitan status. For that reason, the SIPP data cannot be used to estimate characteristics of the population residing outside metropolitan areas. Chapter 10 of the SIPP Users' Guide provides details.
For the 1996 Panel, state-level geography is shown for 45 states and the District of Columbia. The remaining five states are combined as follows:
For the 1984 through 1993 Panels, state-level geography is shown for 41 individual states and the District of Columbia; the nine other states are combined into three groups:
By selecting this link you will leave www.census.gov. Please check the Privacy Policy of the site you are visiting.