What Is A Contingency Table

Explore contingency tables, a statistical tool for displaying and analyzing the relationship between two or more categorical variables, used widely in data analysis and research.

Have More Questions →

Definition of a Contingency Table

A contingency table, also known as a cross-tabulation or crosstab, is a statistical tool used to display and analyze the relationship between two or more categorical variables. It presents frequency distributions of variables in a matrix format, where rows and columns correspond to different categories of the variables being examined.

Structure and Components

Typically, a contingency table has rows representing the categories of one variable and columns representing the categories of another. Each cell within the table contains the count or frequency of observations that fall into both the corresponding row category and column category. The margins of the table display the total frequencies for each category of the individual variables.

Practical Example: Survey Analysis

Consider a survey asking respondents about their preferred mode of transportation (Car, Bus, Train) and their gender (Male, Female). A contingency table would show how many males prefer cars, how many females prefer buses, and so on. For instance, if 150 males chose 'Car' and 100 females chose 'Car', these counts would populate the relevant cells, allowing for a clear visual comparison.

Importance and Applications

Contingency tables are crucial for exploring associations or dependencies between categorical variables. They are widely used in social sciences, market research, and medical studies to perform tests like the Chi-squared test, which determines if there's a statistically significant relationship between the variables. This helps researchers understand patterns and make informed decisions.

Frequently Asked Questions

What is the primary purpose of a contingency table?
What kind of data does a contingency table use?
Can a contingency table show more than two variables?
What statistical test is often performed using a contingency table?