Central tendency refers to a statistical measure that identifies the central or typical value within a dataset. When large amounts of numerical data are collected, it becomes difficult to interpret individual values. Central tendency provides one representative value—called an average—that summarizes the entire dataset. This representative value lies close to the center of the data distribution and helps in comparison, interpretation, and further statistical calculations.
Why Central Tendency Is Important
- Summarizes large datasets into a single meaningful value.
- Helps compare two or more groups quickly.
- Acts as a foundation for further statistical analysis.
- Useful for understanding the distribution of biological and clinical measurements.
Main Measures of Central Tendency
There are three major measures of central tendency:
- Mean
- Median
- Mode
1. Mean
The mean is the most commonly used average. It is calculated by dividing the sum of all observations by the number of observations. There are three types of means:
a) Arithmetic Mean
- Individual Series (Ungrouped Data): The mean is calculated by adding all values and dividing by the total number of values.
- Discrete Series: Frequencies are multiplied by their respective values before dividing by total frequency.
- Continuous Series: Midpoints of class intervals are used to calculate the mean.
b) Geometric Mean
The geometric mean is the nth root of the product of n values. It is useful for data involving rates, growth, or ratios.
c) Harmonic Mean
The harmonic mean is the reciprocal of the average of reciprocals. It is often used when dealing with speed, rates, or ratios.
2. Median
The median is the middle-most value when the data is arranged in ascending or descending order. It is especially useful when data contains extreme values (outliers), because unlike the mean, the median is not influenced by unusually high or low observations.
Properties of the median:
- Divides the dataset into two equal halves.
- Always lies between the highest and lowest values.
- Best measure of central tendency for skewed distributions.
3. Mode
The mode is the value that appears most frequently in a dataset. A dataset may have one mode (unimodal), two modes (bimodal), or more than two modes (multimodal).
- Useful for categorical or qualitative data.
- Represents the most typical or common value.
- Not affected by extreme observations.
Relationship Between Mean, Median, and Mode
In a perfect normal distribution (symmetrical bell-shaped curve):
- Mean = Median = Mode
In a positively skewed distribution:
- Mean > Median > Mode
In a negatively skewed distribution:
- Mode > Median > Mean
Choosing the Right Measure
The best measure of central tendency depends on:
- Nature of data: continuous, discrete, or categorical
- Presence of outliers: median is preferred if extreme values exist
- Purpose of analysis: mean for average calculation, mode for common value identification
Detailed Notes:
For PDF style full-color notes, open the complete study material below:
