This MCQ module is based on: Frequency Curves, Bivariate Distribution & Exercises
Frequency Curves, Bivariate Distribution & Exercises
This assessment will be based on: Frequency Curves, Bivariate Distribution & Exercises
Upload images, PDFs, or Word documents to include their content in assessment generation.
Frequency Curves, Bivariate Distributions and NCERT Exercises
Once data are arranged into classes, two new questions appear. Can we see the distribution as a picture? And what happens when we collect two variables from the same person — sales and ad-spend, height and weight, marks and study-hours? This part draws frequency curves, builds bivariate tables, examines the cost of grouping (loss of information), and walks through every NCERT exercise with worked answers.
3.9 Drawing the Distribution — Frequency Curve, Polygon and Histogram
A table of frequencies is precise, but a picture reveals the shape of the distribution at a glance. NCERT mentions three closely related graphic forms.
3.9.1 The Frequency Curve of Example 4 (Equal Classes)
NCERT's Fig 3.1 plots the ten class marks (5, 15, 25, 35, 45, 55, 65, 75, 85, 95) against the ten frequencies (1, 8, 6, 7, 21, 23, 19, 6, 5, 4) and joins them. The curve rises steeply between marks 35 and 55, peaks around 55, and then falls — confirming the bell-like concentration in the middle of the distribution.
3.9.2 The Histogram of Example 4
The same data plotted as a histogram. Each rectangle covers exactly one class interval (10 marks wide). Heights record the class frequencies. Because adjacent classes share a common boundary, the rectangles touch — emphasising the continuity of the underlying variable.
3.9.3 The Frequency Polygon — Same Data, Straight Lines
Connect the same ten (class mark, frequency) points by straight segments and close the figure at the X-axis. The result is the frequency polygon. It is the easiest graph to overlay if you ever want to compare two distributions on the same axes.
3.9.4 The Effect of Unequal Classes — NCERT Table 3.7
NCERT splits the busy middle classes (40–50, 50–60, 60–70) of Table 3.6 into narrower 5-mark classes (40–45, 45–50, 50–55, 55–60, 60–65, 65–70). The new class marks (42.5, 47.5, 52.5, 57.5, 62.5, 67.5) sit closer to the actual values inside, so the distribution becomes more representative. The frequency curve in Fig 3.2 looks slightly different from Fig 3.1 — the same data, but better resolution where it matters.
| Class | Frequency | Class Mark |
|---|---|---|
| 0–10 | 1 | 5 |
| 10–20 | 8 | 15 |
| 20–30 | 6 | 25 |
| 30–40 | 7 | 35 |
| 40–45 | 9 | 42.5 |
| 45–50 | 12 | 47.5 |
| 50–55 | 7 | 52.5 |
| 55–60 | 16 | 57.5 |
| 60–65 | 10 | 62.5 |
| 65–70 | 9 | 67.5 |
| 70–80 | 6 | 75 |
| 80–90 | 5 | 85 |
| 90–100 | 4 | 95 |
| Total | 100 |
3.10 Loss of Information — The Hidden Cost of Grouping
Classification has a price. Once raw data are summarised into classes, the individual observations vanish from view. NCERT highlights this as loss of information?.
Consider class 20–30 in Example 4. The actual values inside are 25, 25, 20, 22, 25 and 28. After grouping, the table shows only "frequency = 6" and the class mark 25 — the six original numbers are gone. Every further calculation (mean, variance, standard deviation) treats all six observations as if they equalled exactly 25. The actual spread between 20 and 28 is lost.
What we lose: the precise value of every observation. Statistics calculated from a frequency distribution are based on class marks, not on the actual numbers.
The good news: the loss is small if the class marks are chosen so that observations cluster around them. That is exactly why NCERT prefers unequal classes (Table 3.7) where the middle of the data is busy — narrower classes there bring the class mark closer to the actual values.
3.11 Frequency Array — Frequency Distribution for Discrete Variables
Everything so far assumed a continuous variable (marks of students). When the variable is discrete, classification looks different. Since a discrete variable does not take fractional values between two adjacent integers, the natural unit is the integer itself, not an interval. The result is called a frequency array?.
NCERT's Table 3.8 illustrates the idea with the size of household — a discrete variable that only takes whole-number values 1, 2, 3, 4, …
| Size of the Household | Number of Households |
|---|---|
| 1 | 5 |
| 2 | 15 |
| 3 | 25 |
| 4 | 35 |
| 5 | 10 |
| 6 | 5 |
| 7 | 3 |
| 8 | 2 |
| Total | 100 |
There are no class intervals in a frequency array — each integer value of the variable is its own row. The most common household size is 4 (35 households), and the distribution clearly tails off after that.
3.12 Bivariate Frequency Distribution — Two Variables Together
Real research rarely confines itself to a single variable. A market researcher samples 20 firms and records both their sales and their advertisement expenditure. A school teacher records each student's marks and weekly study hours. We have bivariate data — two variables observed for each unit. The summary tool is the bivariate frequency distribution?, also called a two-way table.
3.12.1 NCERT Example — Sales and Advertisement Expenditure of 20 Firms
NCERT's Table 3.9 cross-classifies 20 firms by their sales (in lakh ₹) and advertisement expenditure (in thousand ₹). Sales are split into six column classes; advertisement expenditure into five row classes. Each cell shows how many firms fall into that particular (sales, ad-spend) combination.
| Ad ↓ / Sales → | 115–125 | 125–135 | 135–145 | 145–155 | 155–165 | 165–175 | Total |
|---|---|---|---|---|---|---|---|
| 62–64 | 2 | 1 | — | — | — | — | 3 |
| 64–66 | 1 | 3 | — | — | — | — | 4 |
| 66–68 | 1 | 1 | 2 | 1 | — | — | 5 |
| 68–70 | — | 2 | — | 2 | — | — | 4 |
| 70–72 | — | 1 | 1 | — | 1 | 1 | 4 |
| Total | 4 | 5 | 3 | 3 | 1 | 1 | 20 |
How to read it. The cell at row "64–66" and column "125–135" has frequency 3. That means three of the 20 firms have advertisement expenditure between ₹64,000 and ₹66,000 and sales between ₹125 lakh and ₹135 lakh. The row totals (3, 4, 5, 4, 4) give the marginal distribution of advertisement expenditure; the column totals (4, 5, 3, 3, 1, 1) give the marginal distribution of sales. Both marginals add up to the grand total, 20.
- How many firms have ad expenditure between ₹68,000 and ₹70,000?
- How many firms have sales between ₹135 lakh and ₹145 lakh?
- How many firms have both sales above ₹155 lakh and ad expenditure above ₹70,000?
- What share of firms lies in the cell (62–64, 115–125)?
(2) Column total of 135–145 = 3 firms.
(3) Cells (70–72, 155–165) and (70–72, 165–175) each have frequency 1 → 2 firms.
(4) Cell (62–64, 115–125) = 2 firms out of 20 = 10%.
3.13 Recap — The Whole Workflow at a Glance
- Classification brings order to raw data.
- A frequency distribution shows how the values of a variable are distributed across classes, with their frequencies.
- Either the upper limit or the lower limit is excluded in the exclusive method; both limits are included in the inclusive method.
- After grouping, statistical calculations are based on class marks, not the original observations — this is the loss of information.
- Classes should be set so that the class mark lies as close as possible to the values inside.
- For a discrete variable, the analogue of the frequency distribution is the frequency array.
- For two variables observed together, the bivariate frequency distribution presents joint frequencies in a two-way table.
3.14 Worked CBQ — Reading a Frequency Distribution
📊 Case-Based Question — The Cell-Phone Survey
| Number of Cell Phones | Tally | Number of Households |
|---|---|---|
| 0 | / | 1 |
| 1 | |||| /// | 8 |
| 2 | |||| |||| /// | 13 |
| 3 | |||| |||| // | 12 |
| 4 | |||| | 5 |
| 5 | // | 2 |
| 6 | // | 2 |
| 7 | // | 2 |
| Total | 45 |
Choose: (A) Both A and R are true and R is the correct explanation of A. (B) Both A and R are true but R is not the correct explanation of A. (C) A is true, R is false. (D) A is false, R is true.
3.15 NCERT End-of-Chapter Exercises — With Model Answers
Click "Show Answer" under each question for the worked solution paraphrased from the NCERT explanation.
(a) the average of the upper and lower class limits (b) the product of upper and lower class limits (c) the ratio of upper and lower class limits (d) None.
(ii) The frequency distribution of two variables is known as: (a) Univariate (b) Bivariate (c) Multivariate (d) None.
(iii) Statistical calculations in classified data are based on: (a) actual observations (b) upper class limits (c) lower class limits (d) class midpoints.
(iv) Range is the: (a) difference between the largest and smallest observations (b) difference between the smallest and the largest (c) average of the largest and smallest (d) ratio of the largest to the smallest.
(ii) (b) — bivariate distribution.
(iii) (d) — class midpoints (class marks).
(iv) (a) — Range = largest − smallest. Option (b) is wrong because subtracting smallest from largest gives a negative; the standard definition uses (a).
Continuous variable: takes any numerical value within a range — whole, fractional, or irrational. Between any two values another value always exists. Examples: height (90 cm, 90.85 cm, 90.853 cm…), weight, time, distance.
Discrete variable: changes only by finite jumps; cannot take a value between two adjacent permitted values. Examples: number of students in a class (25 or 26, never 25.5), number on a dice. A discrete variable need not be a whole number — X = 1/8, 1/16, 1/32, … is discrete because it jumps from one fraction to the next without passing through values in between.
Inclusive method: both the lower and upper limits are included in the same class. Successive classes appear with a visible gap, e.g., 0–10, 11–20, 21–30. Used commonly for discrete variables. For continuous variables, an adjustment of ±0.5 (or whatever half the gap is) is applied to restore continuity for graphing — exactly the operation NCERT performs in moving from Table 3.4 to Table 3.5.
(i) Obtain the range of monthly household expenditure on food.
(ii) Divide the range into appropriate class intervals and obtain the frequency distribution.
(iii) Find the number of households whose monthly expenditure on food is (a) less than ₹2000 (b) more than ₹3000 (c) between ₹1500 and ₹2500.
(i) Range = 5090 − 1007 = ₹4083.
(ii) Choose 9 classes of width ₹500 (so 9 × 500 = 4500 covers the range comfortably). A typical exclusive frequency distribution looks like:
| Expenditure (₹) | Frequency |
|---|---|
| 1000–1500 | 22 |
| 1500–2000 | 13 |
| 2000–2500 | 6 |
| 2500–3000 | 3 |
| 3000–3500 | 3 |
| 3500–4000 | 1 |
| 4000–4500 | 1 |
| 4500–5000 | 0 |
| 5000–5500 | 1 |
| Total | 50 |
(b) More than ₹3000 — add classes 3000–3500 (3) + 3500–4000 (1) + 4000–4500 (1) + 4500–5000 (0) + 5000–5500 (1) → 6 households.
(c) Between ₹1500 and ₹2500 — add classes 1500–2000 (13) and 2000–2500 (6) → 19 households.
Note: answers may vary slightly depending on chosen class width and exclusive/inclusive convention. The method is what matters.
| Number of Cell Phones | Tally | Number of Families |
|---|---|---|
| 0 | / | 1 |
| 1 | |||| /// | 8 |
| 2 | |||| |||| /// | 13 |
| 3 | |||| |||| // | 12 |
| 4 | |||| | 5 |
| 5 | // | 2 |
| 6 | // | 2 |
| 7 | // | 2 |
| Total | 45 |
Why classified data wins:
- It condenses unmanageable rows of numbers into a compact, comprehensible table.
- It reveals the shape of the distribution — peaks, tails, concentration — at a glance.
- Comparisons across groups become straightforward (e.g., comparing two schools' marks distributions).
- Statistical formulas for mean, median, mode, variance and so on can be applied cleanly.
Bivariate frequency distribution presents the joint frequency distribution of two variables — for example, sales and advertisement expenditure of 20 firms (NCERT Table 3.9). One variable's classes label the columns, the other's the rows; each cell shows the number of units that fall into both a given column class and a given row class. It captures relationships between two variables and is the starting point for the study of correlation.
| Class (Inclusive) | Tally | Frequency |
|---|---|---|
| 1–7 | |||| |||| //// | 14 |
| 8–14 | |||| |||| // | 12 |
| 15–21 | |||| |||| //// | 14 |
| 22–28 | |||| //// | 9 |
| 29–35 | |||| // | 7 |
| 36–42 | // | 2 |
| Total | 58 |
| Number of Letters | Tally | Number of Words |
|---|---|---|
| 3 | |||| | 4 |
| 4 | // | 2 |
| 5 | /// | 3 |
| Total | 9 |
Pull out your old mark-sheets — half-yearly and annual maths marks from your previous classes. Arrange them year-wise. Is "marks in maths" a variable? How does the variable behave over time? Have you improved? (NCERT closes the chapter with this self-reflective activity.)
Frequently Asked Questions — Frequency Curves, Bivariate Distributions and NCERT Exercises
What is a bivariate frequency distribution in NCERT Class 11 Statistics?
A bivariate frequency distribution is a two-way table that records the joint frequencies of two variables observed on the same set of units, such as the height and weight of students or the marks in maths and economics. NCERT Class 11 Statistics Chapter 3 Part 2 explains that one variable is placed along the rows and the other along the columns, with each cell showing the number of observations falling in that pair of class intervals. The row and column totals (called marginal distributions) recover the univariate frequency distributions of each variable separately.
What is cumulative frequency and how is it calculated in Class 11?
Cumulative frequency is the running total of frequencies as you move down (or up) the classes of a frequency distribution. NCERT Class 11 Statistics Chapter 3 Part 2 explains two forms: less-than cumulative frequency adds frequencies progressively from the lowest class upward, telling you how many observations are below the upper limit of each class; more-than cumulative frequency adds from the highest class downward. To calculate it, simply keep a running total of the simple frequency column. Cumulative frequencies are essential for finding the median, quartiles and percentiles, and for drawing the less-than and more-than ogives.
What is the difference between less-than ogive and more-than ogive?
A less-than ogive is the cumulative frequency curve plotted by taking upper class limits on the X-axis and less-than cumulative frequencies on the Y-axis; the curve rises from left to right. A more-than ogive plots lower class limits against more-than cumulative frequencies; the curve falls from left to right. NCERT Class 11 Statistics Chapter 3 Part 2 explains that the X-coordinate of the point where the two ogives intersect gives the median of the distribution graphically, making ogives a powerful tool for both visualisation and median estimation in grouped data.
What are the main types of frequency curves in NCERT Class 11 Statistics?
NCERT Class 11 Statistics Chapter 3 Part 2 lists four main types of frequency curves: a symmetrical curve, where values are evenly distributed on both sides of the mean (the bell shape); a moderately skewed curve, which has a longer tail on one side (positive or negative skew); a J-shaped or reverse-J curve, common in income distributions where most observations cluster at one extreme; and a U-shaped curve, where extremes are common but the middle is rare. A bimodal curve has two peaks. The shape of the curve immediately reveals key features of the data without further calculation.
How do you draw a histogram from a frequency distribution in Class 11 Statistics?
To draw a histogram, plot the class intervals on the X-axis and the frequencies on the Y-axis, then draw adjacent rectangles with no gaps between them, where each rectangle's width equals the class width and its height equals the frequency. NCERT Class 11 Statistics Chapter 3 Part 2 explains that for unequal class widths the height must be the frequency density (frequency divided by class width) so that the area of each rectangle is proportional to its frequency. Histograms are used only for continuous variables with exclusive class intervals; for discrete data a bar diagram is used instead.
What is the relationship between a frequency polygon and a histogram?
A frequency polygon is a line graph that joins the midpoints of the tops of all the rectangles in a histogram, with the curve closing on the X-axis at the midpoints of the imaginary classes just before the first and just after the last. NCERT Class 11 Statistics Chapter 3 Part 2 explains that the polygon and histogram convey the same information about the distribution shape but the polygon is smoother and is preferred when comparing two or more distributions on the same axes. A frequency polygon does not require a histogram to be drawn first if midpoints of class intervals are known.