This MCQ module is based on: Sources of Data, Survey Methods & Questionnaire Design
Sources of Data, Survey Methods & Questionnaire Design
This assessment will be based on: Sources of Data, Survey Methods & Questionnaire Design
Upload images, PDFs, or Word documents to include their content in assessment generation.
Sources and Methods of Data Collection
Every statistical insight begins with a deceptively simple step: collecting the data. Where do numbers come from? How does a researcher walk up to strangers and gather honest answers about price, income, opinion or behaviour? And why does the wording of a single question decide whether the data is gold or garbage? This part introduces primary vs secondary sources, the three modes of survey, and the craft of writing an unbiased questionnaire.
2.1 Why Do We Collect Data?
In Chapter 1 you learned what economics is and why statistics matters to it. The next obvious question is: where do all these numbers come from? The aim of data collection is to assemble enough credible evidence so that an economic problem can be analysed and solved. Without good data, every conclusion is a guess; with good data, even small problems can be tackled with confidence.
Consider this NCERT example of how data make a story visible. Indian food-grain output rose from 108 million tonnes in 1970-71 to 132 million tonnes in 1978-79, briefly slipped back to 108 mn t in 1979-80, and then climbed steadily to 252 million tonnes in 2015-16 and 272 million tonnes in 2016-17. The numbers themselves are the production of food grain — but what they reveal is a country slowly winning its long battle against hunger.
| Year (X) | Production in million tonnes (Y) |
|---|---|
| 1970–71 | 108 |
| 1978–79 | 132 |
| 1990–91 | 176 |
| 1997–98 | 194 |
| 2001–02 | 212 |
| 2015–16 | 252 |
| 2016–17 | 272 |
Each year (X) and each output figure (Y) is a single observation?; together they form the data. Because the values change from year to year, X and Y are called variables?. Variables are usually written using the letters X, Y or Z. Every individual reading of a variable is one observation.
2.2 Where Does Data Come From? Primary vs Secondary Sources
Statistical data flows from two distinct sources. You can either go out and collect numbers yourself — by asking, observing, measuring — or you can lift numbers that someone else has already collected, cleaned and published.
2.3 How Do We Collect Primary Data? The Survey Method
How does a manufacturer know whether shoppers like a new shampoo? How does a political party gauge a candidate's popularity? They run a survey — a structured way of gathering information from a group of people. The point of a survey is to describe characteristics of a product (price, quality, usefulness) or of a person (popularity, honesty, loyalty), and the raw fuel of every survey is data.
2.3.1 The Instrument: Questionnaire / Interview Schedule
The most common survey instrument is the questionnaire? (sometimes called an interview schedule). It is either filled by the respondent themselves (self-administered) or read out by a trained researcher / enumerator who records the replies. A badly written questionnaire will collect badly distorted data, no matter how big the sample. So writing one is half art, half science. NCERT lists six rules of thumb.
(2) Use simple, unambiguous language.
(3) Order questions so the respondent feels comfortable; move from general to specific.
(4) Make every question precise and clear.
(5) Avoid double negatives ("Wouldn't you...", "Don't you...").
(6) Avoid leading questions and questions that hint at an answer.
2.3.2 Poor Questions vs Good Questions — Five Worked Examples
NCERT pairs a poorly framed question with a corrected version. Each pair illustrates one of the rules above.
(a) Order — general before specific:
(i) Is the increase in electricity charges justified?
(ii) Is the electricity supply in your locality regular?
(i) Is the electricity supply in your locality regular?
(ii) Is the increase in electricity charges justified?
Why? Asking the broad fact-of-life question first relaxes the respondent and produces honest answers to the loaded follow-up.
(b) Precision — drop vague phrases:
What percentage of your income do you spend on clothing in order to look presentable?
What percentage of your income do you spend on clothing?
Why? "Look presentable" is subjective and biases the answer. Strip such phrases.
(c) Avoid ambiguity — quantify the response options:
Do you spend a lot of money on books in a month?
(Tick one) How much do you spend on books in a month?
(i) Less than ₹200 (ii) ₹200–300 (iii) ₹300–400 (iv) More than ₹400
Why? "A lot of money" means different things to different people. Range buckets standardise the reply.
(d) Avoid double negatives:
Don't you think smoking should be prohibited?
Do you think smoking should be prohibited?
Why? Phrases beginning with "Wouldn't you" or "Don't you" push the respondent towards agreement and produce biased data.
(e) Avoid leading questions:
How do you like the flavour of this high-quality tea?
How do you like the flavour of this tea?
Why? Calling the tea "high-quality" tells the respondent what to think before they answer.
(f) Avoid questions that suggest the alternatives:
Would you like to do a job after college or be a housewife?
What would you like to do after college?
Why? Pinning the respondent to two alternatives hides every other career or life choice.
Below are three poorly framed survey questions. Identify which design rule each one violates and rewrite it as a good question.
- "Don't you agree that online classes are worse than offline classes?"
- "Do you spend a reasonable amount on monthly internet recharge?"
- "Will you become a doctor or an engineer after Class 12?"
(2) Violation: ambiguous "reasonable". Rewrite: "How much do you spend on internet recharge per month? (a) Less than ₹200 (b) ₹200–500 (c) ₹500–1000 (d) More than ₹1000."
(3) Violation: forced alternatives. Rewrite: "What career do you wish to pursue after Class 12?"
2.3.3 Closed-Ended vs Open-Ended Questions
Once a question is well-worded, the next design choice is the response format. There are two broad families.
Closed-ended questions are preferred when the researcher already knows the universe of likely answers. Open-ended questions are preferred when the goal is to discover new viewpoints — for example, "What is your view about globalisation?"
2.4 The Three Modes of Data Collection
Once the questionnaire is ready, how does it actually reach a respondent? NCERT identifies three classic modes.
2.4.1 Personal Interviews
The investigator meets the respondent face-to-face. This works when the entire population (or sample) is reachable in person. The interviewer can explain difficult words, expand on important answers, and read the respondent's body language for extra clues.
✅ Advantages
- Highest response rate
- All types of questions can be used
- Best for open-ended questions
- Ambiguous wording can be clarified on the spot
❌ Disadvantages
- Most expensive — needs trained interviewers
- Possibility of the interviewer influencing the respondent
- Takes the longest time
2.4.2 Mailing (Mailed) Questionnaires
The questionnaire is posted to each respondent with a request to fill it in and return it by a deadline. Today this also includes online forms and SMS surveys.
✅ Advantages
- Least expensive of the three
- Reaches respondents in remote areas
- No interviewer to influence answers — preserves anonymity, best for sensitive topics
- Respondent has time to think before replying
❌ Disadvantages
- Cannot be used by illiterate respondents
- Long response time
- No one to clarify ambiguous questions
- Reactions of the respondent cannot be observed
- Low response rates — forms get lost, returned blank, or never returned
2.4.3 Telephone Interviews
The investigator asks the questions over a phone call. This is faster and cheaper than personal interview, allows clarification, and works well when respondents are shy about face-to-face questioning on a sensitive topic. The catch: many people may not own a telephone.
✅ Advantages
- Relatively low cost
- Comparatively less interviewer influence
- Reasonably high response rate
- Quicker than personal interview
- Better for questions respondents are reluctant to answer in person
❌ Disadvantages
- Limited use — requires the respondent to own a telephone
- Reactions cannot be watched
- Some interviewer influence still possible
2.4.4 Comparison Table — Mode at a Glance
| Feature | Personal Interview | Mailed Questionnaire | Telephone Interview |
|---|---|---|---|
| Cost | Highest | Lowest | Moderate |
| Time per interview | Long | Long (waiting for return) | Short |
| Response rate | Highest | Lowest | High |
| Reaches remote areas | Difficult | Easy (with postal/online access) | Only if telephone available |
| Suitable for illiterate respondents | Yes | No | Yes |
| Risk of interviewer bias | High | None | Moderate |
| Best use of open-ended questions | Excellent | Limited | Limited |
| Sensitive / personal questions | Difficult | Best (anonymity) | Reasonable |
2.5 Pilot Survey — Test Before You Trust
Before unleashing the final questionnaire on hundreds of respondents, a researcher runs a small trial called a Pilot Survey? (also known as pre-testing). A pilot is a miniature version of the planned survey on a small group, used to expose flaws before scaling up.
A well-run pilot answers four practical questions:
- Are the questions intelligible? Do respondents misread, skip or struggle with any item?
- Are the instructions clear? Can a respondent follow the form without help?
- Are the enumerators well-trained? Are they recording answers consistently?
- What will the full survey cost in money and time? The pilot gives the first realistic estimate.
Imagine you have to find out which snack the school canteen should add next term. You have written a 10-question questionnaire for 600 students. Design a 30-student pilot survey: who do you pick, what do you watch for, and what changes would you make to the questionnaire after the pilot?
- You have to collect information from a person who lives in a remote village in India. Which mode of data collection will you use, and why?
- You have to interview the parents about the quality of teaching in a school. If the school principal is present in the room, what kinds of problems can arise?
(2) The principal's presence creates an authority bias — parents may hide complaints out of fear that their child will be penalised, or they may praise the school excessively. The interviewer cannot guarantee anonymity. Ideally the parents should be interviewed away from the school in a neutral setting.
2.6 Bringing It Together — A Quick Worked Application
📊 Case-Based Question — The Filmstar Popularity Survey
Choose: (A) Both A and R are true and R is the correct explanation of A. (B) Both A and R are true but R is not the correct explanation of A. (C) A is true, R is false. (D) A is false, R is true.
Frequently Asked Questions — Sources and Methods of Data Collection
What is the difference between primary and secondary data in NCERT Class 11 Statistics?
Primary data is information collected first-hand by the investigator for a specific study, while secondary data is information already collected and published by another agency that is re-used for a new investigation. NCERT Class 11 Statistics Chapter 2 explains that primary data is more reliable and tailored to the research question but is expensive and time-consuming to collect, whereas secondary data from sources like NSSO, RBI, Census of India and the Economic Survey is faster and cheaper but may be outdated, in different definitions, or not perfectly suited to the new question. The investigator must judge fitness for purpose before using any secondary source.
What are the three main methods of data collection in Class 11 Statistics?
The three main methods of primary data collection in NCERT Class 11 Statistics Chapter 2 are personal interview, mailed questionnaire and telephone interview. A personal interview gives high response rates and clarification but is costly and slow; a mailed questionnaire reaches a wide geographic area cheaply but suffers from low response and unclear answers; a telephone interview is fast and inexpensive but excludes people without phones and limits how complex the questions can be. The investigator chooses the method based on budget, speed, coverage required and the literacy level of respondents.
How do you design a good questionnaire in NCERT Class 11 Statistics?
A good questionnaire in NCERT Class 11 Statistics Chapter 2 is short, simple and uses easy-to-understand language with no leading or ambiguous questions. Questions should follow a logical sequence, move from easy to difficult, and avoid personal or sensitive items unless essential. Use multiple-choice or yes/no formats where possible, define units clearly, and avoid double-barrelled questions that ask two things at once. Always pilot the questionnaire on a small sample first, refine the wording based on confusion or non-response, and then deploy the final version for the main survey.
What is a pilot survey and why is it important in data collection?
A pilot survey is a small trial run of the questionnaire on a few respondents before the main study is conducted, and it is important because it reveals problems that would otherwise spoil the entire investigation. NCERT Class 11 Statistics Chapter 2 explains that a pilot survey checks whether questions are understood correctly, whether the time taken is reasonable, whether response options are exhaustive, and whether sensitive items cause non-response. Issues uncovered in the pilot are fixed before the full survey, saving cost and improving the quality of primary data substantially.
Which agencies publish secondary data in India for Class 11 Statistics?
Major agencies that publish secondary data in India, as listed in NCERT Class 11 Statistics Chapter 2, include the National Statistical Office (NSO) and its successor body MoSPI, the Reserve Bank of India (RBI), the Census of India (decennial population census), the Central Statistics Office (CSO), the Labour Bureau, the Indian Council of Agricultural Research (ICAR) and the National Sample Survey Office (NSSO). International sources include the World Bank, IMF, WTO and UN agencies. These agencies publish official data on prices, employment, output, population and prices that researchers and policymakers re-use widely.
What is the difference between a leading question and a neutral question in survey design?
A leading question is worded in a way that pushes the respondent toward a particular answer, for example 'Don't you agree that prices are too high?', while a neutral question lets the respondent answer without bias, such as 'How do you rate the current price level — high, average or low?'. NCERT Class 11 Statistics Chapter 2 warns that leading questions destroy data quality because the answers reflect the questioner's view rather than the respondent's true opinion. Survey designers must use balanced wording, avoid loaded terms, and offer all reasonable response options to keep results objective.