1. Sources of Collection of Data
- Primary Data:
Data collected by the investigator himself for the first time for a specific purpose.
Example: Data collected through surveys, interviews, observations. - Secondary Data:
Data already collected by someone else and used by the investigator for reference.
Example: RBI publications, Census reports, NSSO data.
2. Nature of Data
- Primary Data: First-hand, original data collected for the current investigation.
Example: Conducting a survey of students to know their study hours. - Secondary Data: Second-hand data, already collected for some other purpose.
Example: Using government census data for population study.
3. Difference Between Primary and Secondary Data
Basis | Primary Data | Secondary Data |
---|---|---|
Originality | Original, collected first-hand. | Already collected by others. |
Objective | Specific to current research purpose. | Collected earlier for different purposes. |
Cost | Expensive and time-consuming. | Cheaper and saves time. |
4. Statistical Methods / Modes of Collection of Data
(i) Personal Interviews
(a) Direct Personal Investigation
- Definition: Investigator personally collects information from the informants.
- Suitability: Best for small-scale, local, and intensive studies.
- Advantages:
- Original and accurate data.
- Highly reliable.
- Clarifications possible.
- Investigator observes personally.
- Control over data collection.
- Disadvantages:
- Very costly.
- Time-consuming.
- Difficult in large areas.
- Needs trained investigators.
- Subject to personal bias.
(b) Indirect Oral Investigation
- Definition: Data collected from third parties or witnesses instead of directly from informants.
- Suitability: Best for sensitive issues or when direct contact is not possible.
- Advantages:
- Cheaper than direct method.
- Less time required.
- Wide coverage possible.
- Useful in confidential matters.
- Easy to conduct.
- Disadvantages:
- Less reliable.
- Depends on others’ statements.
- May lack accuracy.
- Difficult to verify truth.
- Biased answers possible.
(c) Information from Local Sources / Correspondents
- Definition: Data collected regularly from local agents or correspondents appointed in different areas.
- Suitability: Best for continuous information like newspapers, trade journals, weather reports.
- Advantages:
- Continuous flow of data.
- Wide area coverage.
- Quick availability.
- Low cost.
- Useful for journals and agencies.
- Disadvantages:
- May lack accuracy.
- Reports may be biased.
- Delay possible.
- Investigator has little control.
- Reliability doubtful.
(ii) Information Through Questionnaires or Schedules
(a) Mailing Surveys (Questionnaires)
- Definition: List of questions sent by post to respondents for replies.
- Suitability: Best when respondents are educated and spread over a wide area.
- Advantages:
- Covers large area.
- Low cost.
- Respondents answer at convenience.
- No interviewer bias.
- Suitable for large-scale surveys.
- Disadvantages:
- Low response rate.
- Only literates can respond.
- Delay in response.
- Clarifications not possible.
- Incomplete or careless answers.
(b) Enumeration Method (Schedules)
- Definition: Enumerators personally visit respondents and fill the schedule.
- Suitability: Best when respondents are illiterate or when detailed information is required.
- Advantages:
- Reliable and detailed data.
- Clarifications possible.
- High response rate.
- More accuracy.
- Suitable for all sections of society.
- Disadvantages:
- Very costly.
- Time-consuming.
- Requires trained staff.
- Personal bias possible.
- Difficult for large geographical areas.
(iii) Telephonic Interviews
- Definition: Information collected by asking questions over the telephone.
- Suitability: Best for quick information where personal contact is difficult.
- Advantages:
- Quick method.
- Saves time and money.
- Wide coverage.
- Respondents feel comfortable.
- Easy for preliminary surveys.
- Disadvantages:
- Limited number of questions possible.
- Only people with telephones can be contacted.
- Respondents may refuse to cooperate.
- Less reliable than personal interviews.
- No personal observation possible.
5. Collection of Secondary Data
- Published Sources: Data already collected and made public by different organizations.
Examples:- Census Reports – Population and demographic details.
- Economic Surveys – Annual government survey.
- RBI Publications – Banking and finance data.
- UN Reports – International statistics.
- Trade Journals & Magazines – Industry and market data.
- Unpublished Sources: Data not published but available in records.
Examples:- Company Records – Sales, production, accounts.
- Research Theses – University projects.
- Government Departments’ Internal Files – Confidential data.
- Private Institutions’ Surveys – NGO studies.
- Committee Reports – Investigations by commissions.
6. Precautions in the Use of Secondary Data
- Check reliability of the source.
- Data must be suitable for the study purpose.
- Ensure definitions and methods are clearly understood.
- Units of measurement must match requirements.
- Data should be recent and up to date.
7. Two Major Sources of Secondary Data
- Census of India
- Conducted every 10 years by the Government of India.
- Provides reliable data on population, literacy, housing, employment, migration, etc.
- NSSO (National Sample Survey Organisation)
- Conducts sample surveys on socio-economic aspects like consumption, health, education, employment.
- Supplements census data with frequent and detailed studies.