Statistics is the representation of a large amount of data in graphs or charts, which helps derive the necessary inference from the same. Because it deals with a lot of data, it becomes necessary to know the prerequisites of collecting that specific data too.
To develop a clear understanding of the same, here we have taken two examples of how data is collected and used.
Primary Data
Suppose you are asked to analyse the amount of electricity consumed in your colony, consisting of 20 different households. So, you go to each house and gather the data yourself and note it down. This is called primary data collection. Herein, you are doing the first-hand job of gathering data from the resources.
Secondary Data
You are asked to make a case study of how the education sector has improved in the last 5 years in India. So, herein you do not go out to every school in India and ask for data. Instead, you look into the government website for the census data for all the necessary parameters.
Therefore, this is the secondary data collection where you do not directly collect data from the people involved in the act. Rather, you use the already collected data to derive other conclusions as asked in your work.
Both these examples of primary and secondary data are necessary to gain insight into how they are collected and how they can be used later.
How do Primary and Secondary Data differ?
The difference between primary data and secondary data is discussed in detail below:
Basis of Difference |
Primary Data |
Secondary Data |
Definition |
The primary data definition says it to be the data that is collected for the first time by the user himself |
The secondary data definition says it to be the type of data that is previously collected by others and later used by another. |
Method of collection |
These are collected via physical testing, observation, surveys, questionnaires, focus groups, telephonic interviews, photographs, artifacts, case studies, videos, diary entries, eyewitnesses, personal interviews, etc. |
They are collected from published data by the state or central government, articles by local bodies, census data, magazines, periodicals, journals, publications by CSO, etc. |
Purpose of collection |
They are mainly collected for a specific purpose and are involved in direct usage without any manipulation. |
They may be collected for multiple purposes as required by the user to derive various kinds of inferences from them after necessary manipulation. |
Authenticity |
They are collected by the user directly, so they are original and devoid of any kind of alteration. |
The data are collected by others for their usage, so it is not original. |
Type of data |
The collected primary data is in the form of raw materials, which need to be represented in proper ways to derive the necessary conclusion. |
These data are already collected and used for a specific purpose. So, they are obtained in a polished form. |
Dependability |
They are first-hand data, so these are more reliable, as they are collected with a specific intention. |
They are collected by others which makes them less reliable as they are collected for other purposes and may not be in line with the user’s requirement. |
Collection time |
Since they are collected for the first time directly by the user, it requires a lot of time. |
As the secondary data is already collected and can be used later, it does not demand time for collection when used later. |
Expenditure |
The first-time data collection is also quite expensive because the user has to conduct the surveys or questionnaires all by himself. |
This type of data is already present in a presentable form in the official websites or magazines, which can be used by the user to suffice his purpose. Hence, collecting them does not require extra expenses. |
Editing |
The primary data is generally gathered for a definite purpose, so they do not require alteration and manipulation to represent them for the project. |
The secondary data was collected for a different purpose, so when they are to be used in another project or purpose, they need manipulation or editing. |
Having a proper understanding of both primary and secondary data is vital for Commerce students. They need to understand that statistics has a vast range of applications in multiple arenas. It is essentially a tool to represent data in a simplified manner.
Various primary data examples reveal that they are most useful in showcasing any data with higher accuracy. As these data are in line with the purpose of collection, one can rest assured to use them in projecting future sales, analysing projects related to capital expenditure, and a lot more.
Similarly, the secondary data examples show that the financial reports, sales data, retailer or distributor, customer statements, etc are used to reach various conclusions. These inferences or conclusions are then projected to meet the purpose of collecting them.
Therefore, having clarity on both will help in differentiating the need for manipulation when you work with a certain type of data.
To know more about the types of primary data and secondary data along with their uses and other necessary details, you can visit our online learning programmes. It has a set of study materials written in an absolute lucid language for a clear and holistic understanding of the syllabus. There are online classrooms for your benefit.
Pros and Cons of Primary Data
For’s
-
At the time of data collection, primary data is adapted to the researcher’s needs. The researcher has control over the type of data that is gathered.
-
It is more accurate than secondary data. Because the data is not influenced by human judgment, its authenticity may be believed.
-
Because primary data is obtained in real-time rather than from old sources, it is frequently up to current.
-
The researcher has complete control over the data gathered during primary research. He has the authority to select the design, method, and data analysis techniques to be employed.
-
The researcher claims ownership of the data gathered during primary research. He or she may decide to make it public, patent it, or even sell it.
Against
-
Primary data is far more expensive than secondary data. As a result, acquiring primary data may be difficult.
-
Due to the complexity and time commitment necessary, collecting primary data may not be possible in some circumstances.
-
It takes a long time.
Pros and Cons of Secondary Data
For’s
-
Secondary data, in comparison to primary data, is more widely available. Secondary data is available on many platforms and may be used by the researcher.
-
The cost of secondary data is quite low. Because they are often given out for free, they may be gotten for little to no cost.
-
Secondary data collection often takes far less time than primary data collection.
-
Secondary data enables longitudinal studies to be undertaken without having to wait a longer amount of time for conclusions.
Against
-
Secondary data may not be authentic or trustworthy. A researcher may need to double-check the information gathered from various sources.
-
Before eventually identifying the essential data, researchers may have to deal with irrelevant data.
-
Secondary data sources are occasionally out of date, with no fresh information to replace the old.
-
Because of the data source’s personal bias, some of the data is overstated.