The study has been widely quoted by the Union government to battle criticism of jobless growth in the economy. Four days after the study by Ghosh and Ghosh was made public, Prime Minister Narendra Modi said in a televised interview that the data on 7 million jobs was a result of a study. While announcing the steps taken by the Centre to address the employment problem, Finance Minister Arun Jaitley said in his Union Budget 2018-19 speech earlier this month

that “an independent study conducted recently has shown that 7 million formal jobs will be created this year”.

Soumya Kanti Ghosh confirmed that both the researchers worked on their study from the office of the Aayog since the data was not publicly available. Sources said the EPFO placed the entire database on a file server and provided a URL (uniform resource locator) that could be accessed from anywhere by anyone who had the link to the server. The URL had since been removed, sources said.

The Aayog gave Ghosh and Ghosh access to around 60 gigabytes of EPFO database that included employees’ names, dates of birth, permanent account numbers, provident fund contributions, and industry names, for a period between January 2015 and November 2017.

On November 2 last year, NITI Aayog Vice-Chairman Rajiv Kumar wrote a letter to the EPFO asking it to provide data on new EPF subscribers from April to October 2017. In that communication, Kumar referenced the PMO meeting, held on October 29 last year, where the Aayog was told to “collect and analyse employment data across various sources”.

“The government think tank does not have the wherewithal to work on such massive data. So, it might have delegated it (the survey) to them,” said a senior government official who was part of the meetings that discussed the employment data.

Using Big Data and machine learning tools, Ghosh and Ghosh worked on the sample data first (for the April-October 2017 period) before gaining access to the bigger database to conduct their final survey. Pulak Ghosh is an expert on Big Data and was also part of the advisory group on Big Data at the United Nations.

The EPFO data centre provided the entire database of its subscribers to the Aayog on November 29 last year, for the period between January 2015 and October 2017.

A senior EPFO official said, “We gave the data to the Aayog, which is a government entity. We did not know there was an independent survey being undertaken based on our data.” The data was not meant to be shared with private citizens, he added.

EPFO Central Provident Fund Commissioner V P Joy did not respond to an email questionnaire. The Aayog did not revert to a query on whether it would allow other researchers to work on a similar study in their office.

Many economists and Opposition party leaders have criticised the Union government in the past for providing “privileged access” to Ghosh and Ghosh for their survey and have demanded similar access to the database for their own studies.

Ghosh and Ghosh took more than a month to work on the massive EPFO subscriber data before making a final presentation to the PMO on the survey findings on January 12, days prior to making their report public. It was published as an “academic study” conducted by IIM Bangalore and SBI.

Part II: What Ghosh & Ghosh did not make public about their survey report