The DFRLab analyzed the dataset to determine its veracity, as well as the accuracy of the reporting surrounding it. The data was scanned for malware and run on a virtual machine, as content shared on Raidforums sometimes contains malicious code. For instance, one of the leaks by WikiLeaks on Turkey’s ruling Justice and Development Party found that among the files there were more than 3,000 files that were infected with malicious code.

The file being an .mdb database required an mdb viewer, since mdb is an legacy format that was used prior to the release of database management software Microsoft Access 2007. The file format was a giveaway that the database seems to have been developed using a technology that would be fairly old for a circa-2020 election database.

One of the key findings can be noticed on the file itself: it was created in August 2011. Under the Breach also confirmed that the database appears to have been leaked around 2011 but had not observed its surfacing prior to 2020.

Image of the file with creation date implying it was made in 2011. (Source: DFRLab)

The file name “reestri” implied that the data seems to be a registry database. The lead document contained Georgian citizens’ ID number, last name, first name, father’s name, date of birth, registration date, “DMONAC” (the DFRLab couldn’t confirm the meaning of this acronymn), sex, card number, address, and region.

The DFRLab verified the authenticity of the database by going to the Georgian voter registration verification site voters.cec.gov.ge and searching for random people listed in the data set via the site’s search engine.

The data set includes people born in 1880. This supports the idea that it includes information about family descent, given that those born in 1880 would now be 131 years old.

Image of the leaked database showing that the document includes the information about people born in 1880. (EtoBuziashvili/DFRLab)

Another finding was that the data set includes the personal information of underaged Georgians. The data finishes with the people born in 2011, who would still be young to vote in a 2020 election.

Image of the leaked database showing that the document includes the information about people born in 2011. (EtoBuziashvili/DFRLab)

After the leaked data surfaced, the Central Election Committee of Georgia (CEC) stated that the database uploaded on Raidforums doesn’t match their official database because there are only 3.5 million Georgian citizens in the CEC election database and it does not include data regarding family descent.

While the leaked dataset dates to 2011, it could be used for various nefarious purposes, including privacy breaches, identity theft, and election-related intimidation. Even if the leaked data in itself is useless for influencing elections, the fact that it was circulated in the first place may raise serious perception problems and impact Georgians’ trust in democratic processes.