Scientists have developed a computer tool that can spot whether somebody has filed a fake police statement based purely on the text included in the document.

Using a combination of automatic text analysis and advanced machine learning techniques, the tool has been able to successfully identify false robbery reports with over 80 per cent accuracy.

The tool has now been rolled out across all of Spain to help support police officers and indicate where further investigations are necessary.

Known as VeriPol, the tool is specific to reports of robbery and can recognise patterns that are more common with false claims, such as the types of items reported stolen, finer details of incidents and descriptions of a perpetrator.

The research team, which included computer science experts from Cardiff University and Charles III University of Madrid, believe the tool could save the police both time and effort by complementing traditional investigative techniques, whilst also deterring people from filing fake statements in the first place.

The results of the study have been published in the academic journal Knowledge-Based Systems.

In many parts of the world, filing a false police statement is a crime that carries serious consequences, such as jail terms and heavy fines.

False statements not only contaminate police databases and damage the outcomes of criminal investigations, but also waste significant amounts of public resources that could be dedicated to pursuing other crimes.

Yet false reports are very common, especially in the reporting of low-level crimes such as robbery.

VeriPol is partly based on a process known as natural language processing – a branch of artificial intelligence that helps computers understand, interpret and manipulate human language. For example, the computer tool uses algorithms to identify and quantify various features in text, such as adjectives, acronyms, verbs, nouns, punctuation marks and numbers and figures.
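To give a rough sense of what "identifying and quantifying features in text" can look like, here is a minimal Python sketch that counts a few surface features of a statement. It is an illustration only, not VeriPol's actual code: the function name `extract_surface_features`, the regular expressions and the rule for spotting acronyms are all assumptions made for this example. Counting adjectives, verbs and nouns would additionally need a part-of-speech tagger, which is left out here.

```python
import re


def extract_surface_features(text: str) -> dict:
    """Count simple surface features in a statement (hypothetical helper)."""
    # Word-like tokens: runs of letters, digits or underscores.
    words = re.findall(r"\w+", text)
    # Punctuation marks: any single character that is neither word-like nor whitespace.
    punctuation = re.findall(r"[^\w\s]", text)

    # Tokens made up only of digits are treated as numbers.
    numbers = sum(1 for w in words if w.isdigit())
    # Assumption: an acronym is an all-capitals alphabetic token of two or more letters.
    acronyms = sum(1 for w in words if len(w) >= 2 and w.isalpha() and w.isupper())

    return {
        "tokens": len(words),
        "numbers": numbers,
        "acronyms": acronyms,
        "punctuation": len(punctuation),
    }


# Made-up example sentence, not taken from any real police report.
features = extract_surface_features(
    "Two men took my phone at 10 pm near the ATM, then ran."
)
print(features)
```

Counts like these could then be passed, as a row of numbers per report, to whatever machine learning model is being trained.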

Historical police reports that were known to be false were fed through VeriPol so that it could code each one and begin to ‘learn’ the specific patterns.

An initial study of more than 1000 police reports from the Spanish National Police showed that VeriPol was ‘extremely effective in discriminating between false and true reports’, with a success rate of more than 80 per cent.
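A success rate of this kind is usually read as accuracy: the share of reports for which the tool's verdict matches the known outcome. The short Python sketch below shows that arithmetic under this assumption. The function name `accuracy` and the ten labels are invented purely for illustration and are not data from the study.

```python
def accuracy(predicted: list, actual: list) -> float:
    """Fraction of predictions that match the known labels."""
    if len(predicted) != len(actual):
        raise ValueError("predicted and actual must have the same length")
    if not actual:
        raise ValueError("at least one labelled report is required")
    correct = sum(1 for p, a in zip(predicted, actual) if p == a)
    return correct / len(actual)


# Toy labels only (True = report flagged or known as false); not real study data.
predicted = [True, True, False, False, True, False, True, False, False, True]
actual = [True, False, False, False, True, False, True, True, False, True]

print(accuracy(predicted, actual))  # 8 of the 10 toy verdicts match, so 0.8
```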

VeriPol identified a number of themes that were common amongst false robbery reports, including: shorter statements that were more focussed on the stolen property than the incident; a lack of precise detail about the incident itself; limited details of the attacker; and a lack of witnesses or other hard evidence, such as a record of having contacted a police officer or doctor straight after the incident.

“As an example, our model began to identify false statements where it was reported that incidents happened from behind or where the aggressors were wearing helmets,” co-author of the study Dr Jose Camacho-Collados, from Cardiff University’s School of Computer Science and Informatics, said.