Gibberish-Loaded Fake Papers Are Flooding Academia And A New Cat And Mouse Detection Game Is Afoot

from the gyre-and-gimble-in-the-wabe dept

"As is clear from the title of this paper, this paper deals with the entertainment industry. So, we do provide entertainment in this paper. So, if you are reading this paper for entertainment, we suggest a heuristic that will allow you to read this paper efficiently. You should read any paragraph that starts with the first 4 words in bold and italics – those have been written by the author in painstaking detail. However, if a paragraph does not start with bold and italics, feel free to skip it because it is gibberish auto-generated by the good folks at SCIGen ."

"Over the past two years, computer scientist Cyril Labbe of Joseph Fourier University in Grenoble, France, has catalogued computer-generated papers that made it into more than 30 published conference proceedings between 2008 and 2013. Sixteen appeared in publications by Springer, which is headquartered in Heidelberg, Germany, and more than 100 were published by the Institute of Electrical and Electronic Engineers (IEEE), based in New York. Both publishers, which were privately informed by Labbe, say that they are now removing the papers."

"Labbe is no stranger to fake studies. In April 2010, he used SCIgen to generate 102 fake papers by a fictional author called Ike Antkare [see pdf]. Labbe showed how easy it was to add these fake papers to the Google Scholar database, boosting Ike Antkare's h-index, a measure of published output, to 94 — at the time, making Antkare the world's 21st most highly cited scientist. Last year, researchers at the University of Granada, Spain, added to Labbe's work, boosting their own citation scores in Google Scholar by uploading six fake papers with long lists to their own previous work."

Thank you for reading this Techdirt post. With so many things competing for everyone’s attention these days, we really appreciate you giving us your time. We work hard every day to put quality content out there for our community. Techdirt is one of the few remaining truly independent media outlets. We do not have a giant corporation behind us, and we rely heavily on our community to support us, in an age when advertisers are increasingly uninterested in sponsoring small, independent sites — especially a site like ours that is unwilling to pull punches in its reporting and analysis. While other websites have resorted to paywalls, registration requirements, and increasingly annoying/intrusive advertising, we have always kept Techdirt open and available to anyone. But in order to continue doing so, we need your support. We offer a variety of ways for our readers to support us, from direct donations to special subscriptions and cool merchandise — and every little bit helps. Thank you.

–The Techdirt Team

At the beginning of the year, Mike pointed out the strange and funny tale of Navin Kabra, an entrepreneur in India . Kabra started wondering if the requirement frequently placed on students in India to have two papers "published" at various conferences was little more than a huge scam, designed largely to get students to pay the fees for the submissions and the conferences. Despite claims that these works are "reviewed by panelists from a panel of international experts using a double-blind review methodology," Kabra didn't think they were actually even being read.To go about proving his theory, he started using the science gibberish-generating SCIgen app to submit papers to conferences. It's worth noting he didn't evento make the papers sound coherent or logical, burying entire paragraphs referencing things likeor using dialogue from movies like. In numerous spots within the papers he clearly admits that a nonsense generator is writing the text:His two bogus papers were accepted (one he paid to have published), and Kabra hoped at the time that his experiences would build awareness of the issue. Apparently that hasn't been the case. As it turns out, the practice isn't just occurring in India -- it's happening everywhere, and has been seemingly spreading for some time. This week it was revealed that just two publishers, Springer and the Institute of Electrical and Electronic Engineers (IEEE), had published more than 120 bogus papers packed with nonsense:Labbe has one-upped Kabra's exposure attempts by also publishing fake research of his own; research that helped him fairly easily boost his reknown in the Google Scholar database:He's taken things one step further, creating a web-based program to help publishers scan for SCIgen gibberish, the technical specifics of which he has published with Springer . With this story now starting to see broader traction, it's probably safe to assume publishers are quietly pretty busy reviewing the archives to determine just how embarrassed they should all be. We've essentially just witnessed the evolution of a new generation of cat and mouse bullshit creation and detection, something somebody should clearly write a preferably-factual and coherent research paper on.

Filed Under: academia, navin kabra, scigen