February 26, 2014
120 Scientific Papers Withdrawn After Being Proven to be Gibberish.
No, Actual Computer-Generated Gibberish.
Multiple layers of painstaking fact-checking editorial oversight.
So, some scientists at MIT had invented a program called "SCIgen" to generate, by computer, random scientific-sounding papers. They did this for amusement.
But people (especially in China, apparently) have been using the program to generate papers and then submit them to actual scientific publishers' subscription services.
“The papers are quite easy to spot,” says Labbé, who has built a website where users can test whether papers have been created using SCIgen. His detection technique, described in a study published in Scientometrics in 2012, involves searching for characteristic vocabulary generated by SCIgen. Shortly before that paper was published, Labbé informed the IEEE of 85 fake papers he had found. Monika Stickel, director of corporate communications at IEEE, says that the publisher “took immediate action to remove the papers” and “refined our processes to prevent papers not meeting our standards from being published in the future”. In December 2013, Labbé informed the IEEE of another batch of apparent SCIgen articles he had found. Last week, those were also taken down, but the web pages for the removed articles give no explanation for their absence.
Ruth Francis, UK head of communications at Springer, says that the company has contacted editors, and is trying to contact authors, about the issues surrounding the articles that are coming down. The relevant conference proceedings were peer reviewed, she confirms — making it more mystifying that the papers were accepted.
It's possible the reviewers chalked up the computerese nonsense to a language barrier, figuring the "scientist" who wrote each paper spoke Chinese as a first language and was struggling with English. But this only goes so far, because, ultimately, these papers didn't make sense in any language. Because they were gibberish.
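For the curious, here is roughly what "searching for characteristic vocabulary" could look like in code. This is a minimal sketch in Python, not Labbé's actual method, which the excerpt above doesn't spell out. It just does naive phrase matching. The phrase list, the function names, and the 0.5 threshold are all made up for illustration; none of them come from SCIgen or from Labbé's tool.

```python
import re

# Hypothetical marker phrases, invented for this illustration.
# They are NOT taken from SCIgen's grammar or from Labbé's detector.
MARKER_PHRASES = [
    "we disconfirm that",
    "in this position paper",
    "the rest of this paper is organized as follows",
    "our heuristic is optimal",
]


def marker_score(text, phrases=MARKER_PHRASES):
    """Return the fraction of marker phrases that appear in the text."""
    # Lowercase and collapse whitespace so line breaks don't hide a phrase.
    normalized = re.sub(r"\s+", " ", text.lower())
    hits = sum(1 for phrase in phrases if phrase in normalized)
    return hits / len(phrases)


def looks_generated(text, threshold=0.5):
    """Flag a text when at least `threshold` of the marker phrases occur.

    The threshold is an arbitrary assumption, not a tuned value.
    """
    return marker_score(text) >= threshold
```

A real detector would need a vocabulary list built from actual SCIgen output and a threshold checked against real papers, but the basic idea is that a program working from a fixed grammar keeps reusing the same stock phrases.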
Labbé (the guy who built the tool for finding these fakes) wanted to prove how easy it was to spoof the system, so he created a fake scientist named "Ike Antkare."
Labbé is no stranger to fake studies. In April 2010, he used SCIgen to generate 102 fake papers by a fictional author called Ike Antkare. Labbé showed how easy it was to add these fake papers to the Google Scholar database, boosting Ike Antkare’s h-index, a measure of published output, to 94 — at the time, making Antkare the world's 21st most highly cited scientist.
Why? Why would 120 fake, gibberish, nonsense papers be submitted to these publishers? And how did they make it onto the system?
Well, possibly this is a prank, or an attempt to prove how easy it is to get nonsense published, as Labbé already did.
Or, possibly:
Apparently, in science, one gross method of ranking your authority is by counting up the number of times you're cited in other scientific papers.
So, what if you could just spam a lot of fictitious, gibberish papers and get them into "the system" (the subscription services) citing you a whole bunch of times? Then your crude bean-counting ranking goes up.
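To see how that bean-counting can be gamed, here is a small Python sketch of the h-index mentioned in the Antkare story above. By the standard definition, an author's h-index is the largest number h such that h of his papers have at least h citations each. The citation counts below are invented for illustration and are not anyone's real numbers.

```python
def h_index(citations):
    """Largest h such that h papers have at least h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


# Made-up citation counts for an author with five real papers.
before = [3, 2, 1, 0, 0]

# Hypothetical scenario: ten junk papers get into the system,
# and each one cites all five of the author's papers.
spam_papers = 10
after = [count + spam_papers for count in before]

print(h_index(before))  # 2
print(h_index(after))   # 5
```

Ten junk papers take this imaginary author from an h-index of 2 to 5, the maximum possible with five papers, without a single real scientist ever reading his work.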