{"id":23366,"date":"2025-09-24T03:28:40","date_gmt":"2025-09-24T03:28:40","guid":{"rendered":"https:\/\/naijaglobalnews.org\/?p=23366"},"modified":"2025-09-24T03:28:40","modified_gmt":"2025-09-24T03:28:40","slug":"journals-infiltrated-with-copycat-papers-that-can-be-written-by-ai","status":"publish","type":"post","link":"https:\/\/naijaglobalnews.org\/?p=23366","title":{"rendered":"Journals infiltrated with \u2018copycat\u2019 papers that can be written by AI"},"content":{"rendered":"<p>\n<\/p>\n<p class=\"figure__caption u-sans-serif\"><span class=\"mr10\">Open data sets and AI tools can be used to mass-produce low-quality, redundant papers.<\/span><span>Credit: Tutatama\/Alamy<\/span><\/p>\n<p>An analysis of a literature database finds that text-generating artificial intelligence (AI) tools \u2014 including ChatGPT and Gemini \u2014 can be used to rewrite scientific papers and produce \u2018copycat\u2019 versions that are then passed off as new research.<\/p>\n<p>In a preprint posted on medRxiv on 12 September1, researchers identified more than 400 such papers published in 112 journals over the past 4.5 years, and demonstrated that AI-generated biomedicine studies could evade publishers\u2019 anti-plagiarism checks.<\/p>\n<p><p class=\"recommended__title u-serif\">Low-quality papers based on public health data are flooding the scientific literature<\/p>\n<\/p>\n<p>The study\u2019s authors warn that individuals and paper mills \u2014 companies that produce fake papers to order and sell authorships \u2014 might be exploiting publicly available health data sets and using large language models (LLMs) to mass-produce low-quality papers that lack scientific value.<\/p>\n<p>\u201cIf left unaddressed, this AI-based approach can be applied to all sorts of open-access databases, generating far more papers than anyone can imagine,\u201d says Csaba Szab\u00f3, a pharmacologist at the University of Fribourg in Switzerland, who was not involved in the work. \u201cThis could open up Pandora\u2019s box [and] the literature may be flooded with synthetic papers.\u201d<\/p>\n<h2>Redundant research<\/h2>\n<p>To investigate, researchers screened association studies \u2014 those that statistically link a variable to a health outcome \u2014 that were based on data from the US National Health and Nutrition Examination Survey (NHANES), a huge open repository of data on the health, diet and lifestyles of thousands of people.<\/p>\n<p>They focused their search on studies they defined as \u2018redundant\u2019, meaning that the work tested the association between the same variable and health outcome as other research did, but analysed a subtly different subset of the actual data \u2014 including results from different survey years, for example, or participants of a different age or sex.<\/p>\n<p><p class=\"recommended__title u-serif\">AI linked to explosion of low-quality biomedical research papers<\/p>\n<\/p>\n<p>Their search of the PubMed index of biomedical literature revealed 411 redundant studies published between January 2021 and July 2025. Most were simple \u2018repeat\u2019 cases, involving two publications that were almost identical. But three associations had a particularly high number of duplicate studies \u2014 six papers apiece \u2014 some of which were published in the same year.<\/p>\n<p>This \u201cshouldn\u2019t be happening, and it doesn\u2019t help the health of the scientific literature\u201d, says co-author Matt Spick, a biomedical scientist at the University of Surrey in Guildford, UK.<\/p>\n<p>Most publishers have checks in place to prevent researchers submitting the same research to multiple journals, but Spick and his colleagues suspect that AI tools are being used to evade these.<\/p>\n<h2>Avoiding detection<\/h2>\n<p>To test whether AI can help to produce multiple papers from the same data set, the researchers used OpenAI\u2019s chatbot ChatGPT and Google\u2019s Gemini to rewrite three of the most heavily redundant articles identified by their analysis (each reported a particular association that had already been published five or six times). The researchers prompted the LLMs to use the information in each paper, and NHANES data, to produce a new manuscript that could avoid plagiarism detectors.<\/p>\n<p>\u201cWe were shocked that it worked straight away,\u201d says Spick. \u201cThey weren\u2019t perfect, and the LLMs did create some errors. It took two hours of cleaning-up work for each manuscript.\u201d<\/p>\n<p><p class=\"recommended__title u-serif\">Rise of ChatGPT and other tools raises major questions for research<\/p>\n<\/p>\n<p>When analysed with a plagiarism-detection tool used by many publishers, the AI-generated manuscripts did not score at a level that would be considered problematic by editors. This shows that LLMs \u201ccan produce something that is derivative of everything that\u2019s gone before and doesn\u2019t include anything new. But it\u2019ll still pass plagiarism checks\u201d, says Spick. This, in turn, makes it more difficult to distinguish between researchers who are producing genuine research using public-health data sets such as NHANES, and others who are deliberately creating redundant papers using LLMs, the authors note.<\/p>\n<p>\u201cThese are completely new challenges for the editors and the publishers,\u201d says Igor Rudan at the University of Edinburgh, UK, who studies global public health and is joint editor-in-chief of the Journal of Global Health, which introduced new guidelines for researchers submitting research on open-access data sets in July. \u201cWhen we first tried the LLMs, we immediately realized that this would become an issue, and this preprint confirms it,\u201d he adds.<\/p>\n<h2>Serious challenge<\/h2>\n<p>In July, Spick and his colleagues reported2 that there had been a surge in low-quality, formulaic publications using NHANES and other publicly available health data sets that they suspected had been fuelled by AI. The current analysis found a sharp rise in redundant NHANES studies after 2022 \u2014 the year ChatGPT was publicly released.<\/p>\n<p>Some publishers, including Frontiers, based in Lausanne, Switzerland, and the open-access Public Library of Science (PLOS) in San Francisco, California, have tried to address this by tightening up their editorial rules for accepting studies that are based on open-access health data sets such as NHANES.<\/p>\n<p>\u201cAI-driven redundancy, in general, poses a serious and ongoing challenge to publishers,\u201d says Elena Vicario, Frontiers\u2019 head of research integrity. Frontiers published 32% of the 411 redundant papers identified in the preprint, with 132 articles appearing across 11 of its journals. Vicario says that these papers pre-date policies that were introduced by the publisher earlier this year, and would not be published if they were submitted today. Frontiers journals have rejected 1,382 NHANES-based submissions since the introduction of their policy in May.<\/p>\n<p><p class=\"recommended__title u-serif\">The fight against fake-paper factories that churn out sham science<\/p>\n<\/p>\n<p>Springer Nature journals published 37% of the papers flagged by the preprint, including 51 articles that appeared in the journal Scientific Reports (Nature\u2019s news team is editorially independent of its publisher). \u201cWe take our responsibility towards maintaining the validity of the scientific record very seriously and all papers referenced by this preprint will be investigated and action taken where appropriate,\u201d says Richard White, editorial director for Scientific Reports, which, he says, has rejected more than 4,500 NHANES-based submissions since the start of 2024.<\/p>\n<p>\u201cThere are differences in opinion regarding the value of some analyses utilizing NHANES and similar data sets, and we are committed to both supporting the whole community and ensuring what we publish adds value,\u201d White adds. \u201cOur focus is on getting the right checks in place to remove unethically produced or meaningless research, whilst still publishing papers using such data sets that represent valid and valuable additions to the scientific literature.\u201d<\/p>\n<p>\u201cWe are alive to concerns around inappropriate use of these data sets and have been taking sustained action.\u201d<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Open data sets and AI tools can be used to mass-produce low-quality, redundant papers.Credit: Tutatama\/Alamy An analysis of a literature database finds that text-generating artificial intelligence (AI) tools \u2014 including ChatGPT and Gemini \u2014 can be used to rewrite scientific papers and produce \u2018copycat\u2019 versions that are then passed off as new research. In a<\/p>\n","protected":false},"author":1,"featured_media":23367,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[58],"tags":[9230,14254,14253,4107,9192],"class_list":{"0":"post-23366","1":"post","2":"type-post","3":"status-publish","4":"format-standard","5":"has-post-thumbnail","7":"category-science","8":"tag-copycat","9":"tag-infiltrated","10":"tag-journals","11":"tag-papers","12":"tag-written"},"_links":{"self":[{"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/posts\/23366","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=23366"}],"version-history":[{"count":0,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/posts\/23366\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=\/wp\/v2\/media\/23367"}],"wp:attachment":[{"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=23366"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=23366"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/naijaglobalnews.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=23366"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}