- arXiv, a leading repository for preprint research, has started banning authors who submit AI-generated papers with fake references.
- AI ‘slop’ refers to low-quality content generated by AI systems, often introducing entirely fabricated data and references.
- arXiv is implementing automated detection and human audits to flag suspicious submissions and maintain academic integrity.
- The ‘slopification’ of scholarly publishing threatens to erode trust in peer review and scientific credibility.
- Over 12,000 preprints across major platforms contain at least one AI-hallucinated reference, according to a 2025 study.
In a striking move to preserve academic integrity, arXiv, the world’s most influential repository for preprint research in physics, mathematics, and computer science, has begun banning authors who submit papers containing AI-hallucinated references. These false citations, fabricated by large language models and presented as legitimate scientific sources, have surged since 2023, with one 2025 study estimating that over 12,000 preprints across major platforms contained at least one hallucinated reference. In some cases, entire bibliographies were invented by AI tools with no grounding in real literature. arXiv’s decision marks the strongest institutional response yet to what scientists are calling the ‘slopification’ of scholarly publishing, a crisis that threatens to erode trust in peer review and scientific credibility.
The Rise of AI ‘Slop’ in Academic Publishing
The term ‘AI slop’ has gained traction in academic circles to describe low-quality, often incoherent content generated automatically by artificial intelligence systems. Unlike traditional plagiarism, which involves copying existing work, AI slop introduces entirely fabricated data, methods, and references. arXiv, operated by Cornell University and hosting over 2.5 million preprints, is now implementing automated detection systems and human audits to flag suspicious submissions. Authors found to have knowingly or negligently included hallucinated references will face immediate bans from submitting future work. This policy shift reflects growing alarm among journal editors, peer reviewers, and funding agencies about the contamination of scientific discourse. As AI writing tools become more accessible, the line between human and machine authorship blurs, raising urgent questions about accountability, verification, and the future of scholarly communication.
How arXiv Is Enforcing the New Rules
arXiv’s enforcement mechanism combines algorithmic screening with expert oversight. Every new submission now undergoes an AI-assisted bibliographic scan that cross-references citations against a curated database of verified journals, conferences, and digital object identifiers (DOIs). When a citation fails to match, for example because it is attributed to a non-existent journal or lists fabricated author names, the system flags it for human review. In confirmed cases, arXiv notifies the author and permanently revokes their submission privileges. The platform has already revoked access for at least seven researchers since the policy took effect in April 2026. Notably, the ban applies regardless of whether the author claims ignorance of AI involvement, reinforcing arXiv’s stance that due diligence is non-negotiable. This zero-tolerance approach distinguishes arXiv from other platforms, such as ResearchGate or SSRN, which have so far relied on post-publication corrections rather than pre-emptive bans.
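The screening step described above can be illustrated with a minimal sketch. This is not arXiv’s actual system: the names VERIFIED_DOIS, VERIFIED_VENUES, Citation, screen_citation, and flag_for_review are hypothetical, and the two small in-memory sets stand in for the curated database of verified venues and DOIs. The sketch only shows the general idea of matching each citation against known records and routing mismatches to a human reviewer.

```python
import re
from dataclasses import dataclass
from typing import List, Optional, Tuple

# Hypothetical stand-ins for a curated database of verified records.
VERIFIED_DOIS = {"10.1000/example.001", "10.1000/example.002"}
VERIFIED_VENUES = {"journal of examples", "example conference proceedings"}

# Loose shape check for a DOI: "10.", a registrant code, "/", a suffix.
DOI_PATTERN = re.compile(r"^10\.\d{4,9}/\S+$")


@dataclass
class Citation:
    title: str
    venue: str
    doi: Optional[str] = None


def screen_citation(citation: Citation) -> List[str]:
    """Return the reasons a citation fails to match; empty if it passes."""
    reasons = []
    if citation.doi is not None:
        if not DOI_PATTERN.match(citation.doi):
            reasons.append("malformed DOI")
        elif citation.doi.lower() not in VERIFIED_DOIS:
            reasons.append("DOI not in verified database")
    if citation.venue.strip().lower() not in VERIFIED_VENUES:
        reasons.append("unknown venue")
    return reasons


def flag_for_review(
    bibliography: List[Citation],
) -> List[Tuple[Citation, List[str]]]:
    """Collect citations that need human review, with the reasons why."""
    flagged = []
    for citation in bibliography:
        reasons = screen_citation(citation)
        if reasons:
            flagged.append((citation, reasons))
    return flagged


if __name__ == "__main__":
    bibliography = [
        Citation("A real paper", "Journal of Examples", "10.1000/example.001"),
        Citation("A phantom paper", "Journal of Nonexistent Results",
                 "10.9999/fake.123"),
    ]
    for citation, reasons in flag_for_review(bibliography):
        print(citation.title, "->", ", ".join(reasons))
```

In this sketch a mismatch never triggers a penalty on its own; it only produces a list of reasons for a reviewer to examine, mirroring the split between automated flagging and human confirmation described in the article.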
Why Hallucinated References Undermine Scientific Trust
The inclusion of false references is not merely a technical violation; it strikes at the foundation of scientific reproducibility. When a researcher cites a non-existent study, they create the illusion of consensus or prior validation, misleading reviewers and readers. For example, a 2024 incident involved a machine learning paper that cited a fictional Nature article claiming breakthrough accuracy in protein folding, a claim that collapsed when the cited article turned out not to exist. Such deceptions distort the scientific record and can lead to wasted resources as other teams attempt to replicate phantom results. Experts warn that unchecked AI hallucinations could trigger a ‘credibility cascade,’ where flawed papers citing other flawed papers proliferate across the literature. According to a report by Nature, some AI models generate plausible-sounding but entirely false references up to 27% of the time when prompted to support technical claims.
The Debate Over Punitive vs. Educational Responses
While arXiv’s crackdown has drawn praise from integrity advocates, it has also sparked debate over whether bans are too harsh, especially for early-career researchers who may not fully understand AI limitations. Some scholars argue that instead of punishment, institutions should focus on education and transparency. “We need citation provenance standards, not just penalties,” said Dr. Elena Torres, a meta-science researcher at the University of Edinburgh. “Requiring authors to disclose AI use and verify each reference through trusted tools would be more constructive.” Others counter that the scale of the problem demands decisive action. “If we tolerate even a small number of hallucinated papers, we risk systemic contamination,” said Dr. Rajiv Malhotra, a computational biologist at MIT. The controversy mirrors broader tensions in academia over how to regulate AI without stifling innovation or disproportionately penalizing less-experienced scholars.
Expert Perspectives
Opinions among scientists are deeply divided. Proponents of arXiv’s policy, like Stanford AI ethicist Dr. Naomi Chen, argue that “academic publishing cannot function if the reference layer is compromised.” She compares hallucinated citations to falsified data: “It’s not a typo—it’s fraud.” On the other side, digital humanities scholar Dr. Marcus Liu warns that overly rigid enforcement could exclude researchers from under-resourced institutions who rely on AI for language polishing. “We’re creating a two-tier system where only those with access to expert editing teams can publish safely,” he said in an interview with ScienceDaily. The debate underscores the need for global standards in AI-assisted research, potentially led by bodies like the Committee on Publication Ethics (COPE) or UNESCO’s AI ethics framework.
Looking ahead, the scientific community faces a critical juncture. Will arXiv’s hardline approach become a model for other repositories, or will it spur resistance and fragmentation in publishing norms? As AI tools evolve, so must verification systems, potentially including blockchain-verified citations or mandatory AI-detection audits. One thing is clear: the integrity of science depends not just on what is published, but on whether every claim can be traced back to a real, verifiable source. The battle against AI slop is no longer theoretical. It is underway, and the stakes are high.
Source: Nature




