AI Tools Combat Errors in Scientific Research
In a bid to bolster the integrity of scientific publications, two new artificial intelligence (AI) tools are being developed to identify and flag errors in research papers. These tools are designed to scrutinize calculations, methodologies, and references, potentially preventing flawed findings from entering the scientific record.

One incident that highlights the need for such tools involved a widely reported issue with black plastic cooking utensils. Initial reports claimed the utensils contained dangerous levels of cancer-linked flame retardants. However, a mathematical error in the underlying research greatly exaggerated the risk, demonstrating the potential for mistakes to be amplified and publicized.
The Black Spatula Project
The first project, known as the Black Spatula Project, is an open-source initiative that has already analyzed approximately 500 papers. Joaquin Gulloso, an independent AI researcher coordinating the project, noted that the tool is detecting numerous errors. The project involves around eight active developers, hundreds of volunteer advisors, and is currently approaching the authors of the affected papers directly.
YesNoError
Inspired by the Black Spatula Project, YesNoError has taken an even more ambitious approach. Founded by AI entrepreneur Matt Schlicht, the project has analyzed over 37,000 papers within a couple of months. This initiative, financed through a dedicated cryptocurrency, flags papers with identified flaws on its website. While many of these findings await human verification, YesNoError plans to implement a large-scale verification process.
Potential Risks and Concerns
While both projects are receiving tentative support from research integrity experts, some concerns have been raised. Michèle Nuijten, a researcher in metascience at Tilburg University, emphasizes the importance of verifying tool claims to prevent reputational damage. James Heathers, a forensic metascientist at Linnaeus University, suggests that AI could serve as a first step in identifying papers that warrant closer examination.
AI’s Role in Error Detection
Both the Black Spatula Project and YesNoError employ large language models (LLMs) to search for a variety of errors, including factual inaccuracies, calculation mistakes, and methodology issues. The systems extract information from the papers, create prompts for reasoning models, and analyze each paper. The cost of analyzing each paper ranges from 15 cents to a few dollars.
A significant hurdle for these tools is the rate of false positives. The Black Spatula Project’s system reports errors incorrectly about 10% of the time. YesNoError is quantifying the rate of false positives in mathematical errors and is planning to collaborate with ResearchHub to involve PhD scientists in peer review following AI analysis.
These AI tools offer the potential to significantly improve the quality and reliability of scientific research. Implementing these technologies early in the publication process could lead to reduced errors and a more trustworthy body of scientific literature.