Can robots help us fact-check?

Teacher grades with red pen.
Courtesy of Midjourney

The conventions are over, and presidential debates are nigh.

Vice President Kamala Harris and former President Donald Trump are scheduled to debate each other for the first time on Sept. 10. Presuming it happens – Trump has suggested he might skip it – moderators and independent fact-checkers at news outlets the world over will be looking to hold the candidates accountable for their claims.

In that spirit, we decided to test four AI-powered fact-checking services to see if they’re up to the task of sifting fact from fiction in future debates.

We came in skeptical: AI models are prone to hallucination, meaning they tend to make things up. They’re much better at mimicking writing style than figuring out what’s true and what’s not.

To test our assumptions, we checked out Originality.AI, a paid service that costs about $15 a month, though the kind folks there allowed us to try the product for free. Originality’s focus is testing for plagiarism, but the company is expanding into the fact-checking business. It cautions that its tool is in beta but claims better accuracy than OpenAI’s and Meta’s models. We also tested OpenAI’s ChatGPT, Anthropic’s Claude, and the AI search engine Perplexity.

We ran the introductory portion of Trump’s Republican National Convention address through all four tools. “Let me begin this evening by expressing my gratitude to the American people for your outpouring of love and support following the assassination attempt at my rally on Saturday,” Trump said, referencing the events of the previous week. Claude and ChatGPT cautioned that they could not rate claims that were so recent, though ChatGPT offered to search the web and found the statement accurate when it did. Originality mistakenly rated it false, saying that there was no assassination attempt in 2023 – its training data stops before 2024. Perplexity performed best, finding the statement accurate and providing information about the attempted assassination in Butler County, PA.

Given the date range limits, we tested a claim from Trump’s 2020 RNC speech about education policy. “Biden also vowed to oppose school choice and close all charter schools, ripping away the ladder of opportunity for Black and Hispanic children,” Trump said. “In a second term, I will expand charter schools and provide school choice to every family in America.” NPR rated this claim “false,” noting that Joe Biden never campaigned on closing charter schools.

Originality got this one correct. “The claim is false. The sources provided, including the Washington Post and Politico, contradict the claim that Joe Biden vowed to oppose school choice and close all charter schools. In fact, Biden has expressed support for charter schools and has not proposed any plans to close them.” Claude, ChatGPT, and Perplexity all called the claim misleading or false and gave ample explanations why.

Lastly, we tested the four services on a claim Harris made in her recent DNC address that in 2020 “Donald Trump tried to throw away your votes.” Perplexity said it’s an “oversimplification of a complex situation” and explained Trump’s legal challenges to the election results, his pressure campaign on state officials, and false claims about widespread voter fraud. Claude and ChatGPT made similar determinations. Originality said the claim was false but admitted that its cited sources indicated that Trump was trying to suppress votes but not throw them out.

Jonathan Gillham, Originality’s founder and CEO, said that the company’s foundation model is limited to information before the end of 2023 but does have access to Retrieval-Augmented Generation, or RAG, which allows models to fetch more current information and process it. But, he said, that process hasn’t yet produced the results he wants in terms of accuracy.

Ultimately, our cursory test shows the possibilities and limitations of AI fact-checking. These tools are perhaps most useful for evaluating claims made in the distant past, when ample public sources clarifying what’s true and what’s not are available for them to draw from. That’s not conducive to real-time fact-checking, and it depends on (human) professionals having already done that work. Perplexity performed best, however, and showed a glimpse of what a responsive and up-to-date AI fact-checking system might look like.

Gillham told GZERO he thinks that in the future, real-time AI fact-checking will be possible. “However, like all AI systems, it will not be perfect and make some number of mistakes.”
