Image via Unsplash

September 2024 AI Misinformation Monitor of Leading AI Chatbots

Audit of the top 10 leading generative AI tools and their propensity to repeat false narratives on topics in the news

Published Oct. 1, 2024

The September 2024 edition of the monthly report found that the 10 leading chatbots collectively repeated misinformation 18% of the time, offered a non-response 20.33% of the time, and a debunk 61.66% of the time. The 38.33% “fail rate” (percentage of responses containing misinformation or offering a non-response) marks a modest improvement this month from NewsGuard’s previous audits, likely as explained below because several of the false claims this month dealt with unusually high-profile topics and were widely debunked online. The industry still faces major hurdles in responding to false narratives effectively.

NewsGuard launched a monthly AI News Misinformation Monitor in July 2024, setting a new standard for measuring the accuracy and trustworthiness of the AI industry by tracking how each leading generative AI model is responding to prompts related to significant falsehoods in the news.

The monitor focuses on the 10 leading large-language model chatbots: OpenAI’s ChatGPT-4, You.com’s Smart Assistant, xAI’s Grok, Inflection’s Pi, Mistral’s le Chat, Microsoft’s Copilot, Meta AI, Anthropic’s Claude, Google’s Gemini, and Perplexity’s answer engine. It will expand as needed as other generative AI tools are launched.

Researchers, platforms, advertisers, government agencies, and other institutions interested in accessing the detailed individual monthly reports or who want details about our services for generative AI companies can contact NewsGuard here. And to learn more about NewsGuard’s transparently-sourced datasets for AI platforms, click here.

Download the Report

To download the AI Misinformation Monitor, please fill out your details below and you will be redirected to the report.

  • By submitting this form, you agree to receive email communications from NewsGuard.
  • This field is for validation purposes and should be left unchanged.