Protecting AI against foreign influence operations designed to infect LLMs
NewsGuard’s Foreign Adversary Influence in LLMs Safety Service (FAILSafe) helps AI companies detect and defend against foreign influence operations aimed at tainting AI responses with state-sponsored disinformation narratives and propaganda.
FAILSafe was created in response to a groundbreaking NewsGuard audit that found Russian disinformation networks had infected top AI tools, leading those tools to repeat propaganda narratives 33% of the time. The service provides AI companies with real-time data on the narratives and sources involved in malign influence operations run by the Russian, Chinese, and Iranian governments, verified by disinformation researchers with expertise in foreign malign influence.