May 28, 2024
One of the biggest challenges facing artificial intelligence companies is that they don’t know everything about their algorithms. This so-called black box problem is exacerbated by the fact that deep learning models do precisely that — they learn. And when they learn they change. They take in enormous troves of data, detect patterns, and spit something out: How a sentence should read, what an image should look like, how a voice should sound.
But now researchers at Anthropic, the AI startup that makes the chatbot Claude, claim they’ve had a breakthrough in understanding their own model. In a blog post, Anthropic researchers disclosed that they’ve found 10 million “features” of their Claude 3 Sonnet language model, with certain patterns that pop up when a user inputs something it recognizes. They’ve been able to map features that are close to one another: One for the Golden Gate Bridge, for example, is close to another for Alcatraz Island, the Golden State Warrior, California Governor Gavin Newsom, and the Alfred Hitchcock film Vertigo — set in San Francisco. Knowing about these features allows Anthropic to turn them on or off, manipulating the model to break out of its typical mold.
More For You
Mastercard Economic Institute's Outlook 2026 explores the forces redefining global business. Tariffs, technology, and transformation define an adaptive economy for the year ahead. Expect moderate growth amid easing inflation, evolving fiscal policies, and rapid AI adoption, driving productivity. Digital transformation for SMEs and shifts in trade and consumer behavior will shape strategies worldwide. Stay ahead with insights to help navigate complexity and seize emerging opportunities. Learn more here.
Most Popular
- YouTube
Despite a ceasefire in Gaza, Israel is still not letting foreign journalists in to independently verify what’s happening on the ground, CNN’s Clarissa Ward tells Ian Bremmer on GZERO World.
- YouTube
On Ask Ian, Ian Bremmer breaks down the steady escalation of US pressure on Venezuela and why direct military action is now a real possibility.
US President Donald Trump arrives to announce reciprocal tariffs against US trading partners in the Rose Garden of the White House in Washington, DC, USA, on April 2, 2025.
POOL via CNP/INSTARimages.com
From civil conflicts to trade wars to the rise of new technologies, GZERO runs through the stories that have shaped this year in geopolitics.
© 2025 GZERO Media. All Rights Reserved | A Eurasia Group media company.
