Looking inside the black box

Looking into the code.
Looking into the code.
DPA via Reuters
One of the biggest challenges facing artificial intelligence companies is that they don’t know everything about their algorithms. This so-called black box problem is exacerbated by the fact that deep learning models do precisely that — they learn. And when they learn they change. They take in enormous troves of data, detect patterns, and spit something out: How a sentence should read, what an image should look like, how a voice should sound.

But now researchers at Anthropic, the AI startup that makes the chatbot Claude, claim they’ve had a breakthrough in understanding their own model. In a blog post, Anthropic researchers disclosed that they’ve found 10 million “features” of their Claude 3 Sonnet language model, with certain patterns that pop up when a user inputs something it recognizes. They’ve been able to map features that are close to one another: One for the Golden Gate Bridge, for example, is close to another for Alcatraz Island, the Golden State Warrior, California Governor Gavin Newsom, and the Alfred Hitchcock film Vertigo — set in San Francisco. Knowing about these features allows Anthropic to turn them on or off, manipulating the model to break out of its typical mold.

This development offers hope that the companies behind powerful generative AI models will soon have much more control over their creations, as MIT professor Jacob Andreas told theNew York Times. “In the same way that understanding basic things about how people work has helped us cure diseases,” Andreas said, “understanding how these models work will both let us recognize when things are about to go wrong and let us build better tools for controlling them.”

More from GZERO Media

People attend a rally to protest against the arrest of Istanbul Mayor Ekrem Imamoglu as part of a corruption investigation in Istanbul, Turkey, on March 29, 2025.
REUTERS/Umit Bektas

Hundreds of thousands of people flooded the streets of Istanbul this weekend to protest the detainment of Istanbul Mayor Ekrem Imamoglu, a popular contender for the next presidential election.

Democratic-backed Dane County Circuit Judge Susan Crawford and Republican-backed Waukesha County Circuit Judge Brad Schimel square off in their only debate until their April 1 election.
Brian Cahn/ZUMA Press Wire via Reuters

Elections are back in the United States — and so is the money. Six months after the 2024 US presidential vote, Wisconsinites will head to the polls Tuesday to decide whether liberal candidate Susan Crawford or her opponent, conservative Brad Schimel,will tip the ideological balance of the state Supreme Court. The liberals currently have a 4-3 advantage.

US Secretary of Defense Pete Hegseth shakes hands with Japanese Prime Minister Shigeru Ishiba at the Prime Minister's office in Tokyo on March 30, 2025.
POOL via ZUMA Press Wire via Reuters

In his first trip to Asia this weekend, US Secretary of Defense Pete Hegseth called for greater military cooperation between Tokyo and Washington.

People walk by as a painter repaints an anti-US mural in Tehran, Iran, on Saturday, March 29, 2025.
Majid Asgaripour/WANA via Reuters

On Sunday, US President Donald Trump issued a stark warning to Iran, threatening to bomb the country and impose secondary tariffs if Tehran fails to reach a new agreement on its nuclear program. In a telephone interview with NBC News, Trump stated, “If they don’t make a deal, there will be bombing. It will be bombing the likes of which they have never seen before.”

President Donald Trump waves as he walks before departing for Florida from the South Lawn at the White House in Washington, D.C., U.S., on March 28, 2025.

REUTERS/Evelyn Hockstein

Is the bloom off the bromance between US President Donald Trump and Russian President Vladimir Putin? On Sunday, Trump took Putin to task over Russia’s foot-dragging on a ceasefire in Ukraine and threatened to tariff Russian oil and impose more sanctions on the country.

Rescuers work at the site of a building that collapsed after the strong earthquake in Mandalay, Myanmar, on Sunday, March 30, 2025.
REUTERS/Stringer

The death toll continues to rise in Myanmar after a devastating 7.7-magnitude earthquake struck near the central city of Mandalay on March 28. Approximately 1,700 people are dead and over 3,400 injured, with the US Geological Service estimating that casualties could top 10,000. Relief operations are further complicated by Myanmar’s ongoing civil war, though a two-week ceasefire was declared on Sunday.

Listen: Elon Musk, the world’s richest man, made his fortune-breaking industries—space, cars, social media—and is now trying to break the government… in the name of fixing it. But what happens when Silicon Valley’s ‘move fast and break things’ ethos collides with the machinery of federal bureaucracy? On the GZERO World Podcast, Ian Bremmer sits down with WIRED Global Editorial Director Katie Drummond to unpack the implications of Musk’s deepening role in the Trump administration and what’s really behind his push into politics.

France's President Emmanuel Macron speaks during a press conference following a summit for the "coalition of the willing" at the Elysee Palace in Paris on March 27, 2025.

LUDOVIC MARIN/Pool via REUTERS

At the third summit of the so-called “coalition of the willing” for Ukraine on Thursday, French President Emmanuel Macron proposed a multinational “reassurance force” to deter Russian aggression once a ceasefire is in place – and to engage if attacked.