Looking inside the black box

Looking into the code.
Looking into the code.
DPA via Reuters
One of the biggest challenges facing artificial intelligence companies is that they don’t know everything about their algorithms. This so-called black box problem is exacerbated by the fact that deep learning models do precisely that — they learn. And when they learn they change. They take in enormous troves of data, detect patterns, and spit something out: How a sentence should read, what an image should look like, how a voice should sound.

But now researchers at Anthropic, the AI startup that makes the chatbot Claude, claim they’ve had a breakthrough in understanding their own model. In a blog post, Anthropic researchers disclosed that they’ve found 10 million “features” of their Claude 3 Sonnet language model, with certain patterns that pop up when a user inputs something it recognizes. They’ve been able to map features that are close to one another: One for the Golden Gate Bridge, for example, is close to another for Alcatraz Island, the Golden State Warrior, California Governor Gavin Newsom, and the Alfred Hitchcock film Vertigo — set in San Francisco. Knowing about these features allows Anthropic to turn them on or off, manipulating the model to break out of its typical mold.

This development offers hope that the companies behind powerful generative AI models will soon have much more control over their creations, as MIT professor Jacob Andreas told theNew York Times. “In the same way that understanding basic things about how people work has helped us cure diseases,” Andreas said, “understanding how these models work will both let us recognize when things are about to go wrong and let us build better tools for controlling them.”

More from GZERO Media

A miniature statue of US President Donald Trump stands next to a model bunker-buster bomb, with the Iranian national flag in the background, in Kananaskis, Alberta, Canada, on June 19, 2025.
STR/NurPhoto

US President Donald Trump said Thursday that he will decide whether to bomb Iran’s nuclear facilities “in the next two weeks,” a move that re-opens the door to negotiations, but also gives the US more time to position military forces for an operation.

People ride motorcycles as South Korea's LGBTQ community and supporters attend a Pride parade, during the Seoul Queer Culture Festival, in Seoul, South Korea, June 14, 2025.
REUTERS/Kim Soo-hyeon

June is recognized in more than 100 countries in the world as “Pride Month,” marking 55 years since gay liberation marches began commemorating the Stonewall riots – a pivotal uprising against the police’s targeting of LGBTQ+ communities in New York.

Port of Nice, France, during the United Nations Oceans Conference in June 2025.
María José Valverde

Eurasia Group’s biodiversity and sustainability analyst María José Valverde sat down with Rebecca Hubbard, the director of the High Seas Alliance, to discuss the High Seas Treaty.

Housing shortages in the US and Canada have become a significant problem – and a contentious political issue – in recent years. New data on housing construction this week suggest neither country is making enough progress to solve the shortfalls. Here’s a snapshot of the situation on both sides of the border.

Ontario Premier Doug Ford speaks during a meeting of northeastern U.S. Governors and Canadian Premiers, in Boston, Massachusetts, U.S., June 16, 2025.
REUTERS/Sophie Park

While the national level drama played out between Donald Trump and Mark Carney at the G7 in Kananaskis, a lot of important US-Canada work was going on with far less fanfare in Boston, where five Canadian premiers met with governors and delegations from seven US states.

- YouTube

What’s next for Iran’s regime? Ian Bremmer says, “It’s much more likely that the supreme leader ends up out, but the military… continues to run the country.”

Enbridge’s 2024 Sustainability Report is now available, outlining our approach to meeting today’s energy needs while advancing solutions for tomorrow. Now in its 24th year, the report reflects our ongoing commitment to being a safe operator of essential energy infrastructure and a responsible environmental steward, principles at the heart of our mission to be North America’s first-choice energy delivery company. Highlights include a 40% reduction in emissions intensity, surpassing our 2030 target, and a 22% drop in absolute emissions since setting our goals in 2020. Explore the 2024 Sustainability Report today.