Looking inside the black box

Looking into the code.
Looking into the code.
DPA via Reuters
One of the biggest challenges facing artificial intelligence companies is that they don’t know everything about their algorithms. This so-called black box problem is exacerbated by the fact that deep learning models do precisely that — they learn. And when they learn they change. They take in enormous troves of data, detect patterns, and spit something out: How a sentence should read, what an image should look like, how a voice should sound.

But now researchers at Anthropic, the AI startup that makes the chatbot Claude, claim they’ve had a breakthrough in understanding their own model. In a blog post, Anthropic researchers disclosed that they’ve found 10 million “features” of their Claude 3 Sonnet language model, with certain patterns that pop up when a user inputs something it recognizes. They’ve been able to map features that are close to one another: One for the Golden Gate Bridge, for example, is close to another for Alcatraz Island, the Golden State Warrior, California Governor Gavin Newsom, and the Alfred Hitchcock film Vertigo — set in San Francisco. Knowing about these features allows Anthropic to turn them on or off, manipulating the model to break out of its typical mold.

This development offers hope that the companies behind powerful generative AI models will soon have much more control over their creations, as MIT professor Jacob Andreas told theNew York Times. “In the same way that understanding basic things about how people work has helped us cure diseases,” Andreas said, “understanding how these models work will both let us recognize when things are about to go wrong and let us build better tools for controlling them.”

More from GZERO Media

Last week, Microsoft committed $15.2 billion to the UAE. This strategic investment expands cloud and AI infrastructure in the Middle East. It aims to boost regional innovation, economic diversification, and digital resilience. The move underscores tech’s role in shaping global competitiveness and security. A milestone for the UAE — and a signal of where the digital future is headed. Read the full blog here.

US President Donald Trump welcomes Indian Prime Minister Narendra Modi to the White House for bilateral discussions about trade and security on February 13, 2025.
India PM Office handout via EYEPRESS

After months of tensions between the world’s richest country and the world’s most populous one, it appears that the United States and India are on the verge of making a trade deal.

Members of the media gather outside Broadcasting House, the BBC headquarters in central London, as BBC Director General Tim Davie and BBC News CEO Deborah Turness resign following accusations of bias and the controversy surrounding the editing of the Trump speech before the Capitol riots on 6 January 2021 in a BBC Panorama documentary.
(Credit Image: © Vuk Valcic/ZUMA Press Wire)

+26: Two BBC leaders, Director-General Tim Davie and BBC News Head Deborah Turness, resigned on Sunday after it emerged that the British news organization edited footage of US President Donald Trump in a misleading fashion.

Senate Minority Leader Chuck Schumer (D-NY) heads back to his office following a press conference at the U.S. Capitol on November 5, 2025 in Washington, D.C. The shutdown of the Federal Government has become the longest in U.S. history after surpassing the 35 day shutdown that occurred during President Trumps first term that began in the end of 2018.
(Photo by Samuel Corum/Sipa USA)