Looking inside the black box

Looking into the code.
DPA via Reuters
One of the biggest challenges facing artificial intelligence companies is that they don’t know everything about their algorithms. This so-called black box problem is exacerbated by the fact that deep learning models do precisely that — they learn. And when they learn they change. They take in enormous troves of data, detect patterns, and spit something out: How a sentence should read, what an image should look like, how a voice should sound.

But now researchers at Anthropic, the AI startup that makes the chatbot Claude, claim they’ve had a breakthrough in understanding their own model. In a blog post, Anthropic researchers disclosed that they’ve found 10 million “features” of their Claude 3 Sonnet language model, patterns that pop up when a user inputs something the model recognizes. They’ve been able to map features that are close to one another: The feature for the Golden Gate Bridge, for example, sits close to those for Alcatraz Island, the Golden State Warriors, California Governor Gavin Newsom, and the Alfred Hitchcock film Vertigo, which is set in San Francisco. Knowing about these features allows Anthropic to turn them on or off, manipulating the model to break out of its typical mold.

This development offers hope that the companies behind powerful generative AI models will soon have much more control over their creations, as MIT professor Jacob Andreas told the New York Times. “In the same way that understanding basic things about how people work has helped us cure diseases,” Andreas said, “understanding how these models work will both let us recognize when things are about to go wrong and let us build better tools for controlling them.”
