Looking inside the black box

GZERO AI
By Scott Nover, May 28, 2024
Scott Nover is the lead writer for GZERO AI. He's a contributing writer for Slate and was previously a staff writer at Quartz and Adweek. His writing has appeared in The Atlantic, Fast Company, Vox.com, and The Washington Post, among other outlets. He currently lives near Washington, DC, with his wife and pup.
One of the biggest challenges facing artificial intelligence companies is that they don’t know everything about their algorithms. This so-called black box problem is exacerbated by the fact that deep learning models do precisely that: they learn. And when they learn, they change. They take in enormous troves of data, detect patterns, and spit something out: how a sentence should read, what an image should look like, how a voice should sound.
But now researchers at Anthropic, the AI startup that makes the chatbot Claude, claim they’ve had a breakthrough in understanding their own model. In a blog post, Anthropic researchers disclosed that they’ve found 10 million “features” of their Claude 3 Sonnet language model, each a pattern that lights up when a user inputs something the model recognizes. They’ve been able to map features that are close to one another: The one for the Golden Gate Bridge, for example, sits close to features for Alcatraz Island, the Golden State Warriors, California Governor Gavin Newsom, and the Alfred Hitchcock film Vertigo, which is set in San Francisco. Knowing about these features allows Anthropic to turn them on or off, manipulating the model to break out of its typical mold.
This development offers hope that the companies behind powerful generative AI models will soon have much more control over their creations, as MIT professor Jacob Andreas told the New York Times. “In the same way that understanding basic things about how people work has helped us cure diseases,” Andreas said, “understanding how these models work will both let us recognize when things are about to go wrong and let us build better tools for controlling them.”