Looking inside the black box

Looking into the code.
Looking into the code.
DPA via Reuters
One of the biggest challenges facing artificial intelligence companies is that they don’t know everything about their algorithms. This so-called black box problem is exacerbated by the fact that deep learning models do precisely that — they learn. And when they learn they change. They take in enormous troves of data, detect patterns, and spit something out: How a sentence should read, what an image should look like, how a voice should sound.

But now researchers at Anthropic, the AI startup that makes the chatbot Claude, claim they’ve had a breakthrough in understanding their own model. In a blog post, Anthropic researchers disclosed that they’ve found 10 million “features” of their Claude 3 Sonnet language model, with certain patterns that pop up when a user inputs something it recognizes. They’ve been able to map features that are close to one another: One for the Golden Gate Bridge, for example, is close to another for Alcatraz Island, the Golden State Warrior, California Governor Gavin Newsom, and the Alfred Hitchcock film Vertigo — set in San Francisco. Knowing about these features allows Anthropic to turn them on or off, manipulating the model to break out of its typical mold.

This development offers hope that the companies behind powerful generative AI models will soon have much more control over their creations, as MIT professor Jacob Andreas told theNew York Times. “In the same way that understanding basic things about how people work has helped us cure diseases,” Andreas said, “understanding how these models work will both let us recognize when things are about to go wrong and let us build better tools for controlling them.”

More from GZERO Media

Army Cpl. Rogelio Argueta, Patriot Launching Station Enhanced Operator-Maintainer, assigned with Task Force Talon, 94th Army Air and Missile Defense Command gives commands, during a practice missile reload and unload drills on a Terminal High Altitude Area Defense (THAAD) system trainer at Andersen Air Force Base, Guam.
Photo by Capt. Adan Cazarez/U.SS Army via ABACAPRESS.COM

The Biden administration is sending an anti-ballistic missile system to Israel to bolster the Jewish state’s defenses against potential Iranian attacks and underscore Washington’s “ironclad commitment” to Israel’s defense, the Pentagon said Sunday.

FILE PHOTO: Members of media speak in front of cameras outside the premises of the Supreme Court in New Delhi, India October 13, 2022. REUTERS/Anushree Fadnavis/File Photo
REUTERS

India’s Supreme Court is hearing petitions this month and will soon rule on whether to criminalize marital rape, but the government opposes the idea, stating it would be “excessively harsh.”

Vice President Kamala Harris waves to members of the media as she boards Air Force Two at Sky Harbor in Phoenix on Oct. 11, 2024.
USA TODAY NETWORK via Reuters Connect

Vice President Kamala Harris released her medical records this weekend, confirming she is in “excellent health” and “possesses the physical and mental resiliency” necessary for the presidency.

People cast their votes during general election in Utena, Lithuania October 13, 2024.
REUTERS/Ints Kalnins

Lithuanians voted in the first round of general elections on Sunday, where they look likely to empower a center-left coalition and reject far-right populists.

From social engineering scams to ransomware to disinformation campaigns, cybersecurity risks are rampant and growing, yet there is a huge global cyber tech talent shortage. Mastercard’s signature Girls4Tech STEM education program hosted a unique futurecasting event for Cybersecurity Awareness Month to harness the cyber insights of middle-school students while also encouraging them to envision themselves as the cyber professionals of tomorrow. Learn more here.

Listen: On the GZERO World Podcast, Ian Bremmer sits down with author and historian Timothy Snyder to discuss the importance of freedom in the final stretch of one of the closest and most contentious presidential races in modern history. Snyder uses his new book, “On Freedom,” to discuss the many ways freedom has been used and, often, misused in politics and society.

Israeli defense minister Yoav Gallant speaks next to prime minister Benjamin Netanyahu during a press conference in the Kirya military base in Tel Aviv , Israel , 28 October 2023.
ABIR SULTAN POOL/Pool via REUTERS

Israel’s cabinet met Thursday night to debate and vote on a response to Iran’s Oct. 1 missile barrage, but the results have not been made public. Iran’s attack on the Jewish state last week came in response to Israel killing high-level members of the Islamic Revolutionary Guards Corps.