Breaking India News Today | In-Depth Reports & Analysis – IndiaNewsWeek
© 2024 All Rights Reserved | Powered by India News Week
Why AI Breaks Bad

Why AI Goes Rogue: Unraveling the Dark Side of Technology

October 29, 2025

Still, the models are improving much faster than the efforts to understand them. And the Anthropic team admits that as AI agents proliferate, the theoretical criminality of the lab grows ever closer to reality. If we don’t crack the black box, it might crack us.

“[My] life has been focused on trying to do things I believe are important. When I was 18, I dropped out of university to support a friend accused of terrorism, because I believe it’s most important to support people when others don’t. When he was found innocent, I noticed that deep learning was going to affect society, and dedicated myself to figuring out how humans could understand neural networks. I’ve spent the last decade working on that because I think it could be one of the keys to making AI safe.”

So begins Chris Olah’s “date me doc,” which he posted on Twitter in 2022. He’s no longer single, but the doc remains on his GitHub site “since it was an important document for me,” he writes.

Olah’s description leaves out a few things, including that despite not earning a university degree he’s an Anthropic cofounder. A less significant omission is that he received a Thiel Fellowship, which bestows $100,000 on talented dropouts. “It gave me a lot of flexibility to focus on whatever I thought was important,” he told me in a 2024 interview. Spurred by reading articles in WIRED, among other things, he tried building 3D printers. “At 19, one doesn’t necessarily have the best taste,” he admitted. Then, in 2013, he attended a seminar series on deep learning and was galvanized. He left the sessions with a question that no one else seemed to be asking: What’s going on in those systems?

Olah had difficulty interesting others in the question. When he joined Google Brain as an intern in 2014, he worked on a strange product called Deep Dream, an early experiment in AI image generation. The neural net produced bizarre, psychedelic patterns, almost as if the software was on drugs. “We didn’t understand the results,” says Olah. “But one thing they did show is that there’s a lot of structure inside neural networks.” At least some elements, he concluded, could be understood.

Olah set out to find such elements. He cofounded a scientific journal called Distill to bring “more transparency” to machine learning. In 2018, he and a few Google colleagues published a paper in Distill called “The Building Blocks of Interpretability.” They’d identified, for example, that specific neurons encoded the concept of floppy ears. From there, Olah and his coauthors could figure out how the system knew the difference between, say, a Labrador retriever and a tiger cat. They acknowledged in the paper that this was only the beginning of deciphering neural nets: “We need to make them human scale, rather than overwhelming dumps of information.”

The paper was Olah’s swan song at Google. “There actually was a sense at Google Brain that you weren’t very serious if you were talking about AI safety,” he says. In 2018 OpenAI offered him the chance to form a permanent team on interpretability. He jumped. Three years later, he joined a group of his OpenAI colleagues to cofound Anthropic.
