Breaking India News Today | In-Depth Reports & Analysis – IndiaNewsWeek

Why AI Goes Rogue: Unraveling the Dark Side of Technology

By Technology Desk | October 29, 2025 | 5 Min Read

Still, the models are improving much faster than the efforts to understand them. And the Anthropic team admits that as AI agents proliferate, the theoretical criminality of the lab grows ever closer to reality. If we don’t crack the black box, it might crack us.

“... life has been focused on trying to do things I believe are important. When I was 18, I dropped out of university to support a friend accused of terrorism, because I believe it’s most important to support people when others don’t. When he was found innocent, I noticed that deep learning was going to affect society, and dedicated myself to figuring out how humans could understand neural networks. I’ve spent the last decade working on that because I think it could be one of the keys to making AI safe.”

So begins Chris Olah’s “date me doc,” which he posted on Twitter in 2022. He’s no longer single, but the doc remains on his GitHub site “since it was an important document for me,” he writes.

Olah’s description leaves out a few things, including that, despite not earning a university degree, he’s an Anthropic cofounder. A less significant omission is that he received a Thiel Fellowship, which bestows $100,000 on talented dropouts. “It gave me a lot of flexibility to focus on whatever I thought was important,” he told me in a 2024 interview. Spurred by reading articles in WIRED, among other things, he tried building 3D printers. “At 19, one doesn’t necessarily have the best taste,” he admitted. Then, in 2013, he attended a seminar series on deep learning and was galvanized. He left the sessions with a question that no one else seemed to be asking: What’s going on in those systems?

Olah had difficulty interesting others in the question. When he joined Google Brain as an intern in 2014, he worked on a strange product called Deep Dream, an early experiment in AI image generation. The neural net produced bizarre, psychedelic patterns, almost as if the software was on drugs. “We didn’t understand the results,” says Olah. “But one thing they did show is that there’s a lot of structure inside neural networks.” At least some elements, he concluded, could be understood.

Olah set out to find such elements. He cofounded a scientific journal called Distill to bring “more transparency” to machine learning. In 2018, he and a few Google colleagues published a paper in Distill called “The Building Blocks of Interpretability.” They’d identified, for example, that specific neurons encoded the concept of floppy ears. From there, Olah and his coauthors could figure out how the system knew the difference between, say, a Labrador retriever and a tiger cat. They acknowledged in the paper that this was only the beginning of deciphering neural nets: “We need to make them human scale, rather than overwhelming dumps of information.”

The paper was Olah’s swan song at Google. “There actually was a sense at Google Brain that you weren’t very serious if you were talking about AI safety,” he says. In 2018 OpenAI offered him the chance to form a permanent team on interpretability. He jumped. Three years later, he joined a group of his OpenAI colleagues to cofound Anthropic.
