Breaking India News Today | In-Depth Reports & Analysis – IndiaNewsWeek
  • Home
  • Nation
  • Politics
  • Economy
  • Sports
  • Entertainment
  • International
  • Technology
  • Auto News
Reading: Chinese AI Startup DeepSeek Develops Competitive Model Against OpenAI
Share
Breaking India News Today | In-Depth Reports & Analysis – IndiaNewsWeekBreaking India News Today | In-Depth Reports & Analysis – IndiaNewsWeek
Search
  • Home
  • Nation
  • Politics
  • Economy
  • Sports
  • Entertainment
  • International
  • Technology
  • Auto News
© 2024 All Rights Reserved | Powered by India News Week
How Chinese AI Startup DeepSeek Made a Model that Rivals OpenAI
Breaking India News Today | In-Depth Reports & Analysis – IndiaNewsWeek > Technology > Chinese AI Startup DeepSeek Develops Competitive Model Against OpenAI
Technology

Chinese AI Startup DeepSeek Develops Competitive Model Against OpenAI

January 27, 2025 5 Min Read
Share
SHARE

Currently, DeepSeek stands as one of the few prominent AI companies in China that operates independently of financial support from tech behemoths like Baidu, Alibaba, or ByteDance.

A Young Collective of Innovators Eager to Make Their Mark

Liang shared that when he assembled the research team at DeepSeek, his goal was not to recruit seasoned engineers to develop a consumer-oriented product. Instead, he sought PhD candidates from China’s leading universities, such as Peking University and Tsinghua University, who were eager to showcase their capabilities. Many of these individuals had been published in prestigious journals and received accolades at international conferences, but still lacked practical experience, according to the Chinese tech outlet QBitAI.

“Our primary technical roles are mainly occupied by individuals who graduated this year or within the last couple of years,” Liang told 36Kr in 2023. This hiring approach has fostered a collaborative environment where team members can freely utilize abundant computing resources to engage in unconventional research endeavors. This contrasts sharply with established internet firms in China, where teams often vie for limited resources. (A recent illustration: ByteDance accused a former intern—an esteemed award winner, no less—of undermining his colleagues’ projects to seize more computing power for his own team.)

Liang believes that students may be better suited for high-risk, low-reward research settings. “When young, many individuals can fully commit themselves to a cause without self-serving motivations,” he elucidated. His message to potential recruits is that DeepSeek was founded to tackle “the most challenging questions facing the world.”

Experts suggest that the nearly exclusive education of these young researchers within China contributes to their motivation. “This younger generation also embodies a sense of nationalism, especially as they navigate US restrictions and challenges in critical hardware and software technologies,” explains Zhang. “Their resolve to transcend these obstacles reflects not only personal ambition but also a broader dedication to elevating China’s status as a global leader in innovation.”

Innovation Arising from Adversity

In October 2022, the US government began instituting export controls that significantly limited Chinese AI firms’ access to advanced chips like Nvidia’s H100. This posed a substantial challenge for DeepSeek. Although the company had initially secured a stockpile of 10,000 H100s, it required additional resources to compete with entities like OpenAI and Meta. “Our challenge has never been funding; it has been the export restrictions on advanced chips,” Liang told 36Kr in a follow-up interview in 2024.

DeepSeek was compelled to devise more efficient methods for training its models. “They fine-tuned their model architecture through a series of engineering strategies—custom communication schemes between chips, minimizing field sizes to conserve memory, and innovative application of the mix-of-models approach,” remarks Wendy Chang, a software engineer who transitioned to a policy analyst role at the Mercator Institute for China Studies. “While many of these tactics are not novel, adeptly merging them to create a state-of-the-art model is an impressive accomplishment.”

DeepSeek has also made notable advancements in Multi-head Latent Attention (MLA) and Mixture-of-Experts, two technical frameworks that enhance the cost-effectiveness of DeepSeek’s models by necessitating fewer computing resources for training. In fact, their latest model is so effective that it utilized just one-tenth of the computational power needed to train Meta’s corresponding Llama 3.1 model, according to the research organization Epoch AI.

DeepSeek’s openness in sharing these innovations with the public has garnered significant goodwill within the global AI research community. For many Chinese AI enterprises, developing open-source models represents a viable strategy to catch up with their Western peers, as it attracts a broader user base and contributors, aiding in the models’ development. “They have successfully shown that cutting-edge models can be constructed with less, albeit still significant, investment and that the prevailing standards of model development leave substantial room for refinement,” Chang expresses. “We are certain to witness more initiatives along these lines in the near future.”

This trend could pose challenges for the existing US export controls aimed at creating bottlenecks in computing resources. “Current estimates of China’s AI computing capacity, and what they can accomplish with it, could be significantly altered,” Chang notes.

TAGGED:EducationTechnology
Share This Article
Twitter Copy Link
Previous Article Markets extend losses on FII selling, tech stocks drag FII selling drags markets down as tech stocks continue to slide
Next Article Adani stocks mixed after Hindenburg research announces closure  Adani Total Gas experiences 15% volume surge with supply shifts.
Leave a comment Leave a comment

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Latest News

Chandresh Dedhia exits Zepto

Chandresh Dedhia exits Zepto Rewrite this headline into a unique, engaging, SEO-friendly news title. Use only English. Maximum 12 words. Output only the new title.

November 7, 2025
Financials propel market recovery as Nifty, Sensex snap back from day’s lows

Financials propel market recovery as Nifty, Sensex snap back from day’s lows Rewrite this headline into a unique, engaging, SEO-friendly news title. Use only English. Maximum 12 words. Output only the new title.

November 7, 2025
Nykaa Q2 profit jumps 154% YoY to ₹33 crore; revenue up 25% to ₹2,346 crore

Nykaa Q2 profit jumps 154% YoY to ₹33 crore; revenue up 25% to ₹2,346 crore Rewrite this headline into a unique, engaging, SEO-friendly news title. Use only English. Maximum 12 words. Output only the new title.

November 7, 2025
LIC shares edge higher as analysts eye growth despite mixed Q2

LIC shares edge higher as analysts eye growth despite mixed Q2 Rewrite this headline into a unique, engaging, SEO-friendly news title. Use only English. Maximum 12 words. Output only the new title.

November 7, 2025
Markets sink on global selloff; Sensex down 564 points, Nifty below 25,350

Markets sink on global selloff; Sensex down 564 points, Nifty below 25,350 Rewrite this headline into a unique, engaging, SEO-friendly news title. Use only English. Maximum 12 words. Output only the new title.

November 7, 2025
'Isn’t Rs 4 lakh enough?’: SC questions Shami’s ex-wife Hasin Jahan in alimony case

SC Questions Hasin Jahan: Is Rs 4 Lakh Alimony Not Enough?

November 7, 2025

You Might Also Like

Top 5 CX trends that will shape 2025: Study by SurveySensum
Technology

Five Customer Experience Trends Predicted to Transform 2025: Insights from SurveySensum

5 Min Read
TikTok Is Already Back Online
Technology

TikTok Resumes Service After Brief Interruption: What You Need to Know

5 Min Read
Government Tech Workers Forced to Defend Projects to Random Elon Musk Bros
Technology

Government Tech Workers Pressure-Test Projects in Front of Unqualified Elon Musk Enthusiasts

4 Min Read
AI and the End of Accents
Technology

How AI is Shaping the Future of Accents and Communication

5 Min Read
Breaking India News Today | In-Depth Reports & Analysis – IndiaNewsWeek
Breaking India News Today | In-Depth Reports & Analysis – IndiaNewsWeek

Welcome to IndiaNewsWeek, your reliable source for all the essential news and insights from across the nation. Our mission is to provide timely and accurate news that reflects the diverse perspectives and voices within India.

  • Home
  • Nation News
  • Economy News
  • Politics News
  • Sports News
  • Technology
  • Entertainment
  • International
  • Auto News
  • Bookmarks
  • About us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms of Service
  • Home
  • Nation
  • Politics
  • Economy
  • Sports
  • Entertainment
  • International
  • Technology
  • Auto News
  • About us
  • Contact Us
  • Disclaimer
  • Privacy Policy
  • Terms of Service

© 2024 All Rights Reserved | Powered by India News Week

Welcome Back!

Sign in to your account

Lost your password?