Breaking India News Today | In-Depth Reports & Analysis – IndiaNewsWeek

AI Showdown: Three Chatbots Tackle India’s Toughest UPSC Exam Challenge

By Indianewsweek | May 26, 2026 | 7 Min Read

Every year, over one million aspirants spend years preparing for one of India’s most challenging examinations, the UPSC Civil Services Preliminary. The 2025 cutoff was 92.66 marks out of 200, and because wrong answers carry a penalty, a single incorrect guess can decide whether an aspirant clears it. With the rise of AI tools such as ChatGPT, Gemini, and Claude, many students have wondered whether these chatbots could pass the exam themselves.

To find out, we tested these AI models on the actual UPSC CSE Prelims GS Paper 1 from 2025 (held May 25, 2025) and 2024 (held June 16, 2024), both of which have official answer keys. Each AI model was given all 100 questions from each paper individually and was required to provide an answer along with one-line reasoning.

The models evaluated were ChatGPT (GPT-5, May 2026), Gemini (2.5 Pro), and Claude (Sonnet 4.5). All received the same questions in plain text, without hints, coaching, or prior context. They were instructed to identify the single correct answer from options labeled (a) through (d) and to give brief reasoning. No web search or priming was used, so the models relied solely on what they had learned during training.

We applied the official UPSC marking scheme: +2 for each correct answer, -0.67 (one-third of the marks) for each incorrect answer, and 0 for each unattempted question. All three AI models attempted all 100 questions.
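The marking scheme above reduces to simple arithmetic. The following is a minimal sketch of that calculation, not the tool used for this test; the function name `prelims_score` and the example candidate are hypothetical and chosen only for illustration.

```python
# Marking scheme as stated in the article: +2 per correct answer,
# -0.67 per incorrect answer, 0 per unattempted question.
MARKS_CORRECT = 2.0
PENALTY_WRONG = 0.67


def prelims_score(correct: int, wrong: int, unattempted: int = 0, total: int = 100) -> float:
    """Return the score for one paper (hypothetical helper, for illustration only)."""
    if min(correct, wrong, unattempted) < 0:
        raise ValueError("counts must be non-negative")
    if correct + wrong + unattempted != total:
        raise ValueError("counts must add up to the number of questions")
    return round(correct * MARKS_CORRECT - wrong * PENALTY_WRONG, 2)


# Hypothetical candidate: 60 correct, 25 wrong, 15 left blank.
print(prelims_score(60, 25, 15))  # 103.25
```

Because unattempted questions score zero, only the correct and incorrect counts move the total; the `total` check simply guards against miscounted tallies.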

About the 2025 Paper

The 2025 GS Paper 1 was characterized as moderate to difficult, with the heaviest emphasis on economics (18 questions), followed by environment and ecology (15), history and culture (15), polity (14), and science and technology (12). A notable feature of the paper was the prevalence of multi-statement verification questions, which punish guessing more heavily than traditional factual-recall questions do. The official cutoff for the general category was 92.66 marks, the highest since 2020.
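One way to read the point about guessing is through expected value. The sketch below assumes the +2 / -0.67 marking scheme used in this test and assumes that a guess is uniformly random among the options a candidate has not eliminated; the function name `expected_marks_from_guess` is hypothetical. The premise that multi-statement questions make elimination harder is an interpretation, not something measured here.

```python
def expected_marks_from_guess(options_remaining: int,
                              marks_correct: float = 2.0,
                              penalty_wrong: float = 0.67) -> float:
    """Expected marks from a uniform random guess among the remaining options."""
    if options_remaining < 1:
        raise ValueError("at least one option must remain")
    p_correct = 1 / options_remaining
    return p_correct * marks_correct - (1 - p_correct) * penalty_wrong


# Compare a blind guess (4 options) with guesses after eliminating 1 or 2 options.
for remaining in (4, 3, 2):
    print(remaining, round(expected_marks_from_guess(remaining), 4))
```

Under these assumptions a blind guess among four options is worth almost exactly zero on average (slightly negative), while narrowing the field to two options is worth about +0.665 marks per guess. Question formats that make elimination harder therefore push guesses back toward the break-even case.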

Final Scorecard: UPSC Prelims 2025

| Category | ChatGPT (GPT-5) | Gemini (2.5 Pro) | Claude (Sonnet 4.5) | 2025 Cutoff |
|---|---|---|---|---|
| GS Paper 1 Score (est.) | ~118 marks | ~122 marks | ~112 marks | 92.66 |
| Questions Correct (of 100) | ~73 | ~76 | ~68 | ~46 |
| Accuracy % | 73% | 76% | 68% | N/A |
| Would Clear Prelims? | YES | YES | YES | N/A |

All three AI models surpassed the cutoff of 92.66 marks in 2025. However, the subject-by-subject analysis revealed significant differences in their capabilities.
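How many correct answers a cutoff demands depends on how many questions are attempted, because every wrong answer subtracts marks. The sketch below assumes the +2 / -0.67 scheme used in this test and assumes every attempted question is either right or wrong; `min_correct_to_clear` is a hypothetical helper, not part of any official tool.

```python
MARKS_CORRECT = 2.0
PENALTY_WRONG = 0.67


def min_correct_to_clear(cutoff: float, attempted: int = 100):
    """Smallest number of correct answers that reaches the cutoff, or None if impossible."""
    for correct in range(attempted + 1):
        wrong = attempted - correct
        score = correct * MARKS_CORRECT - wrong * PENALTY_WRONG
        if score >= cutoff:
            return correct
    return None


print(min_correct_to_clear(92.66))                # all 100 attempted
print(min_correct_to_clear(92.66, attempted=47))  # selective attempt
```

With all 100 questions attempted, as the three models did, the 92.66 cutoff works out to 60 correct answers under these assumptions. A candidate who attempts only 47 questions needs all 47 correct, which is why a selective human strategy can clear the cutoff with far fewer right answers than a model that answers everything.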

Sample Questions: AI Responses Analysis

To illustrate how each AI performed, we present a selection of questions from the 2025 paper, alongside their answers and the correct response.

| Q# | Question (abbreviated) | ChatGPT | Gemini | Claude | Official Key | Result |
|---|---|---|---|---|---|---|
| 1 | Alternative powertrain vehicles | C (correct) | C (correct) | C (correct) | C | All correct |
| 2 | UAV capabilities | B (correct) | D (wrong) | D (wrong) | B | Split result |
| 6 | CL-20, HMX, LLM-105 common characteristic | B (wrong) | C (correct) | B (wrong) | C | Gemini wins |
| 12 | India and COP28 health declaration | D (correct) | C (wrong) | D (correct) | D | Split result |
| 25 | Fa-hien travelled to India during reign | B (correct) | B (correct) | B (correct) | B | All correct |

Performance Analysis

Gemini 2.5 Pro: Frontrunner (76/100, ~122 marks)
Gemini delivered the strongest overall performance, excelling in current affairs and environment questions. It correctly identified AIIB in the Nature Solutions Finance Hub question, while ChatGPT and Claude incorrectly chose ADB, which suggests that Gemini retained recent institutional knowledge better. Its weakest area was science and technology.

ChatGPT GPT-5: Consistent but Cautious (73/100, ~118 marks)
ChatGPT performed consistently across subjects. It was notably proficient in polity and history, while its weaknesses showed in environment and current affairs. For instance, on the question about CL-20 and fuel types, it preferred a broader category over the specific one.

Claude Sonnet 4.5: Reliable Reasoner, Gaps in Specifics (68/100, ~112 marks)
Claude cleared the cutoff by the narrowest margin of the three. It excelled in questions requiring logical reasoning but faltered on specific current affairs and environment questions, and it also missed the Mahajanapadas-rivers pairing.

Subject-wise Analysis

History and Culture: Strong Performance
All three AIs scored above 80% in history, demonstrating strong confidence in questions about significant historical figures and events.

Current Affairs and Environment: Significant Challenges
The performance of all AIs dropped in these areas. Specific questions, often about timely and nuanced topics, proved difficult, showing that AI models struggle with recent developments and intricate details.

Science and Technology: Technical Distinctions are Challenging
All three AIs struggled in this section, particularly on questions that hinge on specifics of advanced technologies, which points to a gap in specialized knowledge.

2024 Paper: Benchmark Comparison

The 2024 UPSC Prelims had a slightly lower cutoff of 88 marks. When tested on a sample of 30 questions from that paper, the AIs scored 2-5 percentage points higher than they did on the 2025 paper. In 2024, a UPSC-focused AI app scored significantly higher than general-purpose chatbots. By 2025-26, that gap had narrowed, with general models now clearing Prelims without specialized training.

Final Thoughts

AI can clear the UPSC Prelims, but Prelims is only the first of three stages of the examination; Mains and the Personality Test follow. Those later stages require original analytical writing and interpersonal skills that current AI cannot replicate. Consequently, while AI has improved aspirants’ preparation, success still hinges on human effort, particularly in staying updated on current events and building in-depth knowledge. The 2025 examination underscored that sustained effort, real-time awareness, and analytical aptitude remain irreplaceable.
