Close Menu
Mondo NewsMondo News
  • Technology
  • Science
  • Blockchain
What's Hot
The Harm Of Toxic Positivity: How Relentless Optimism Can Negatively
Science

The harm of toxic positivity: How relentless optimism can negatively impact your health and mental wellbeing

Grok AI by Elon Musk Claims Trump Won the 2020
Technology

Grok AI by Elon Musk Claims Trump Won the 2020 Presidential Election

The Ideal Location Of Our Milky Way Galaxy For Discovering
Science

The Ideal Location of Our Milky Way Galaxy for Discovering Extraterrestrial Life

  • About Us
  • Privacy Policy
  • Terms & Conditions
Facebook X (Twitter) Instagram
Facebook X (Twitter) Instagram
Mondo NewsMondo News
  • Technology
    Exploring the Limitations of AI Safety Management Practices

    Exploring the Limitations of AI Safety Management Practices

    May 14, 2026
    What is the likelihood of an asteroid impacting Earth

    What is the likelihood of an asteroid impacting Earth?

    December 21, 2025
    Understanding Britains Debt Through Biscuits How Labour MPs Embrace Viral

    Understanding Britain’s Debt Through Biscuits: How Labour MPs Embrace Viral Trends

    December 5, 2025
    Tesla Launches Affordable Model 3 in Europe Amid Criticism of

    Tesla Launches Affordable Model 3 in Europe Amid Criticism of Mask Sales

    December 5, 2025
    Horror Game Horses Banned Is the Controversy Bigger Than You

    Horror Game Horses Banned: Is the Controversy Bigger Than You Think?

    December 5, 2025
  • Science
    Ancient Human Habitation Uncovered at 2000 Meters Experts Stunned by

    Ancient Human Habitation Uncovered at 2,000 Meters: Experts Stunned by Mountain Discovery

    June 2, 2026
    7 Reasons We Overtrust AI and the Hidden Costs Were

    7 Reasons We Overtrust AI and the Hidden Costs We’re Already Facing

    June 2, 2026
    Webb Space Telescope Discovers Methane in Interstellar Comet 3IATLAS

    Webb Space Telescope Discovers Methane in Interstellar Comet 3I/ATLAS

    June 2, 2026
    Newly Discovered Axolotl Fossil Unearthed in Mexico

    Newly Discovered Axolotl Fossil Unearthed in Mexico

    June 2, 2026
    Breakthrough Pancreatic Cancer Drug Doubles Survival Rates A Revolutionary Treatment

    Breakthrough Pancreatic Cancer Drug Doubles Survival Rates: A Revolutionary Treatment

    June 2, 2026
  • Blockchain
    Top 5 Best Altcoins Of 2024 Revealed: Etfs (etfs), Pepe

    Top 4 Altcoins Unveiled by Expert for 100x Portfolio Growth: Blockchain News, Opinion, TV, Jobs

    May 21, 2024
    Blockchain Experts Forecast Which Tokens Will Generate Profits

    Blockchain experts forecast which tokens will generate profits

    May 17, 2024
    The Leading Platform For Seasoned Traders Featuring Blockchain News,

    The Leading Platform for Seasoned Traders – Featuring Blockchain News, Insights, TV, and Job Listings

    May 8, 2024
    Darklume Fantasy Metaverse: Presale Now Available Latest Blockchain Updates,

    Darklume Fantasy Metaverse: Presale Now Available – Latest Blockchain Updates, Opinions, Television, and Job Listings

    April 30, 2024
    Sui Collaborates With Google Cloud To Drive Web3 Advancement Through

    Sui collaborates with Google Cloud to drive Web3 advancement through improved security, scalability, and AI features

    April 30, 2024
Mondo NewsMondo News
You are at:Home » DeepMind and OpenAI Achieve Victory in the International Mathematics Olympiad
DeepMind and OpenAI Achieve Victory in the International Mathematics Olympiad
Science July 22, 2025

DeepMind and OpenAI Achieve Victory in the International Mathematics Olympiad

Share
Facebook Twitter LinkedIn Pinterest Email

AIs are improving at solving mathematics challenges

Andresr/ Getty Images

AI models developed by Google DeepMind and OpenAI have achieved exceptional performance at the International Mathematics Olympiad (IMO).

While companies herald this as a significant advancement for AIs that might one day tackle complex scientific or mathematical challenges, mathematicians urge caution, as the specifics of the models and their methodologies remain confidential.

The IMO is one of the most respected contests for young mathematicians, often viewed by AI researchers as a critical test of mathematical reasoning, an area where AI traditionally struggles.

Following last year’s competition in Bath, UK, Google investigated how its AI systems, Alpha Proof and Alpha Jometry, achieved silver-level performance, though their submissions were not evaluated by the official competition judges.

Various companies, including Google, Huawei, and TikTok’s parent company, approached the IMO organizers requesting formal evaluation of their AI models during this year’s contest, as stated by Gregor Drinner, the President of IMO. The IMO consented, stipulating that results should be revealed only after the full closing ceremony on July 28th.

OpenAI also expressed interest in participating in the competition but did not respond or register upon being informed of the official procedures, according to Dolinar.

On July 19th, OpenAI announced the development of a new AI that achieved a gold medal score alongside three former IMO medalists, separately from the official competition. OpenAI stated the AI correctly answered five out of six questions within the same 4.5-hour time limit as human competitors.

Two days later, Google DeepMind revealed that its AI system, Gemini Deep Think, had also achieved gold-level performance within the same constraints. Dolinar confirmed that this result was validated by the official IMO judges.

Unlike Google’s Alpha Proof and Alpha Jometry, which were designed for competition, Gemini Deep Think was specifically crafted to tackle questions posed in a programming language used by both Google and OpenAI.

Utilizing LEAN, the AI was capable of quickly verifying correctness, although the output is challenging for non-experts to interpret. Thang Luong from Google indicated that a natural language approach can yield more comprehensible results while remaining applicable to broadly useful AI frameworks.

Luong noted that advancements in reinforcement learning—a training technique designed to guide AI through success and failure—have enabled large language models to validate solutions efficiently, a method essential to Google’s earlier achievements with gameplay AIs, such as AlphaZero.

Google’s model employs a technique known as parallel thinking, considering multiple solutions simultaneously. The training data comprises mathematical problems particularly relevant to the IMO.

OpenAI has disclosed few specifics regarding their system, only mentioning that it incorporates augmented learning and “experimental research methods.”

“While progress appears promising, it lacks rigorous scientific validation, making it difficult to assess at this point,” remarked Terence Tao from UCLA. “We anticipate that the participating companies will publish papers featuring more comprehensive data, allowing others to access the model and replicate its findings. However, for now, we must rely on the companies’ claims regarding their results.”

Geordy Williamson from the University of Sydney shared this sentiment, stating, “It’s remarkable to see advancements in this area, yet it’s frustrating how little in-depth information is available from inside these companies.”

Natural language systems might be beneficial for individuals without a mathematical background, but they also risk presenting complications if models produce lengthy proofs that are hard to verify, warned Joseph Myers, a co-organizer of this year’s IMO. “If AIs generate solutions to significant unsolved questions that seem plausible yet contain subtle, critical errors, we must be cautious before putting confidence in lengthy AI outputs.”

The companies plan to initially provide these systems for testing by mathematicians in the forthcoming months before making broader public releases. The models claim they could potentially offer rapid solutions for challenging problems in scientific research, as stated by June Hyuk Jeong from Google, who contributed to Gemini Deep Think. “There are numerous unresolved challenges within reach,” he noted.

Topics:

Source: www.newscientist.com

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
Previous ArticleImproved Air Quality Linked to Rise in Urban Heat Waves
Next Article The pandemic might have accelerated brain aging, even before we contracted Covid-19.

Related Posts

Ancient Human Habitation Uncovered at 2000 Meters Experts Stunned by
Science

Ancient Human Habitation Uncovered at 2,000 Meters: Experts Stunned by Mountain Discovery

7 Reasons We Overtrust AI and the Hidden Costs Were
Science

7 Reasons We Overtrust AI and the Hidden Costs We’re Already Facing

Webb Space Telescope Discovers Methane in Interstellar Comet 3IATLAS
Science

Webb Space Telescope Discovers Methane in Interstellar Comet 3I/ATLAS

Newly Discovered Axolotl Fossil Unearthed in Mexico
Science

Newly Discovered Axolotl Fossil Unearthed in Mexico

Breakthrough Pancreatic Cancer Drug Doubles Survival Rates A Revolutionary Treatment
Science

Breakthrough Pancreatic Cancer Drug Doubles Survival Rates: A Revolutionary Treatment

How Pigeons Use Superparamagnetic Immune Cells in Their Livers to
Science

How Pigeons Use Superparamagnetic Immune Cells in Their Livers to Detect Earth’s Magnetic Field

Leveraging Human Error as a Tactic Against Large Scale Language Models
Science

Leveraging Human Error as a Tactic Against Large-Scale Language Models

Exploring the Real Health Benefits of Turmeric and Curcumin
Science

Exploring the Real Health Benefits of Turmeric and Curcumin

Leave A Reply Cancel Reply

Stay In Touch
  • Facebook
  • Twitter
  • Instagram
  • Pinterest
Quote of the day

A good traveler has no fixed plans, and is not intent on arriving.

Lao Tzu
Exchange Rate

Exchange Rate EUR: Tue, 2 Jun.

Top Insights
Spinal Cord Stimulation Aids In The Recovery Of Stroke Patients Science

Spinal cord stimulation aids in the recovery of stroke patients

How Seasonal Rhythms of the Body Clock Influence Vaccine Effectiveness Science

How Seasonal Rhythms of the Body Clock Influence Vaccine Effectiveness

Thousands Of Devices Are Now Online Following Global Outage, According Technology

Thousands of devices are now online following global outage, according to CrowdStrike; Microsoft IT also affected

Categories
  • Blockchain (65)
  • Science (7,685)
  • Technology (2,968)
Top Posts
UK Government to Renew Dispute with Apple Over Access to

UK Government to Renew Dispute with Apple Over Access to User Data | Data Protection

October 2, 2025
Ai Invents New Battery Design That Decreases Lithium Usage By

AI invents new battery design that decreases lithium usage by 70%

January 9, 2024
Human Level AI is Inevitable Harnessing the Power to Influence the

Human-Level AI is Inevitable: Harnessing the Power to Influence the Journey | Garrison Nice

July 21, 2025

Mondo News is a Professional Technology & Science Blog. Here we will provide you with only exciting content that you will enjoy and find useful. We’re working to turn our passion into a successful website. We hope you enjoy our Content as much as we enjoy offering them to you.

Facebook X (Twitter) Instagram Pinterest
Categories
  • Blockchain (65)
  • Science (7,685)
  • Technology (2,968)
Most Popular
Understanding Sora Ai: A Comprehensive Guide To Openai's Text To Video Tools
Science

Understanding Sora AI: A Comprehensive Guide to OpenAI’s Text-to-Video Tools

CERN Physicists Discover New Exotic Particles Key Breakthrough in Particle
Science

CERN Physicists Discover New Exotic Particles: Key Breakthrough in Particle Physics

SiteLock
© 2026 Mondo News.
  • Home
  • About Us
  • Privacy Policy
  • Terms & Conditions

Type above and press Enter to search. Press Esc to cancel.

We are using cookies to give you the best experience on our website.

You can find out more about which cookies we are using or switch them off in .

Ad Blocker Enabled!
Ad Blocker Enabled!
Our website is made possible by displaying online advertisements to our visitors. Please support us by disabling your Ad Blocker.
Go to mobile version
Powered by  GDPR Cookie Compliance
Privacy Overview

This website uses cookies so that we can provide you with the best user experience possible. Cookie information is stored in your browser and performs functions such as recognising you when you return to our website and helping our team to understand which sections of the website you find most interesting and useful.

Strictly Necessary Cookies

Strictly Necessary Cookie should be enabled at all times so that we can save your preferences for cookie settings.