Close Menu
News Frame For You — Latest Updates on AI, Sports, Europe, Asia & Business
  • Home
  • AI
  • Asia
  • Business
  • Education
  • Europe
  • Life & Style
  • Sports
  • USA
  • Store

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

What's Hot

John Harbaugh interviews in person for the Giants’ coaching vacancy, AP source says

January 15, 2026

Real Madrid crash out of Copa del Rey at lowly Albacete on Arbeloa debut | Football News

January 15, 2026

Oglala Sioux Tribe president says three tribal citizens transferred to ICE facility

January 15, 2026
Facebook X (Twitter) Instagram
News Frame For You — Latest Updates on AI, Sports, Europe, Asia & Business
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
  • Home
  • AI
  • Asia
  • Business
  • Education
  • Europe
  • Life & Style
  • Sports
  • USA
  • Store
News Frame For You — Latest Updates on AI, Sports, Europe, Asia & Business
Home » AI models are starting to crack high-level math problems 
AI

AI models are starting to crack high-level math problems 

adminBy adminJanuary 14, 2026No Comments4 Mins Read
Share Facebook Twitter Pinterest LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email Copy Link


Over the weekend, Neel Somani, who is a software engineer, former quant researcher, and a startup founder, was testing the math skills of OpenAI’s new model when he made an unexpected discovery. After pasting the problem into ChatGPT and letting it think for 15 minutes, he came back to a full solution. He evaluated the proof and formalized it with a tool called Harmonic — but it all checked out. 

“I was curious to establish a baseline for when LLMs are effectively able to solve open math problems compared to where they struggle,” Somani said. The surprise was that, using the latest model, the frontier started to push forward a bit. 

ChatGPT’s chain of thought is even more impressive, rattling off mathematical axioms like Legendre’s formula, Bertrand’s postulate, and the Star of David theorum. Eventually, the model found a Math Overflow post from 2013, where Harvard mathematician Noam Elkies had given an elegant solution to a similar problem. But ChatGPT’s final proof differed from Elkies’ work in important ways, and gave a more complete solution to a version of the problem posed by legendary mathematician Paul Erdős, whose vast collection of unsolved problems has become a proving ground for AI.

For anyone skeptical of machine intelligence, it’s a surprising result — and it’s not the only one. AI tools have become ubiquitous in mathematics, from formalization-oriented LLMs like Harmonic’s Aristotle to literature review tools like OpenAI’s deep research. But since the release of GPT 5.2 — which Somani describes as “anecdotally more skilled at mathematical reasoning than previous iterations” — the sheer volume of solved problems has become difficult to ignore, raising new questions about large language models’ ability to push the frontiers of human knowledge.  

Somani was looking at the Erdős problems, a set of over 1,000 conjectures by the Hungarian mathematician that are maintained online. The problems have become a tempting target for AI-driven mathematics, varying significantly in both subject matter and difficulty. The first batch of autonomous solutions came in November from a Gemini-powered model called AlphaEvolve — but more recently, Somani and others have found GPT 5.2 to be remarkably adept with high-level math.  

Since Christmas, 15 problems have been moved from “open” to “solved” on the Erdős website — and 11 of the solutions have specifically credited AI models as involved in the process. 

The revered mathematician Terence Tao has a more nuanced look at the progress on his GitHub page, counting eight different problems where AI models made meaningful autonomous progress on an Erdős problem, with six other cases where progress was made by locating and building on previous research. It’s a long way from AI systems being able to do math without human intervention, but it’s clear that there’s an important role for large models to play. 

Techcrunch event

San Francisco
|
October 13-15, 2026

On Mastodon, Tao conjectured that the scalable nature of AI systems makes them “better suited for being systematically applied to the ‘long tail’ of obscure Erdős problems, many of which actually have straightforward solutions.”

“As such, many of these easier Erdős problems are now more likely to be solved by purely AI-based methods than by human or hybrid means,” Tao continued.

Another driving force is a recent shift toward formalization, a labor-intensive task that makes mathematical reasoning easier to verify and extend. Formalization doesn’t require use of AI or even computers, but a new crop of automated tools have made the process far easier. The open source “proof assistant” Lean, which was developed at Microsoft Research in 2013, has become widely used within the field as a way of formalizing proof— and AI tools like Harmonic’s Aristotle promise to automate much of the work of formalization. 

For Harmonic founder Tudor Achim, the sudden jump in solved Erdős problems is less important than the fact that the world’s greatest mathematicians are starting to take those tools seriously. “I care more about the fact that math and computer science professors are using [AI tools],” Achim said. “These people have reputations to protect, so when they’re saying they use Aristotle or they use ChatGPT, that’s real evidence.” 



Source link

Share. Facebook Twitter Pinterest LinkedIn Tumblr Email
admin
  • Website

Related Posts

Google’s Trends Explore page gets new Gemini capabilities 

January 14, 2026

The multibillion-dollar AI security problem enterprises can’t ignore 

January 14, 2026

AI security firm, depthfirst, announces $40 million Series A

January 14, 2026
Leave A Reply Cancel Reply

Don't Miss
Sports

John Harbaugh interviews in person for the Giants’ coaching vacancy, AP source says

John Harbaugh interviewed in person with the New York Giants for their head coaching vacancy,…

Real Madrid crash out of Copa del Rey at lowly Albacete on Arbeloa debut | Football News

January 15, 2026

Oglala Sioux Tribe president says three tribal citizens transferred to ICE facility

January 15, 2026

New University of Michigan President Kent Syverud inherits a host of challenges

January 14, 2026
Top Posts

Are Iran’s protests different this time around? | Protests News

January 14, 2026

As hate spirals in India, Hindu extremists turn to Christian targets | Politics

January 14, 2026

Bangladesh won’t play T20 World Cup matches in India, BCB reaffirms | Cricket News

January 13, 2026

Trump announces new 25% tariff: How will it impact Iran’s trading partners? | International Trade News

January 13, 2026

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

About Us
About Us

Welcome to News Frame For You — Your Window to the World! 🌍

At News Frame For You, we bring you the latest and most reliable updates from across the globe, focusing on what truly shapes our modern world. From cutting-edge AI innovations to thrilling sports moments, from the heart of Europe’s business scene to the pulse of Asia’s emerging markets, we frame the news that matters to you — clearly, quickly, and intelligently.

Our Picks

John Harbaugh interviews in person for the Giants’ coaching vacancy, AP source says

January 15, 2026

Real Madrid crash out of Copa del Rey at lowly Albacete on Arbeloa debut | Football News

January 15, 2026

Oglala Sioux Tribe president says three tribal citizens transferred to ICE facility

January 15, 2026
Most Popular

Laude Institute announces first batch of ‘Slingshots’ AI grants

November 7, 2025

Sam Altman says OpenAI has $20B ARR and about $1.4 trillion in data center commitments

November 7, 2025

Amazon launches an AI-powered Kindle Translate service for e-book authors

November 7, 2025
  • Home
  • About Us
  • Advertise With Us
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 newsframeforyou. Designed by newsframeforyou.

Type above and press Enter to search. Press Esc to cancel.