| Part of a series on |
| Artificial intelligence (AI) |
|---|
FrontierMath is a test bed to benchmark[1] various artificial intelligences in their attempts to solve 14 bespoke[2] heretofore unexamined mathematical problems[3] (none of which are on the scale of the Millennium Problems). It was established by the non-profit research organization Epoch AI in November 2024.[4] The first such open problem—of the “moderately interesting” rank—to be solved was in hypergraph theory: “A Constant-Factor Lower Bound For H (n)” by GPT-5.4.[5]
See also
References
- ^ Glazer, Elliot; Erdil, Ege; Besiroglu, Tamay; Chicharro, Diego; Chen, Evan; Gunning, Alex; Olsson, Caroline Falkman; Denain, Jean-Stanislas; Ho, Anson (2025-12-23), FrontierMath: A Benchmark for Evaluating Advanced Mathematical Reasoning in AI, arXiv, doi:10.48550/arXiv.2411.04872, arXiv:2411.04872, retrieved 2026-05-16
- ^ Team, MindStudio (April 7, 2026). “What Is the Frontier Math Benchmark? Why Open Research Problems Expose True AI Reasoning”. MindStudio.
- ^ “FrontierMath: Open Problems – Unsolved Mathematical Challenges”. Epoch AI.
- ^ “AI Math Benchmarks: AI’s Growing Capabilities – IEEE Spectrum”. spectrum.ieee.org.
- ^ Johnson, Olivia (March 14, 2026). “GPT-5.4 solves its first open math problem from FrontierMath benchmark”. remio.