What will be true of the SOTA AI on the FrontierMath benchmark, before 2028?
85% - Transformer-based architecture
77% - Over 1T parameters
50% - Part of the AlphaProof family of models (AlphaProof N and variations)
37% - Developed by OpenAI
34% - Narrow domain of knowledge, i.e., does not know random facts such as when Google was founded or who won the 1960 presidential election.
31% - Developed by Google DeepMind
31% - Part of the o1 family of models (o1, o2, etc. and variations)
26% - Developed by a non-British and non-American company
20% - Part of the GPT-N family of models (GPT-5, GPT-6, and variations)
10% - Energy-based Model (https://en.wikipedia.org/wiki/Energy-based_model)
9% - Based on Symbolic AI (https://en.wikipedia.org/wiki/Symbolic_artificial_intelligence)
An option resolves YES if it is true of the AI model or program known to be state of the art on the FrontierMath benchmark at the end of 2027. It resolves NO otherwise.
You're welcome to add any interesting facts that might or might not be true of the state of the art in math problems, defined as the model achieving the highest score on the FrontierMath benchmark.
I reserve the right to cancel any option that is too vague, too improbable, etc.
See also:
/Bayesian/what-will-true-of-the-sota-ai-on-th-y0LE5uE9n9
/Bayesian/what-will-true-of-the-sota-ai-on-th-ROldIhZZgt
/Bayesian/what-will-true-of-the-sota-ai-on-th-RQptyR5uO8 (this market)
/Bayesian/will-an-ai-achieve-85-performance-o-hyPtIE98qZ
This question is managed and resolved by Manifold.
What is this?
What is Manifold?
Manifold is the world's largest social prediction market.
Get accurate real-time odds on politics, tech, sports, and more.
Win cash prizes for your predictions on our sweepstakes markets! Always free to play. No purchase necessary.
Are our predictions accurate?
Yes! Manifold is very well calibrated, with forecasts on average within 4 percentage points of the true probability. Our probabilities are created by users buying and selling shares of a market.
In the 2022 US midterm elections, we outperformed all other prediction market platforms and were in line with FiveThirtyEight’s performance. Many people who don't like trading still use Manifold to get reliable news.
How do I win cash prizes?
Manifold offers two market types: play money and sweepstakes.
All questions include a play money market which uses mana and can't be cashed out.
Selected markets will have a sweepstakes toggle. These require sweepcash to participate and winners can withdraw sweepcash as a cash prize. You can filter for sweepstakes markets on the browse page.
Redeem your sweepcash won from markets at 1.00 → $1.00, minus a 5% fee.
Related questions
Will an AI score over 80% on FrontierMath Benchmark in 2025?
32% chance
Will an AI achieve >85% performance on the FrontierMath benchmark before 2028?
81% chance
What will be the best performance on FrontierMath by December 31st 2025?
What will be true of the SOTA AI on the FrontierMath benchmark, before 2026?
What will be true of the SOTA AI on the FrontierMath benchmark, before 2027?
Will an AI achieve >85% performance on the FrontierMath benchmark before 2027?
58% chance
Will an AI achieve >80% performance on the FrontierMath benchmark before 2027?
83% chance
Will any AI model achieve > 40% on Frontier Math before 2026?
89% chance
Will any AI model score >80% on Epoch's Frontier Math Benchmark in 2025?
26% chance
Will AlphaProof achieve >30% performance on the FrontierMath benchmark before 2026?
22% chance