Before February 2025, will a Gemini model exceed Claude 3.5 Sonnet 10/22's Global Average score on Simple Bench?
Basic
11
Ṁ501
Feb 2
9% chance

https://simple-bench.com/ Claude 3.5 Sonnet 10/22 achieves 41.4%, whereas the best Gemini model scores 27.1%.

  • Update 2025-01-22 (PST): Resolution date: the market will now be resolved on February 1st, 2025 instead of the previously stated date. (AI summary of creator comment)

bought Ṁ50 YES

Can we go ahead and resolve this one?

@rogs Won't resolve it until February 1st

bought Ṁ5 NO

Now I'm looking at my comment above and wondering what I was thinking. Did I think this was a post about OpenAI models vs Claude rather than about Gemini vs Claude? Why did I think it was resolvable already?

© Manifold Markets, Inc.