Will an LLM break 1400 ELO on LMSys before February?
๐Ÿ’Ž
Premium
49
แน€120k
Feb 2
3%
chance

Google currently leads with Gemini -- which has two models at around 1370

But OpenAI just announced O3 -- which is getting great marks on things like hard science questions.
https://deepnewz.com/ai-modeling/openai-unveils-o3-o3-mini-models-exceeding-human-performance-on-arc-agi-4f05e4f7

The resolution is simple. Will and LMSys update contain a model with 1400 ELO? Cutoff is last day in January (East Coast time).

  • Update 2025-26-01 (PST): - Resolution Criteria Update:

    • The resolution will be based on the information available on the website on February 1st. (AI summary of creator comment)

Get
แน€1,000
and
S3.00
Sort by:
bought แน€3,250 NO

@Moscow25 is it per status on the last of January, or will you include whatever's the first update in February? I think you usually go by the latter on your markets (am I conflating creators, perhaps?), but unless it's in description I imagine it should be by the former.

@HenriThunberg whatever is on the website on Feb 1st

bought แน€333 YES

Let's see if DeepSeek R1 makes a dent!

We are running low on time. But the models are pretty good. 1374 ELO

Big question is will there be a model launch in time...

@ChinmayTheMathGuy cool -- gave you some liquidity for that market -- needs more

I would recommend rewording the title to before february

bought แน€1,000 NO

also @Moscow25 wanna bet more at 55%? Got a limit up for an hour or so

ยฉ Manifold Markets, Inc.โ€ขTerms + Mana-only Termsโ€ขPrivacyโ€ขRules