Resolves YES if Google states that the model now available as gemini-exp-1206 is an early checkpoint of Gemini 2 Pro.
Resolves when the answer is clear from the paper, model card, or communications of Google DeepMind.
Resolves NO when it's stated that Gemini Experimental 1206 is an early checkpoint of Gemini 2 Ultra, Flash, or any other model.
If they don't say anything about these experimental models, resolves N/A.
If there is no Gemini 2 Pro model, resolves N/A.
Google has named the previously experimental model 1206 "Gemini 2.0 Pro Experimental" in AI Studio. It is the model in the question. Does anyone object to my resolving the market as YES?

https://x.com/AndrewCurran_/status/1869069344296346017
This moved my estimates up significantly; why would Sundar himself tweet this if it weren't a model different from Flash?
Because gemini-exp-1206 scores higher than gemini-2.0-flash-exp on lmarena, I don't feel like I have enough information to resolve this market yet.
Yeah, if anything it is evidence that 1121 was an early version of 2.0 Flash (gemini-2.0-flash-exp is at most a few Elo points ahead of 1121) and that 1206, at around 30 Elo points higher, is something else (Pro?).
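For a sense of scale on the roughly 30-point gap mentioned above, here is a minimal sketch that converts a rating gap into an expected head-to-head win rate. It assumes the standard Elo logistic formula with a 400-point scale, and that lmarena scores can be read on that scale. The function name `expected_win_rate` is made up for illustration.

```python
def expected_win_rate(elo_gap: float) -> float:
    """Expected win rate of the higher-rated model for a given rating gap.

    Assumption: standard Elo logistic curve, base 10, 400-point scale.
    """
    return 1.0 / (1.0 + 10.0 ** (-elo_gap / 400.0))


if __name__ == "__main__":
    # Gaps discussed in the thread: "a few" points vs. around 30 points.
    for gap in (5, 30):
        print(f"gap {gap:>3}: {expected_win_rate(gap):.1%}")
```

Under these assumptions a 30-point gap is only about a 54% expected win rate, so the gap suggests a different model but is not conclusive by itself.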