GPT-5.2 is looking like another leap forward
Leaked internal benchmarks for GPT-5.2 “Thinking” have been posted by Sam Altman, and quite frankly, the numbers are ridiculous. We aren’t talking about incremental gains here. For reference: AIME 2025: 100.0%. That is a perfect score. AIME is a major competition math exam, so competition math is effectively “completed” for this model. ARC-AGI-2: This […]















