The new o3 model scored 75.7% on this ARC-AGI benchmark when restricted to less than $10,000 in compute costs, and 87.5% with an unrestricted compute budget. OpenAI’s relatively capable GPT-4o ...
OpenAI plans to release in the coming months its first open-weight language model since GPT‑2, one with reasoning capabilities, CEO Sam Altman said on Monday. An open-weight language model's trained ...