In recent months, the release of Open AI's O1 preview has sparked increased debate around the term "reasoning" as applied to LLMs. This talk will illustrate the monumental strides that have been made in mathematical reasoning by AI at a technical level in the past year.
In a paper published in January in Nature, DeepMind introduce AlphaGeometry, an AI system that solves complex geometry problems at a level approaching a human Olympiad gold-medalist.
In July, DeepMind announced a generalised method for solving maths Olympiad problems called AlphaProof, including an improved AlphaGeometry 2.
The talk will discuss the methodology of the January paper in detail, with the goal of understanding how AlphaGeometry could reach the ability of the world's best Olympiad contestants without human demonstration.
Finally we will discuss the implications for the future of AI reasoning in more general domains, both within and beyond mathematics.
Useful link:
AlphaGeometry: https://www.nature.com/articles/s41586-023-06747-5