AIME 2024
Dataset.
Appearances in this wiki
- DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning — A high-level mathematics competition benchmark used to evaluate and report the advanced reasoning capabilities of DeepSeek-R1.