February1 DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL Date: February 10, 2025 | Estimated Reading Time: 10 min | Author: Michael Luo, Sijun Tan, Tianjun Zhang