Welcome to the Agentica Project! 👋

We are an open-source initiative to democratize reinforcement learning (RL) techniques and develop scalable systems for large language models (LLMs) and agents.

DeepScaleR: Surpassing O1-Preview with a 1.5B Model by Scaling RL

DeepScaleR is an open-source effort to fully democratize reinforcement learning (RL) for LLMs and reproduce DeepSeek R1 and OpenAI O1/O3 at scale on real tasks. We introduce DeepScaleR-1.5B-Preview, a language model finetuned from Deepseek-R1-Distilled-Qwen-1.5B using distributed reinforcement learning (RL). It achieves an impressive 43.1% Pass@1 accuracy on AIME 2024 surpassing the performance of OpenAI's o1-preview with just 1.5B parameters...

Core Contributors

Contributors

Advisors