RenCoder Series
Collection of all RenCoder Series models, trained with RL on coding tasks (4 items).
This model is an SFT + RLVR (DPO + GRPO) fine-tuned version of mistralai/Devstral-Small-2507, trained on multiple agentic coding datasets (SWE-Bench, NVIDIA Terminal Corpus, etc.).
"Obsessed with building open-source AGI? So am I! Let's create together 🚀 https://www.linkedin.com/in/pankajam"
This model inherits the Apache 2.0 license from the base Devstral-Small-2507 model.
Base model
mistralai/Mistral-Small-3.1-24B-Base-2503