Post
96
Google DeepMind releases FunctionGemma, a 240M model specialized in π§ tool calling, built for fine-tuning
TRL has day-0 support. To celebrate, weβre sharing 2 new resources:
> Colab guide to fine-tune it for π browser control with BrowserGym OpenEnv
> Standalone training script
> Colab notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_functiongemma_browsergym_openenv.ipynb
> Training script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/browsergym_llm.py (command to run it inside the script)
> More notebooks in TRL: https://huggingface.co/docs/trl/example_overview#notebooks
TRL has day-0 support. To celebrate, weβre sharing 2 new resources:
> Colab guide to fine-tune it for π browser control with BrowserGym OpenEnv
> Standalone training script
> Colab notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_functiongemma_browsergym_openenv.ipynb
> Training script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/browsergym_llm.py (command to run it inside the script)
> More notebooks in TRL: https://huggingface.co/docs/trl/example_overview#notebooks