Sergio Paniego's picture

Building on HF

Sergio Paniego PRO

sergiopaniego

huggingface

·

https://sergiopaniego.github.io/

AI & ML interests

None yet

Recent Activity

updated a dataset 10 minutes ago

huggingface-projects/Deep-RL-Course-Certification

posted an update about 8 hours ago

Google DeepMind releases FunctionGemma, a 240M model specialized in 🔧 tool calling, built for fine-tuning TRL has day-0 support. To celebrate, we’re sharing 2 new resources: > Colab guide to fine-tune it for 🌐 browser control with BrowserGym OpenEnv > Standalone training script > Colab notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_functiongemma_browsergym_openenv.ipynb > Training script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/browsergym_llm.py (command to run it inside the script) > More notebooks in TRL: https://huggingface.co/docs/trl/example_overview#notebooks

updated a dataset about 9 hours ago

agents-course/final-certificates

View all activity

Organizations

Posts 59

Post

96

Google DeepMind releases FunctionGemma, a 240M model specialized in 🔧 tool calling, built for fine-tuning

TRL has day-0 support. To celebrate, we’re sharing 2 new resources:

> Colab guide to fine-tune it for 🌐 browser control with BrowserGym OpenEnv
> Standalone training script

> Colab notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/grpo_functiongemma_browsergym_openenv.ipynb
> Training script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/browsergym_llm.py (command to run it inside the script)
> More notebooks in TRL: https://huggingface.co/docs/trl/example_overview#notebooks

Articles 11

Article

22

Tokenization in Transformers v5: Simpler, Clearer, and More Modular

View all Articles

Collections 7

View 7 collections

spaces 52

VLM Object Understanding

Explore object detection, visual grounding, keypoint Detecti

Qwen2-VL-7B

Ask questions about charts in images

SmolVLM-trl-dpo-rlaif-v

Generate text from an image and question

SmolVLM-trl-sft-ChartQA

Ask questions about charts in images

Browsergym-grpo-Qwen-Qwen3-0.6B-2025-12-18 14-36-57

Display live tracking data

Trl Trackio

Show live tracking data

models 98

sergiopaniego/Qwen2-7B-Instruct-GRPO-merged

Text Generation • 8B • Updated 7 days ago • 37

sergiopaniego/Qwen2-7B-Instruct-GRPO

Updated 7 days ago

sergiopaniego/EssentialAI-rnj-1-instruct-trl-grpo

Updated 10 days ago

sergiopaniego/Ministral-3-3B-Instruct-trl-sft-test

Updated 14 days ago

sergiopaniego/Ministral-3-3B-Instruct-trl-grpo

Updated 15 days ago

sergiopaniego/Ministral-3-3B-Instruct-trl-sft

Updated 15 days ago

sergiopaniego/grpo_biogrid_qwen_3g-1.7b

Text Generation • 2B • Updated 23 days ago • 57

sergiopaniego/wordle-grpo-Qwen3-1.7B

Text Generation • 2B • Updated 28 days ago • 34

sergiopaniego/wordle-grpo-Qwen3-1.7B-Instruct-updated

Text Generation • 2B • Updated 29 days ago • 71

sergiopaniego/wordle-grpo-Qwen2.5-0.5B-Instruct-updated

Updated 30 days ago

datasets 5

sergiopaniego/sample_videos

Viewer • Updated Jun 30 • 2 • 28

sergiopaniego/difficult_prompts

Viewer • Updated Jun 20 • 38 • 26

sergiopaniego/ourworldindata_example

Viewer • Updated Dec 2, 2024 • 13 • 102 • 1

sergiopaniego/faiss_embeddings

Updated Oct 3, 2024 • 30

sergiopaniego/CarlaFollowLanePreviousV

Viewer • Updated Sep 6, 2023 • 59.6k • 40