alcompa

alcompa

AI & ML interests

None yet

Recent Activity

updated a dataset about 1 month ago

aimagelab/CHAIR-DPO_preference_datasets

updated a collection about 2 months ago

CHAIR-DPO

updated a collection about 2 months ago

CHAIR-DPO

View all activity

Organizations

upvoted an article 3 months ago

Article

There is no such thing as a tokenizer-free lunch

Sep 25, 2025

•

upvoted an article 5 months ago

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Aug 8, 2025

•

upvoted an article 7 months ago

Article

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

Jun 3, 2025

•

upvoted 2 articles 8 months ago

Article

The N Implementation Details of RLHF with PPO

Oct 24, 2023

•

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

269

upvoted an article 11 months ago

Article

How to generate text: using different decoding methods for language generation with Transformers

Mar 1, 2020

•

280

upvoted an article about 1 year ago

Article

Decoding Strategies in Large Language Models

Oct 29, 2024

•

102

alcompa

AI & ML interests

Recent Activity

Organizations

alcompa's activity

There is no such thing as a tokenizer-free lunch

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL

The N Implementation Details of RLHF with PPO

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

How to generate text: using different decoding methods for language generation with Transformers

Decoding Strategies in Large Language Models