Tanay's picture

Tanay PRO

Tanaybh

·

tanaybhardwaj

AI & ML interests

Exploring RLHF/RLAIF techniques, LoRA adapters, and dialogue optimization. Building models that better understand and respond to human intent

Organizations

Tanaybh 's models 9

Tanaybh/microllm-v1

Updated Oct 20 • 5

Tanaybh/gpt-rope-swiglu

7.88M • Updated Oct 17 • 87

Tanaybh/nano-gpt-from-scratch

Text Generation • 1.07M • Updated Oct 5 • 21

Tanaybh/gpt2-rlhf-anthropic

Text Generation • 0.1B • Updated Oct 2 • 21

Tanaybh/gpt2-got-therapy

Text Generation • 0.1B • Updated Sep 30 • 12

Tanaybh/bipedal-walker-ppo

Reinforcement Learning • Updated Sep 21 • 14

Tanaybh/lunar-lander-ppo

Reinforcement Learning • Updated Sep 21 • 13

Tanaybh/my-first-lora-trash-model

Updated Sep 3 • 3

Tanaybh/dialogpt-medium-qlora-alpaca

Updated Sep 3 • 2