4.6 TFLOPS
Tanay
PRO
Tanaybh
10 followers · 5 following
tanaybhardwaj
AI & ML interests
Exploring RLHF/RLAIF techniques, LoRA adapters, and dialogue optimization. Building models that better understand and respond to human intent.
Organizations
Tanaybh's models (9)
Tanaybh/microllm-v1 • Updated Oct 20 • 5
Tanaybh/gpt-rope-swiglu • 7.88M • Updated Oct 17 • 87
Tanaybh/nano-gpt-from-scratch • Text Generation • 1.07M • Updated Oct 5 • 21
Tanaybh/gpt2-rlhf-anthropic • Text Generation • 0.1B • Updated Oct 2 • 21
Tanaybh/gpt2-got-therapy • Text Generation • 0.1B • Updated Sep 30 • 12
Tanaybh/bipedal-walker-ppo • Reinforcement Learning • Updated Sep 21 • 14
Tanaybh/lunar-lander-ppo • Reinforcement Learning • Updated Sep 21 • 13
Tanaybh/my-first-lora-trash-model • Updated Sep 3 • 3
Tanaybh/dialogpt-medium-qlora-alpaca • Updated Sep 3 • 2