Pre-computed Q-Filters for efficient KV cache compression.
Nathan Godey
nthngdy
AI & ML interests
None yet
Recent Activity
updated a model about 10 hours ago
nthngdy/matryoshka-baselines updated a model about 14 hours ago
nthngdy/matryoshka-1B updated a model 6 days ago
nthngdy/matryoshka-3B