AI & ML interests

Low-bit Quantization of Large Language Models (LLMs)

Efficient-ML 's collections 2