Inference Providers
Active filters: quant
AngelSlim/Hy-MT1.5-1.8B-1.25bit
Translation
• Updated • 16.3k
• 88
tencent/Hy-MT1.5-1.8B-2bit
Translation
• 2B • Updated • 172
• 29
AngelSlim/Hy-MT1.5-1.8B-1.25bit-GGUF
Translation
• 2B • Updated • 2.74k
• 21
tencent/Hy-MT1.5-1.8B-2bit-GGUF
Translation
• 2B • Updated • 3.46k
• 19
tencent/Hy-MT1.5-1.8B-1.25bit
Translation
• Updated • 166
• 22
tencent/Hy-MT1.5-1.8B-1.25bit-GGUF
Translation
• 2B • Updated • 2.18k
• 12
AngelSlim/Hy-MT1.5-1.8B-2bit-GGUF
Translation
• 2B • Updated • 2.24k
• 11
AngelSlim/Hy-MT1.5-1.8B-2bit
Translation
• 2B • Updated • 464
• 7
2B • Updated • 27
• 58
AngelSlim/HY-1.8B-2Bit-GGUF
2B • Updated • 726
• 42
eaddario/Qwen3.6-27B-GGUF
Image-Text-to-Text
• 27B • Updated • 848
• 2
jackcloudman/Qwen3-Next-80B-A3B-Thinking-GGUF
Text Generation
• 80B • Updated • 82
• 2
Text-to-Image
• Updated • 16.2k
• 47
tsqn/Z-Image-Turbo_fp8_comfyui
Text-to-Image
• Updated • 656
• 5
eaddario/Qwen3.6-35B-A3B-GGUF
Image-Text-to-Text
• 35B • Updated • 845
• 1
digitous/13B-HyperMantis_GPTQ_4bit-128g
Text Generation
• Updated • 22
• 12
pszemraj/nougat-small-onnx-quant_avx2
Image-Text-to-Text
• Updated • 3
pszemraj/nougat-base-onnx-quant_avx2
Image-Text-to-Text
• Updated • 5
fhai50032/RolePlayLake-7B-GGUF
7B • Updated • 15
• 3
oldbridge/latxa-7b-instruct-q8
Text Generation
• 7B • Updated • 7
pszemraj/nougat-small-onnx-quant_avx512_vnni
Image-Text-to-Text
• Updated • 3
RDson/Llama-3-Magenta-Instruct-4x8B-MoE-GGUF
25B • Updated • 136
• 1
TroyDoesAI/Codestral-21B-Pruned
Text Generation
• 21B • Updated • 12
• 2
mradermacher/Codestral-21B-Pruned-GGUF
21B • Updated • 319
mradermacher/Codestral-21B-Pruned-i1-GGUF
21B • Updated • 560
pszemraj/candle-flanUL2-quantized
Text Generation
• 19B • Updated • 24
byroneverson/gemma-2-27b-it-abliterated-gguf
Text Generation
• 27B • Updated • 116
• 12
QuantFactory/gemma-2-27b-it-abliterated-GGUF
Text Generation
• 27B • Updated • 593
• 7