Inference Providers
Active filters: vLLM
QuantTrio/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 237
• 9
QuantTrio/GLM-4.5-GPTQ-Int4-Int8Mix
Text Generation
• 55B • Updated • 113
• 5
Text Generation
• 53B • Updated • 27
• 9
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ-BF16Mix
Text Generation
• 14B • Updated • 46
• 4
QuantTrio/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 201
• 2
QuantTrio/Qwen3-30B-A3B-Thinking-2507-AWQ
Text Generation
• 31B • Updated • 7.83k
• 4
QuantTrio/KAT-V1-40B-GPTQ-Int4-Int8Mix
Text Generation
• 47B • Updated • 5
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-GPTQ-Int8
Text Generation
• 31B • Updated • 1.1k
• 8
QuantTrio/Qwen3-Coder-30B-A3B-Instruct-AWQ
Text Generation
• 31B • Updated • 70.6k
• 6
EliovpAI/Qwen3-14B-FP8-KV
Text Generation
• 15B • Updated • 2
• 2
Image-Text-to-Text
• 17B • Updated • 543
• 19
QuantTrio/Seed-OSS-36B-Instruct-AWQ
Text Generation
• 36B • Updated • 121
• 8
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation
• 36B • Updated • 68
• 4
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation
• 36B • Updated • 110
• 5
QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3
Text Generation
• 34B • Updated • 5
• 3
amakhov/tiny-random-llama
Text Generation
• 4.18M • Updated • 16
Text Generation
• 41B • Updated • 6
• 2
QuantTrio/DeepSeek-V3.1-AWQ
Text Generation
• 485B • Updated • 319
• 5
QuantTrio/DeepSeek-V3.1-AWQ-Fp16Mix
Text Generation
• 286B • Updated • 10
• 1
QuantTrio/DeepSeek-V3.1-AWQ-Lite
Text Generation
• 684B • Updated • 94
• 3
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4
Text Generation
• 4B • Updated • 132k
• 2
JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int8
Text Generation
• 4B • Updated • 19
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int4
Text Generation
• 4B • Updated • 61
• 1
JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8
Text Generation
• 4B • Updated • 7
• 2
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int4
Text Generation
• 31B • Updated • 33.3k
JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation
• 31B • Updated • 4
JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int4
Text Generation
• 31B • Updated • 42
JunHowie/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated • 6
JunHowie/Qwen2-7B-Instruct-GPTQ-Int8
Text Generation
• 8B • Updated • 2
EliovpAI/Deepseek-R1-0528-Qwen3-8B-FP8-KV
Text Generation
• 8B • Updated • 4