Inference Providers
Active filters: int4
shieldstar/Qwen3.5-122B-A10B-int4-AutoRound-EC
Image-Text-to-Text
• 21B • Updated • 180k
• 1
Advantech-EIOT/intel_llama-2-chat-7b
Text Generation
• Updated • 10
RedHatAI/zephyr-7b-beta-marlin
Text Generation
• 1B • Updated • 33
RedHatAI/TinyLlama-1.1B-Chat-v1.0-marlin
Text Generation
• 0.3B • Updated • 761
• 2
RedHatAI/OpenHermes-2.5-Mistral-7B-marlin
Text Generation
• 1B • Updated • 15
• 2
RedHatAI/Nous-Hermes-2-Yi-34B-marlin
Text Generation
• 5B • Updated • 17
• 5
ecastera/ecastera-eva-westlake-7b-spanish-int4-gguf
7B • Updated • 11
• 2
softmax/Llama-2-70b-chat-hf-marlin
Text Generation
• 10B • Updated • 4
softmax/falcon-180B-chat-marlin
Text Generation
• 26B • Updated • 14
study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int4
Text Generation
• 8B • Updated • 22
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int4
Text Generation
• 71B • Updated • 8
• 6
study-hjt/Meta-Llama-3-70B-Instruct-AWQ
Text Generation
• 71B • Updated • 6
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int4
Text Generation
• 111B • Updated • 9
• 2
study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int4
Text Generation
• 7B • Updated • 17
study-hjt/Qwen1.5-110B-Chat-AWQ
Text Generation
• 111B • Updated • 4
modelscope/Yi-1.5-34B-Chat-AWQ
Text Generation
• 34B • Updated • 554
• 2
modelscope/Yi-1.5-6B-Chat-GPTQ
Text Generation
• 6B • Updated • 3
modelscope/Yi-1.5-6B-Chat-AWQ
Text Generation
• 6B • Updated • 7
modelscope/Yi-1.5-9B-Chat-GPTQ
Text Generation
• 9B • Updated • 7
• 1
modelscope/Yi-1.5-9B-Chat-AWQ
Text Generation
• 9B • Updated • 10
modelscope/Yi-1.5-34B-Chat-GPTQ
Text Generation
• 34B • Updated • 3
• 1
jojo1899/Phi-3-mini-128k-instruct-ov-int4
Text Generation
• Updated • 5
jojo1899/Llama-2-13b-chat-hf-ov-int4
Text Generation
• Updated • 29
jojo1899/Mistral-7B-Instruct-v0.2-ov-int4
Text Generation
• Updated • 4
model-scope/glm-4-9b-chat-GPTQ-Int4
Text Generation
• 9B • Updated • 144
• 6
ModelCloud/Mistral-Nemo-Instruct-2407-gptq-4bit
Text Generation
• 12B • Updated • 173
• 5
ModelCloud/Meta-Llama-3.1-8B-Instruct-gptq-4bit
Text Generation
• 8B • Updated • 306
• 4
ModelCloud/Meta-Llama-3.1-8B-gptq-4bit
Text Generation
• 8B • Updated • 140
ModelCloud/Meta-Llama-3.1-70B-Instruct-gptq-4bit
Text Generation
• 71B • Updated • 124
• 4
ModelCloud/Mistral-Large-Instruct-2407-gptq-4bit
Text Generation
• 123B • Updated • 7
• 1