Inference Providers
Active filters: torchao
gurro/llama-3.1-8B-torchao-int4wo-128
Text Generation
• Updated • 5
gurro/llama-3.1-8B-torchao-int4wo-256
Text Generation
• Updated • 3
jerryzh168/llama3-8b-autoquant
Text Generation
• Updated • 7
medmekk/Llama-3.1-8B-Instruct-torchao-int8_weight_only
Updated
medmekk/Llama-3.1-8B-Instruct-torchao-int8wo
medmekk/Llama-3.1-8B-Instruct-torchao-int8da8w
Updated
medmekk/Llama-3.2-3B-Instruct-torchao-int8wo
Updated
medmekk/Llama-3.2-1B-torchao-int8wo
Updated
medmekk/Llama-3.2-1B-torchao-int8da8w
Updated
medmekk/Llama-3.2-3B-Instruct-torchao-int8da8w
Updated
medmekk/Llama-3.1-70B-Instruct-torchao-int8da8w
Updated
jerryzh168/Meta-Llama-3-8B-torchao-int8_weight_only
jerryzh168/Meta-Llama-3-8B-torchao-int4_weight_only-gs_128
Updated
jerryzh168/Meta-Llama-3-8B-torchao-int4_weight_only-gs_64
Updated
HF-Quantization/Llama-3.2-1B-TORCHAO-W8
HF-Quantization/Llama-3.2-1B-TORCHAO-W8A8
HF-Quantization/Llama-3.2-1B-TORCHAO-W4
HF-Quantization/Llama-3.3-70B-Instruct-TORCHAO-W4
jpablomch/Meta-Llama-3-8B-Instruct-torchao
Text Generation
• Updated • 5
jerryzh168/llama3-8b-int4wo-128
Text Generation
• Updated • 5
jerryzh168/llama3-8b-int8wo
Text Generation
• Updated • 8
alpindale/Meta-Llama-3-8B-torchao-int8_weight_only
Text Generation
• Updated • 11
Text Generation
• Updated • 14
drisspg/float8_dynamic_act_float8_weight-opt-125m
Text Generation
• Updated • 8
marksaroufim/Meta-Llama-3-8B-torchao-int8_weight_only
Text Generation
• Updated • 4
Text Generation
• Updated • 4
Image-Text-to-Text
• Updated • 3
jerryzh168/gemma3-4b-it-float8dq
Image-Text-to-Text
• Updated • 2