Turkish-LLM-32B-Instruct-GGUF

GGUF quantizations of Turkish-LLM-32B-Instruct.

Part of the Turkish LLM Family.

Available Quantizations

| Quantization | File Size | Use Case |
|---|---|---|
| Q4_K_M | 19 GB | Best balance of quality and size. Recommended for most users. |
| Q5_K_M | 22 GB | Higher quality, slightly larger. |
| Q8_0 | 33 GB | Near-original quality. Requires 40 GB+ VRAM. |
| F16 | 62 GB | Unquantized 16-bit weights. Research use. |
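
As a rough guide to choosing among the quantizations above, the sketch below picks the largest file that fits in a given amount of memory. The file sizes come from the table; the 20% overhead factor is an assumption (it roughly matches the Q8_0 row, where a 33 GB file is listed as needing 40 GB+), and `pick_quant` is a hypothetical helper, not part of any tool. Actual memory use depends on context length and backend.

```python
# File sizes in GB, copied from the quantization table.
QUANT_SIZES_GB = {"Q4_K_M": 19, "Q5_K_M": 22, "Q8_0": 33, "F16": 62}

# Assumption: KV cache and runtime buffers add about 20% on top of the file size.
OVERHEAD_FACTOR = 1.2


def pick_quant(available_gb, overhead=OVERHEAD_FACTOR):
    """Return the largest quantization that fits in available_gb, or None."""
    fitting = [
        (size, name)
        for name, size in QUANT_SIZES_GB.items()
        if size * overhead <= available_gb
    ]
    # max() compares sizes first, so the biggest fitting file wins.
    return max(fitting)[1] if fitting else None


for memory_gb in (16, 24, 32, 40, 80):
    print(memory_gb, "GB ->", pick_quant(memory_gb))
```

Under this assumption a 24 GB device lands on Q4_K_M, which is consistent with the table's recommendation for most users.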

Usage with Ollama

# Download and run
ollama run hf.co/ogulcanaydogan/Turkish-LLM-32B-Instruct-GGUF:Q4_K_M
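
Once the model has been pulled, it can also be queried programmatically through Ollama's local REST API. The following is a minimal sketch using only the Python standard library. It assumes an Ollama server is listening on the default address `http://localhost:11434`; `build_request` and `generate` are illustrative helper names.

```python
import json
import urllib.request

MODEL = "hf.co/ogulcanaydogan/Turkish-LLM-32B-Instruct-GGUF:Q4_K_M"
# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_request(prompt, model=MODEL, url=OLLAMA_URL):
    """Build a non-streaming POST request for Ollama's /api/generate endpoint."""
    payload = {"model": model, "prompt": prompt, "stream": False}
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def generate(prompt):
    """Send the prompt to the local server and return the generated text."""
    with urllib.request.urlopen(build_request(prompt)) as response:
        return json.loads(response.read())["response"]
```

With the server running, `generate("Turkiye'nin ekonomik durumu hakkinda bilgi ver.")` should return the model's reply as a string.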

Usage with llama.cpp

./llama-cli -m Turkish-LLM-32B-Instruct-Q4_K_M.gguf -p "Turkiye'nin ekonomik durumu hakkinda bilgi ver." -n 256
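
To script the same command, for example to run several prompts in a batch, it can be wrapped with Python's `subprocess` module. This is a sketch under a few assumptions: the `llama-cli` binary and the GGUF file are in the current directory, and the build exits once generation finishes. `llama_cli_args` and `run_llama_cli` are hypothetical helper names; only the `-m`, `-p`, and `-n` flags from the command above are used.

```python
import subprocess


def llama_cli_args(model_path, prompt, max_tokens=256, binary="./llama-cli"):
    """Build the argument list for the llama-cli command shown above."""
    return [binary, "-m", model_path, "-p", prompt, "-n", str(max_tokens)]


def run_llama_cli(model_path, prompt, max_tokens=256):
    """Run llama-cli and return its standard output as text."""
    result = subprocess.run(
        llama_cli_args(model_path, prompt, max_tokens),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError if llama-cli fails
    )
    return result.stdout
```

Passing the arguments as a list avoids shell quoting problems with the apostrophe in the Turkish prompt.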

Benchmark Results

| Benchmark | Base Model | Turkish-LLM-32B | Delta (percentage points) |
|---|---|---|---|
| MMLU-TR | 0.6518 | 0.6564 | +0.46 |
| XNLI-TR | 0.4578 | 0.4610 | +0.32 |
| XCOPA-TR | 0.6800 | 0.6740 | -0.60 |
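
The Delta column appears to be the score difference expressed in percentage points, that is, (Turkish-LLM-32B score minus base score) times 100. The short check below recomputes it from the scores in the table; `delta_pp` is an illustrative helper name.

```python
# (base model score, Turkish-LLM-32B score), copied from the benchmark table.
scores = {
    "MMLU-TR": (0.6518, 0.6564),
    "XNLI-TR": (0.4578, 0.4610),
    "XCOPA-TR": (0.6800, 0.6740),
}


def delta_pp(base, tuned):
    """Difference between two 0-1 scores, in percentage points."""
    return round((tuned - base) * 100, 2)


for name, (base, tuned) in scores.items():
    print(f"{name}: {delta_pp(base, tuned):+.2f}")
```

The printed values should match the Delta column: small gains on MMLU-TR and XNLI-TR, and a small drop on XCOPA-TR.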

Model size: 33B params
Architecture: qwen2
Base model: Qwen/Qwen2.5-32B