turboderp committed on
Commit
4313a12
·
verified ·
1 Parent(s): e1353e8

Create README.md

Files changed (1)
  1. README.md +24 -0
README.md ADDED
@@ -0,0 +1,24 @@
+ ---
+ license: apache-2.0
+ license_link: https://ai.google.dev/gemma/docs/gemma_4_license
+ base_model: google/gemma-4-31B-it
+ base_model_relation: quantized
+ quantized_by: turboderp
+ tags:
+ - exl3
+ ---
+
+ EXL3 quants of [gemma-4-31B-it](https://huggingface.co/google/gemma-4-31B-it)
+
+ ⚠️ Requires ExLlamaV3 v0.0.29 (or v0.0.28 `dev` branch)
+
+ [2.00 bits per weight](https://huggingface.co/turboderp/gemma-4-26B-A4B-exl3/tree/2.00bpw)
+ [2.25 bits per weight](https://huggingface.co/turboderp/gemma-4-26B-A4B-exl3/tree/2.25bpw)
+ [2.50 bits per weight](https://huggingface.co/turboderp/gemma-4-26B-A4B-exl3/tree/2.50bpw)
+ [3.00 bits per weight](https://huggingface.co/turboderp/gemma-4-26B-A4B-exl3/tree/3.00bpw)
+ [3.50 bits per weight](https://huggingface.co/turboderp/gemma-4-26B-A4B-exl3/tree/3.50bpw)
+ [4.00 bits per weight](https://huggingface.co/turboderp/gemma-4-26B-A4B-exl3/tree/4.00bpw)
+ [5.00 bits per weight](https://huggingface.co/turboderp/gemma-4-26B-A4B-exl3/tree/5.00bpw)
+ [6.00 bits per weight](https://huggingface.co/turboderp/gemma-4-26B-A4B-exl3/tree/6.00bpw)
+
+ ![kld](https://cdn-uploads.huggingface.co/production/uploads/6383dc174c48969dcf1b4fce/BmKtZQhNDK5oORg0t-sPC.png)
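
Each quantization level in the README links to a `tree/<bits>bpw` path, which suggests one branch per bitrate. A minimal sketch of fetching a single branch, assuming the `huggingface_hub` package is installed; the helper names `revision_for` and `download_quant` are hypothetical and not part of the README or of ExLlamaV3:

```python
def revision_for(bpw: float) -> str:
    """Format a bits-per-weight value like the branch names in the links, e.g. 4 -> '4.00bpw'."""
    return f"{bpw:.2f}bpw"


def download_quant(repo_id: str, bpw: float, local_dir: str) -> str:
    """Download one quantization branch and return the local path (hypothetical helper)."""
    # Imported lazily so revision_for() works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    return snapshot_download(
        repo_id=repo_id,
        revision=revision_for(bpw),  # branch name, assumed from the tree/ links
        local_dir=local_dir,
    )
```

For example, `download_quant("turboderp/gemma-4-26B-A4B-exl3", 4.0, "./exl3-4.00bpw")` would request the branch the 4.00 bpw link points to; the repository ID here is copied from the link targets, so check it against the repository you actually intend to use.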