FIX transformers compat

#28

by Qubitium - opened Jun 6, 2024

base: refs/heads/main

←

from: refs/pr/28

Discussion Files changed

+10

-10

FIX autogptq compat3ad2fbdd

Qubitium

Jun 6, 2024

We have a pending autogptq PR that will allow gptq quant of gllm. For the augptq PR to work we need this simple method def/typing fix to resolve compat issues with transformers and autogptq.

Ready gptq quants for testing:

https://huggingface.co/LnL-AI/glm-4-9b-gptq-4bit-qubitium-r1
https://huggingface.co/LnL-AI/glm-4-9b-chat-gptq-4bit-qubitium-r1

Qubitium changed pull request title from FIX autogptq compat to FIX transformers compat Nov 10, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Cannot merge

This branch has merge conflicts in the following files:

modeling_chatglm.py

· Sign up or log in to comment