Huihui-Qwen3.6-35B-A3B-abliterated-mlx-4bit

MLX-VLM conversion of huihui-ai/Huihui-Qwen3.6-35B-A3B-abliterated.

Overview

Format: MLX-VLM
Precision: 4-bit
Size: 19G
Source model type: Qwen3_5MoeForConditionalGeneration
Source pipeline: image-text-to-text
Intended runtime: mlx-vlm, LM Studio
Quantization result: 4-bit affine
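
As a rough sanity check on the size figure, the arithmetic behind a 4-bit affine quantization can be sketched as follows. The inputs are assumptions, not values taken from this repo: roughly 35e9 quantized weights (the nominal "35B" in the model name), a group size of 64, and one 16-bit scale plus one 16-bit bias per group. The helper name `estimate_gb` is hypothetical.

```shell
# Hypothetical helper: estimate the on-disk size of affine-quantized weights.
# Assumed, not read from this repo: parameter count, group size, and a
# 16-bit scale + 16-bit bias stored per group.
estimate_gb() {
  awk -v params="$1" -v bits="$2" -v group="$3" 'BEGIN {
    bits_per_weight = bits + 32 / group   # per-weight share of scale + bias
    printf "%.1f\n", params * bits_per_weight / 8 / 1e9
  }'
}

estimate_gb 35e9 4 64   # prints 19.7 (decimal GB) under the assumptions above
```

The estimate lands close to the listed size, though it is only approximate: some tensors may be left unquantized, and the exact parameter count differs from the nominal figure.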

Validation

Local checks on Apple Silicon:

model loading in mlx-vlm: passed
text generation smoke test: passed
image-input generation path smoke test: passed

Smoke-test logs were produced locally before upload.
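
The actual smoke-test scripts are not included in this card. A minimal sketch of how checks like these might be scripted is shown below; the `run_check` helper is hypothetical, and the example invocations in the comments reuse only the `mlx_vlm.generate` flags shown in the Usage section.

```shell
# Hypothetical helper: run a command quietly and report pass/fail by exit status.
run_check() {
  name="$1"
  shift
  if "$@" >/dev/null 2>&1; then
    echo "$name: passed"
  else
    echo "$name: failed"
    return 1
  fi
}

# Example invocations (MODEL is a placeholder for a local path, left commented out):
# run_check "text generation smoke test" \
#   mlx_vlm.generate --model "$MODEL" --prompt "Hello" --max-tokens 8
# run_check "image-input generation path smoke test" \
#   mlx_vlm.generate --model "$MODEL" --image /path/to/example.png \
#   --prompt "Describe this image." --max-tokens 8
```

An exit-status check only confirms that the command ran to completion; it says nothing about output quality.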

Notes

This is an abliterated model conversion. Outputs may be less filtered than standard instruction-tuned models. Use responsibly and comply with applicable laws and platform policies.

Files

Important files in this repo:

config.json
generation_config.json
chat_template.jinja
tokenizer.json
tokenizer_config.json
model.safetensors.index.json
model-*.safetensors
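
After downloading, a quick way to confirm that a local copy contains the files listed above is a small check like the following. This is a sketch, not part of the repo: `check_model_dir` is a hypothetical helper, and it only tests for file presence, not integrity.

```shell
# Hypothetical helper: report any of the listed files missing from a local copy.
check_model_dir() {
  dir="$1"
  missing=0
  for f in config.json generation_config.json chat_template.jinja \
           tokenizer.json tokenizer_config.json model.safetensors.index.json; do
    if [ ! -f "$dir/$f" ]; then
      echo "missing: $f"
      missing=1
    fi
  done
  # At least one weight shard matching model-*.safetensors must exist.
  # If the glob matches nothing, $1 stays the literal pattern and -f fails.
  set -- "$dir"/model-*.safetensors
  if [ ! -f "$1" ]; then
    echo "missing: model-*.safetensors"
    missing=1
  fi
  return "$missing"
}
```

For example, `check_model_dir /path/to/Huihui-Qwen3.6-35B-A3B-abliterated-mlx-4bit` prints nothing and returns 0 when every file is present.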

Usage

Text generation

mlx_vlm.generate \
  --model /path/to/Huihui-Qwen3.6-35B-A3B-abliterated-mlx-4bit \
  --prompt "你好" \
  --max-tokens 128 \
  --trust-remote-code \
  --processor-kwargs '{"enable_thinking": false}'

Image prompt

mlx_vlm.generate \
  --model /path/to/Huihui-Qwen3.6-35B-A3B-abliterated-mlx-4bit \
  --image /path/to/example.png \
  --prompt "Describe this image." \
  --max-tokens 128 \
  --trust-remote-code \
  --processor-kwargs '{"enable_thinking": false}'

Model tree

Base model: Qwen/Qwen3.6-35B-A3B
Finetuned: huihui-ai/Huihui-Qwen3.6-35B-A3B-abliterated
Quantized: vanch007/Huihui-Qwen3.6-35B-A3B-abliterated-mlx-4bit (this model)