Llama-3.3-8B-Nymphaea-RP

A fine-tune of Llama 3.3 8B Instruct for roleplay and creative writing.

I've trained this mostly for merging with Llama 3.1/3.3 8B fine-tunes.

The SillyTavern preset is available here. For custom presets, please use the Llama 3 instruct template.

GGUF

Here is my custom mixed-quant GGUF, optimized for 8/12GB VRAM.

GGUF recipe
llama-quantize \
  --imatrix imatrix.gguf \
  --token-embedding-type q8_0 \
  --output-tensor-type q8_0 \
  --tensor-type ".*attn_q.weight=q8_0" \
  --tensor-type ".*attn_k.weight=q8_0" \
  --tensor-type ".*attn_v.weight=q5_k" \
  --tensor-type ".*attn_output.weight=q5_k" \
  --tensor-type ".*ffn_up.weight=iq4_nl" \
  --tensor-type ".*ffn_gate.weight=iq4_nl" \
  --tensor-type ".*ffn_down.weight=q5_k" \
  Llama-3.3-8B-Nymphaea-RP.F16.gguf \
  Llama-3.3-8B-Nymphaea-RP.Q5_Mix.gguf \
  q5_k

Imatrix file for making your own quants is available here. I used this calibration dataset to create it, expanding it with RP and creative writing data (about 400k tokens).

Training Notes

Trained on the latest iteration of my Darkmere dataset. This version features expanded genre variety, built upon a mix of manually curated synthetics and human-written stories.

The base weights are abliterated via Heretic prior to fine-tuning, so this fine-tune is quite uncensored.

Training Specs

Method:

  • Training Method: DoRA (Weight-Decomposed LoRA)
  • Target Modules all-linear
  • LoRA Rank: 64
  • LoRA Alpha: 64
  • LoRA Dropout: 0.05

Hyperparameters:

  • Batch Size: 2 (Per-device)
  • Gradient Accumulation: 2
  • Epochs: 2
  • Learning Rate: 1e-4
  • Optimizer: adamw_torch_fused
  • LR Scheduler: cosine
  • Noise Level: neftune_noise_alpha=5

Special Thanks

This fine-tune wouldn't be possible without the incredible work of the community:

Downloads last month
951
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for 0xA50C1A1/Llama-3.3-8B-Nymphaea-RP

Collection including 0xA50C1A1/Llama-3.3-8B-Nymphaea-RP