Instructions to use redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax with Transformers:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("text-generation", model="redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax")
messages = [
    {"role": "user", "content": "Who are you?"},
]
pipe(messages)

# Load model directly
from transformers import AutoTokenizer, AutoModelForCausalLM

tokenizer = AutoTokenizer.from_pretrained("redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax")
model = AutoModelForCausalLM.from_pretrained("redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax")
messages = [
    {"role": "user", "content": "Who are you?"},
]
inputs = tokenizer.apply_chat_template(
	messages,
	add_generation_prompt=True,
	tokenize=True,
	return_dict=True,
	return_tensors="pt",
).to(model.device)

outputs = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(outputs[0][inputs["input_ids"].shape[-1]:]))

Notebooks
Google Colab
Kaggle
Local Apps

vLLM

How to use redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax with vLLM:

Install from pip and serve model

# Install vLLM from pip:
pip install vllm
# Start the vLLM server:
vllm serve "redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax"
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:8000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker

docker model run hf.co/redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax

SGLang

How to use redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax with SGLang:

Install from pip and serve model

# Install SGLang from pip:
pip install sglang
# Start the SGLang server:
python3 -m sglang.launch_server \
    --model-path "redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax" \
    --host 0.0.0.0 \
    --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Use Docker images

docker run --gpus all \
    --shm-size 32g \
    -p 30000:30000 \
    -v ~/.cache/huggingface:/root/.cache/huggingface \
    --env "HF_TOKEN=<secret>" \
    --ipc=host \
    lmsysorg/sglang:latest \
    python3 -m sglang.launch_server \
        --model-path "redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax" \
        --host 0.0.0.0 \
        --port 30000
# Call the server using curl (OpenAI-compatible API):
curl -X POST "http://localhost:30000/v1/chat/completions" \
	-H "Content-Type: application/json" \
	--data '{
		"model": "redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax",
		"messages": [
			{
				"role": "user",
				"content": "What is the capital of France?"
			}
		]
	}'

Docker Model Runner
How to use redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax with Docker Model Runner:
```
docker model run hf.co/redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax
```

Token leak

by deleted - opened Dec 11, 2024

Discussion

deleted

Dec 11, 2024

•

edited Dec 11, 2024

Hello! Was about to test this model, but upon generating a first message, I'm instantly met with it ending with "user" 😅

I regenerated, now it's either "assistant" or my "name:", like it has no stop string.

redrix

Owner Dec 12, 2024

Thanks for letting me know! I made the error of specifiying Mag-Mell as the tokenizer source, instead of creating a union. I had trouble getting the tokenizer working so I messed with the merge config a lot.
I'll remerge this model and redrix/matricide-12B-Unslop-Unleashed to use a union tokenizer and then reupload them.
I apologize for the mistake, I'm quite busy right now and just doing this on the side. I also recently switched my computer's operating system (windows -> fedora linux) and am working on a lot of assignments for school, so something was bound to slip through the cracks.
If you use ST, you can specify custom stopping strings to band-aid fix this. I've no access to my PC at the moment so I can share those later. Something like ["\n{{user}}:", "\nAssistant:"] might help.

redrix

Owner Dec 14, 2024

Update: redrix/nepoticide-12B-Unslop-Unleashed-Mell-RPMax-v2 released!

redrix changed discussion status to closed Dec 14, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment