Instructions to use alpha-ai/SpeakSpace-Assistant-v1-3B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.

Libraries

How to use alpha-ai/SpeakSpace-Assistant-v1-3B with llama-cpp-python:

# !pip install llama-cpp-python

from llama_cpp import Llama

llm = Llama.from_pretrained(
	repo_id="alpha-ai/SpeakSpace-Assistant-v1-3B",
	filename="unsloth.Q4_K_M.gguf",
)

llm.create_chat_completion(
	messages = "\"The answer to the universe is 42\""
)

Notebooks
Google Colab
Kaggle
Local Apps

llama.cpp

How to use alpha-ai/SpeakSpace-Assistant-v1-3B with llama.cpp:

Install from brew

brew install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M

Install from WinGet (Windows)

winget install llama.cpp
# Start a local OpenAI-compatible server with a web UI:
llama-server -hf alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M
# Run inference directly in the terminal:
llama-cli -hf alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M

Use pre-built binary

# Download pre-built binary from:
# https://github.com/ggerganov/llama.cpp/releases
# Start a local OpenAI-compatible server with a web UI:
./llama-server -hf alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M
# Run inference directly in the terminal:
./llama-cli -hf alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M

Build from source code

git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
cmake -B build
cmake --build build -j --target llama-server llama-cli
# Start a local OpenAI-compatible server with a web UI:
./build/bin/llama-server -hf alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M
# Run inference directly in the terminal:
./build/bin/llama-cli -hf alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M

Use Docker

docker model run hf.co/alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M

LM Studio
Jan
Ollama
How to use alpha-ai/SpeakSpace-Assistant-v1-3B with Ollama:
```
ollama run hf.co/alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M
```

Unsloth Studio new

How to use alpha-ai/SpeakSpace-Assistant-v1-3B with Unsloth Studio:

Install Unsloth Studio (macOS, Linux, WSL)

curl -fsSL https://unsloth.ai/install.sh | sh
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for alpha-ai/SpeakSpace-Assistant-v1-3B to start chatting

Install Unsloth Studio (Windows)

irm https://unsloth.ai/install.ps1 | iex
# Run unsloth studio
unsloth studio -H 0.0.0.0 -p 8888
# Then open http://localhost:8888 in your browser
# Search for alpha-ai/SpeakSpace-Assistant-v1-3B to start chatting

Using HuggingFace Spaces for Unsloth

# No setup required
# Open https://huggingface.co/spaces/unsloth/studio in your browser
# Search for alpha-ai/SpeakSpace-Assistant-v1-3B to start chatting

Pi new

How to use alpha-ai/SpeakSpace-Assistant-v1-3B with Pi:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M

Configure the model in Pi

# Install Pi:
npm install -g @mariozechner/pi-coding-agent
# Add to ~/.pi/agent/models.json:
{
  "providers": {
    "llama-cpp": {
      "baseUrl": "http://localhost:8080/v1",
      "api": "openai-completions",
      "apiKey": "none",
      "models": [
        {
          "id": "alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M"
        }
      ]
    }
  }
}

Run Pi

# Start Pi in your project directory:
pi

Hermes Agent new

How to use alpha-ai/SpeakSpace-Assistant-v1-3B with Hermes Agent:

Start the llama.cpp server

# Install llama.cpp:
brew install llama.cpp
# Start a local OpenAI-compatible server:
llama-server -hf alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M

Configure Hermes

# Install Hermes:
curl -fsSL https://hermes-agent.nousresearch.com/install.sh | bash
hermes setup
# Point Hermes at the local server:
hermes config set model.provider custom
hermes config set model.base_url http://127.0.0.1:8080/v1
hermes config set model.default alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M

Run Hermes

hermes

Docker Model Runner
How to use alpha-ai/SpeakSpace-Assistant-v1-3B with Docker Model Runner:
```
docker model run hf.co/alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M
```

Lemonade

How to use alpha-ai/SpeakSpace-Assistant-v1-3B with Lemonade:

Pull the model

# Download Lemonade from https://lemonade-server.ai/
lemonade pull alpha-ai/SpeakSpace-Assistant-v1-3B:Q4_K_M

Run and chat with the model

lemonade run user.SpeakSpace-Assistant-v1-3B-Q4_K_M

List all available models

lemonade list

SpeakSpace-Assistant-v1-3B

Alpha AI (www.alphaai.biz) fine-tuned canopylabs/orpheus-3b-0.1-ft to create SpeakSpace-Assistant-v1-3B — an English-only, single-speaker voice assistant model. The fine-tune uses custom voice recordings plus the Elise dataset (~3 hours, single-speaker English speech). Transcripts were augmented with emotion/expression tags like <sigh> and <laughs>, added as special tokens in the Orpheus tokenizer.

⚠️ Important: This model is intended for research, prototyping, and internal product demos. Do not use it to impersonate a real person without explicit consent. Review base-model and dataset licenses before commercial use.

TL;DR

Base: canopylabs/orpheus-3b-0.1-ft (~3B params).
Data: Custom Alpha AI dataset + MrDragonFox/Elise (English, ~3 hours).
Objective: Produce natural, expressive speech with inline emotion cues (<laughs>, <sigh>).
Language: English only.
Repo: Suggested as alpha-ai/SpeakSpace-Assistant-v1-3B.

Intended Use & Limitations

Intended use:

Internal voice assistants and demos.
Research on expressive TTS and emotion-tag-conditioned speech.
Applications where transcripts include small expressive markers.

Limitations:

Not multi-speaker or multilingual.
Quality limited by dataset size (~3 hrs + custom data).
Requires Orpheus vocoder/decoder to convert tokens to waveform.
Do not deploy for impersonation without explicit consent.

Model Details

Family: Orpheus 3B (decoder-based speech model).
Tokenizer: Extended with special tokens (<laughs>, <sigh>).
Fine-tuning: Supervised finetuning on audio–transcript pairs.
Output: Discrete audio tokens; decode with Orpheus vocoder.

Data

Sources:

Alpha AI custom speech dataset.
MrDragonFox/Elise (~3 hrs English single-speaker).

Preprocessing:

Aligned utterances with transcripts.
Expression tags inserted inline.
Special tokens added to tokenizer.

Prompt & Input Format

Model accepts text input with optional inline expressions:

Hello! <laughs> I can help with your schedule today.

Workflow: tokenize → generate audio tokens → decode via vocoder.

Training Summary

Objective: Predict audio tokens from transcripts (with expression markers).
Loss: Causal LM loss.
Optimizer: AdamW or AdamW-8bit (please add exact values).
Hyperparameters: Learning rate, batch size, gradient accumulation, seed — to be filled with actual values.

Evaluation

Recommended:

MOS (Mean Opinion Score): naturalness & expressiveness.
Speaker similarity: ABX or MOS vs. ground truth.
Intelligibility: WER via ASR.
Emotion accuracy: Human rating of <laughs>, <sigh> cues.

Add quantitative results when available.

Safety & Responsible Use

Use only with documented consent for training voices.
Guard against impersonation risks.
Consider watermarking or metadata tagging for provenance.
Do not generalize beyond training speaker’s identity.

License & Attribution

Base model: canopylabs/orpheus-3b-0.1-ft (review base license).
Dataset: MrDragonFox/Elise (check dataset license).
Fine-tune: Ensure compatibility of licenses.

Suggested citation:

SpeakSpace-Assistant-v1-3B — fine-tune of canopylabs/orpheus-3b-0.1-ft on Alpha AI custom dataset + MrDragonFox/Elise.

Acknowledgements

canopylabs — Orpheus base model.
MrDragonFox — Elise dataset.
Alpha AI research & engineering team.

Contact

Questions, issues, or collaborations:

Open a discussion on the Hugging Face repo.
Enterprise contact (Alpha AI): www.alphaai.biz | corporate@alphaai.biz
Enterprise contact (SpeakSpace): www.speakspace.co | connect@speakspace.co

Downloads last month: 32

Safetensors

Model size

3B params

Tensor type

F16

Model tree for alpha-ai/SpeakSpace-Assistant-v1-3B

Base model

meta-llama/Llama-3.2-3B-Instruct

Finetuned

canopylabs/orpheus-3b-0.1-pretrained

Finetuned

canopylabs/orpheus-3b-0.1-ft

Quantized

(30)

this model