Configuration Parsing Warning:In UNKNOWN_FILENAME: "auto_map.AutoTokenizer" must be a string

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

atrost/nanochat-d12-nested-kl-long-20260413

This repo contains the full nested Nanochat d12 checkpoint at the repo root and extracted prefix submodels in subfolders.

Loading with Transformers

from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "atrost/nanochat-d12-nested-kl-long-20260413"
tokenizer = AutoTokenizer.from_pretrained(repo_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(repo_id, trust_remote_code=True, torch_dtype="auto")

small = AutoModelForCausalLM.from_pretrained(
    repo_id,
    subfolder="prefix-1",
    trust_remote_code=True,
    torch_dtype="auto",
)

Available extracted prefixes: prefix-1, prefix-2, prefix-3, prefix-4, prefix-5.

trust_remote_code=True is required because this model uses a custom nested architecture and tokenizer.

Downloads last month
13
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support