Char Level Models My Character level models I trained. Corianas/Microllama_Char_88k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 14 Corianas/Corianas-micro-reactor Text Generation • 85.2M • Updated Feb 17, 2025 • 12 Corianas/Microllama_Char_100k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 17 Corianas/Microllama_Char_300k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 15
Foundational_data TinyGSM: achieving >80% on GSM8k with small language models Paper • 2312.09241 • Published Dec 14, 2023 • 40 TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 45
TinyGSM: achieving >80% on GSM8k with small language models Paper • 2312.09241 • Published Dec 14, 2023 • 40
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 45
Char Level Models My Character level models I trained. Corianas/Microllama_Char_88k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 14 Corianas/Corianas-micro-reactor Text Generation • 85.2M • Updated Feb 17, 2025 • 12 Corianas/Microllama_Char_100k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 17 Corianas/Microllama_Char_300k_step Text Generation • 85.2M • Updated Feb 3, 2025 • 15
Foundational_data TinyGSM: achieving >80% on GSM8k with small language models Paper • 2312.09241 • Published Dec 14, 2023 • 40 TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 45
TinyGSM: achieving >80% on GSM8k with small language models Paper • 2312.09241 • Published Dec 14, 2023 • 40
TinyStories: How Small Can Language Models Be and Still Speak Coherent English? Paper • 2305.07759 • Published May 12, 2023 • 45