Active filters: RL
nvidia/Nemotron-Cascade-2-30B-A3B
Text Generation
• 32B • Updated • 252k
• 486
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 14
• 4
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 11
• 7
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos
Reinforcement Learning
• 8B • Updated • 9
• 9
NousResearch/DeepHermes-ToolCalling-Specialist-Atropos
Reinforcement Learning
• 8B • Updated • 53
• 17
NousResearch/DeepHermes-Financial-Fundamentals-Prediction-Specialist-Atropos
Text Generation
• 8B • Updated • 21
• 16
NousResearch/DeepHermes-Egregore-v2-RLAIF-8b-Atropos-GGUF
Reinforcement Learning
• 8B • Updated • 25
• 3
NousResearch/DeepHermes-Egregore-v1-RLAIF-8b-Atropos-GGUF
Reinforcement Learning
• 8B • Updated • 51
• 5
NousResearch/DeepHermes-AscensionMaze-RLAIF-8b-Atropos-GGUF
Reinforcement Learning
• 127k • Updated • 65
• 8
nvidia/Nemotron-Cascade-14B-Thinking
Text Generation
• Updated • 4.86k
• 79
nvidia/Nemotron-Cascade-8B
Text Generation
• Updated • 1.71k
• 67
mlx-community/Nemotron-Cascade-2-30B-A3B-4bit
Text Generation
• 32B • Updated • 3.12k
• 18
cyankiwi/Nemotron-Cascade-2-30B-A3B-AWQ-8bit
Text Generation
• 10B • Updated • 199
• 1
stanfordnlp/SteamSHP-flan-t5-xl
Updated • 16
• 43
stanfordnlp/SteamSHP-flan-t5-large
Updated • 41
• 33
SultanR/SmolTulu-1.7b-Reinforced
Text Generation
• 2B • Updated • 12
• 5
mradermacher/SmolTulu-1.7b-Reinforced-GGUF
2B • Updated • 45
Daemontatox/Llama3.3-70B-CogniLink
Text Generation
• 71B • Updated • 80
• 3
mradermacher/Llama3.3-70B-CogniLink-GGUF
Text Generation
• 71B • Updated • 194
mradermacher/Llama3.3-70B-CogniLink-i1-GGUF
Text Generation
• 71B • Updated • 288
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora
Reinforcement Learning
• Updated
JHuel/Mistral-Nemo-Instruct-2407_ORPO
Text Generation
• Updated
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
• 0.5B • Updated • 91
• 24
Reinforcement Learning
• Updated
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
0.5B • Updated • 233
• 1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
0.5B • Updated • 233
• 1
mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF
0.5B • Updated • 110
Text Generation
• 684B • Updated • 155
• 1
mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF
0.5B • Updated • 59
Lyte/QuadConnect2.5-0.5B-v0.0.9b
Text Generation
• 0.5B • Updated • 22