-
-
-
-
-
-
Inference Providers
Active filters:
RL
nvidia/Nemotron-Cascade-14B-Thinking
Text Generation
•
15B
•
Updated
•
7.52k
•
72
Delta-Vector/Austral-70B-Winton
Text Generation
•
71B
•
Updated
•
40
•
•
6
nvidia/Nemotron-Cascade-8B-Thinking
Text Generation
•
8B
•
Updated
•
1.37k
•
36
nvidia/Nemotron-Cascade-8B
Text Generation
•
8B
•
Updated
•
3.39k
•
60
nvidia/Nemotron-Cascade-8B-Intermediate-ckpts
Text Generation
•
Updated
•
11
stanfordnlp/SteamSHP-flan-t5-xl
Updated
•
6
•
43
stanfordnlp/SteamSHP-flan-t5-large
Updated
•
105
•
33
SultanR/SmolTulu-1.7b-Reinforced
Text Generation
•
2B
•
Updated
•
8
•
5
mradermacher/SmolTulu-1.7b-Reinforced-GGUF
2B
•
Updated
•
48
Daemontatox/Llama3.3-70B-CogniLink
Text Generation
•
71B
•
Updated
•
45
•
•
3
mradermacher/Llama3.3-70B-CogniLink-GGUF
Text Generation
•
71B
•
Updated
•
63
mradermacher/Llama3.3-70B-CogniLink-i1-GGUF
Text Generation
•
71B
•
Updated
•
82
JHuel/Mistral-Nemo-Instruct-2407_DPO_qlora
Reinforcement Learning
•
Updated
JHuel/Mistral-Nemo-Instruct-2407_ORPO
Text Generation
•
Updated
Ihor/Text2Graph-R1-Qwen2.5-0.5b
Text Generation
•
0.5B
•
Updated
•
23
•
24
Reinforcement Learning
•
Updated
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-GGUF
0.5B
•
Updated
•
115
•
1
mradermacher/Text2Graph-R1-Qwen2.5-0.5b-i1-GGUF
0.5B
•
Updated
•
225
•
1
mradermacher/QuadConnect2.5-0.5B-v0.0.3b-GGUF
0.5B
•
Updated
•
49
Text Generation
•
684B
•
Updated
•
50
•
1
mradermacher/QuadConnect2.5-0.5B-v0.0.8b-GGUF
0.5B
•
Updated
•
151
Lyte/QuadConnect2.5-0.5B-v0.0.9b
Text Generation
•
0.5B
•
Updated
•
37
mradermacher/QuadConnect2.5-0.5B-v0.0.9b-GGUF
0.5B
•
Updated
•
52
Lyte/QuadConnect2.5-1.5B-v0.1.0b
Text Generation
•
2B
•
Updated
•
42
•
1
mradermacher/QuadConnect2.5-1.5B-v0.1.0b-GGUF
2B
•
Updated
•
62
•
1
mradermacher/Zireal-0-GGUF
mradermacher/Magellanic-Qwen-25B-R999-GGUF
25B
•
Updated
•
39
•
1
mradermacher/Magellanic-Qwen-25B-R999-i1-GGUF
25B
•
Updated
•
97
•
1
VaidikML0508/Shark-Tank-Offer-Evaluator-llama3.2-3B-Instruct-SFT-DPO-4bits-V1
Text Generation
•
3B
•
Updated
•
1
Teen-Different/squiral_maze
Reinforcement Learning
•
Updated