OpenLearnLM's Collections
PedagogyRL-Experiments
Updated 19 days ago
OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_reward_grpo_step_300 • 8B • Updated Jul 9, 2025 • 15
OpenLearnLM/deepseek_qwen3_8b_pedagogical_think_noreward_grpo_step_300 • 8B • Updated Jul 9, 2025 • 20
OpenLearnLM/deepseek_qwen3_8b_think_noreward_grpo_step_300 • 8B • Updated Jul 9, 2025 • 22
OpenLearnLM/deepseek_qwen3_8b_think_reward_grpo_step_300 • 8B • Updated Jul 9, 2025 • 49
OpenLearnLM/qwen2.5_7b_nothink_noreward_grpo_step_300 • 8B • Updated 19 days ago • 45