DeepSeek-GRM

BBQGOD/DeepSeek-GRM-16B
Text Generation • 16B • Updated Sep 1, 2025 • 312 • 4

BBQGOD/DeepSeek-GRM-27B
Text Generation • 28B • Updated Sep 1, 2025 • 72 • 4

BBQGOD/DeepSeek-GRM-27B-MetaRM
Text Classification • 27B • Updated Sep 1, 2025 • 7 • 3

Inference-Time Scaling for Generalist Reward Modeling
Paper • 2504.02495 • Published Apr 3, 2025 • 58