Hugging Face
Jay Gala
jaygala24
4 followers · 4 following
https://jaygala24.github.io/
AI & ML interests
Machine Learning, Natural Language Processing, Language and Vision Intersection, Fairness and Biases
Recent Activity
updated a dataset • 9 days ago • jaygala24/reasoning-geometry
published a dataset • 11 days ago • jaygala24/reasoning-geometry
updated a collection • 11 days ago • RL post-training
jaygala24's models (25)
jaygala24/Qwen3-4B-DAPO-math-reasoning • Text Generation • 4B • Updated 11 days ago • 683
jaygala24/Qwen3-4B-RLOO-math-reasoning • Text Generation • 4B • Updated 14 days ago • 428
jaygala24/Qwen3-1.7B-RLOO-math-reasoning • Text Generation • 2B • Updated 15 days ago • 953
jaygala24/Qwen2.5-3B-RLOO-math-reasoning • Text Generation • 3B • Updated 15 days ago • 882
jaygala24/Qwen2.5-1.5B-RLOO-math-reasoning • Text Generation • 2B • Updated 15 days ago • 837
jaygala24/Qwen2.5-0.5B-RLOO-math-reasoning • Text Generation • 0.5B • Updated 15 days ago • 782
jaygala24/Qwen3-1.7B-DAPO-math-reasoning • Text Generation • 2B • Updated 15 days ago • 834
jaygala24/Qwen2.5-3B-DAPO-math-reasoning • Text Generation • 3B • Updated 15 days ago • 798
jaygala24/Qwen2.5-1.5B-DAPO-math-reasoning • Text Generation • 2B • Updated 15 days ago • 980
jaygala24/Qwen2.5-0.5B-DAPO-math-reasoning • Text Generation • 0.5B • Updated 15 days ago • 768
jaygala24/Qwen3-4B-ReMax-math-reasoning • Text Generation • 4B • Updated 20 days ago • 901
jaygala24/Qwen3-4B-GRPO-math-reasoning • Text Generation • 4B • Updated 20 days ago • 471
jaygala24/Qwen3-4B-GRPO-KL-math-reasoning • Text Generation • 4B • Updated 20 days ago • 559
jaygala24/Qwen3-1.7B-ReMax-math-reasoning • Text Generation • 2B • Updated 20 days ago • 997
jaygala24/Qwen3-1.7B-GRPO-math-reasoning • Text Generation • 2B • Updated 20 days ago • 434
jaygala24/Qwen3-1.7B-GRPO-KL-math-reasoning • Text Generation • 2B • Updated 20 days ago • 419
jaygala24/Qwen2.5-3B-ReMax-math-reasoning • Text Generation • 3B • Updated 20 days ago • 534
jaygala24/Qwen2.5-3B-GRPO-math-reasoning • Text Generation • 3B • Updated 20 days ago • 374
jaygala24/Qwen2.5-3B-GRPO-KL-math-reasoning • Text Generation • 3B • Updated 20 days ago • 383
jaygala24/Qwen2.5-1.5B-ReMax-math-reasoning • Text Generation • 2B • Updated 20 days ago • 516
jaygala24/Qwen2.5-1.5B-GRPO-math-reasoning • Text Generation • 2B • Updated 20 days ago • 656
jaygala24/Qwen2.5-1.5B-GRPO-KL-math-reasoning • Text Generation • 2B • Updated 20 days ago • 594
jaygala24/Qwen2.5-0.5B-ReMax-math-reasoning • Text Generation • 0.5B • Updated 20 days ago • 502
jaygala24/Qwen2.5-0.5B-GRPO-math-reasoning • Text Generation • 0.5B • Updated 20 days ago • 637
jaygala24/Qwen2.5-0.5B-GRPO-KL-math-reasoning • Text Generation • 0.5B • Updated 20 days ago • 602