·
AI & ML interests
None yet
Organizations
None yet
models 652
ajagota71/qwen3b-alpaca-sft
Updated
ajagota71/smollm2-360m-saferlhf-ppo-lag-10epoch
Text Generation
• 0.4B • Updated • 1
ajagota71/smollm2-360m-saferlhf-ppo-lag-3epoch
Text Generation
• 0.4B • Updated • 1
ajagota71/smollm2-360m-saferlhf-ppo-1epoch
Text Generation
• 0.4B • Updated ajagota71/tinyllama-saferlhf-ppo-1epoch
Text Generation
• 1B • Updated • 1
Text Generation
• 0.1B • Updated • 7
ajagota71/gemma-3-270m-detox
Reinforcement Learning
• 0.3B • Updated • 2
ajagota71/gemma-3-270m-detox-checkpoint-epoch-100
Reinforcement Learning
• 0.3B • Updated • 3
ajagota71/gemma-3-270m-detox-checkpoint-epoch-80
Reinforcement Learning
• 0.3B • Updated • 3
ajagota71/Qwen2.5-0.5B-detox
Reinforcement Learning
• 0.5B • Updated • 2