Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
10
33
84
Sukesh Perla
hitchhiker3010
Follow
Mi6paulino's profile picture
Shivanininin's profile picture
AmitUtsaah's profile picture
10 followers
ยท
34 following
hitchhiker3010
hitchhiker3010
sukesh-perla
AI & ML interests
None yet
Recent Activity
updated
a collection
23 days ago
AI Ads
reacted
to
sergiopaniego
's
post
with ๐ฅ
about 1 month ago
New TRL + OpenEnv example! ๐ฅ Fine tune an LLM for playing Sudoku using an RL env via OpenEnv Includes a script that runs on 1 or multiple GPUs with vLLM, plus a Colab-ready notebook. Enjoy! Notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/openenv_sudoku_grpo.ipynb Script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/sudoku.py
upvoted
an
article
about 1 month ago
Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective
View all activity
Organizations
hitchhiker3010
's Spaces
2
Sort:ย Recently updated
Sleeping
Token Visualizer
๐
Visualize tokens from text using a tokenizer
Runtime error
Quickdraw
๐