Low Horng Jiun
NickolasLow1
ยท
AI & ML interests
None yet
Recent Activity
reacted
to
sergiopaniego's
post
with ๐ฅ
about 20 hours ago
New REPL environment in OpenEnv available! โจ
Used in the Recursive Language Models (RLM) paper by Alex Zhang.
Ready for inference & post-training using trajectories. Handles long contexts:
> Run Python code in a sandbox
> Make recursive calls to LMs
> Explore data programmatically
> Return final result
Docs: https://meta-pytorch.org/OpenEnv/environments/repl/
Inference script: https://github.com/meta-pytorch/OpenEnv/blob/main/examples/repl_oolong_simple.py
reacted
to
sergiopaniego's
post
with ๐
about 2 months ago
Interested in RL training environments?
We just released a beginner-friendly walkthrough notebook!
Train a model to play Wordle using TRL + OpenEnv (TextArena) + GRPO + vLLM.
happy learning! ๐ฑ
Notebook: https://github.com/huggingface/trl/blob/main/examples/notebooks/openenv_wordle_grpo.ipynb
OpenEnv guide in TRL: https://huggingface.co/docs/trl/main/en/openenv
updated
a model
about 2 months ago
NickolasLow1/Qwen2.5-7B-Instruct
Organizations
None yet