Low Horng Jiun's picture

1

Low Horng Jiun

NickolasLow1

·

AI & ML interests

None yet

Recent Activity

reacted to sergiopaniego's post with 🔥 about 20 hours ago

New REPL environment in OpenEnv available! ✨ Used in the Recursive Language Models (RLM) paper by Alex Zhang. Ready for inference & post-training using trajectories. Handles long contexts: > Run Python code in a sandbox > Make recursive calls to LMs > Explore data programmatically > Return final result Docs: https://meta-pytorch.org/OpenEnv/environments/repl/ Inference script: https://github.com/meta-pytorch/OpenEnv/blob/main/examples/repl_oolong_simple.py

reacted to sergiopaniego's post with 👍 about 2 months ago

Interested in RL training environments? We just released a beginner-friendly walkthrough notebook! Train a model to play Wordle using TRL + OpenEnv (TextArena) + GRPO + vLLM. happy learning! 🌱 Notebook: https://github.com/huggingface/trl/blob/main/examples/notebooks/openenv_wordle_grpo.ipynb OpenEnv guide in TRL: https://huggingface.co/docs/trl/main/en/openenv

updated a model about 2 months ago

NickolasLow1/Qwen2.5-7B-Instruct

View all activity

Organizations

None yet

NickolasLow1 's datasets

None public yet