ilho ahn

soleaf

https://flonelin.wordpress.com

soleaf

AI & ML interests

NLP, Vision, Multimodal models

Recent Activity

liked a Space 7 days ago

nanotron/ultrascale-playbook

updated a collection 7 days ago

ReviewTheme Models V1

updated a collection 7 days ago

ReviewTheme Models V1

View all activity

Organizations

liked a Space 7 days ago

The Ultra-Scale Playbook

🌌

3.6k

The ultimate guide to training LLM on large GPU Clusters

updated 3 collections 7 days ago

upvoted a paper 5 months ago

Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination

Paper • 2507.10532 • Published Jul 14 • 89

updated a model 7 months ago

oliveyoung-ai/review-positive-cls

0.1B • Updated May 28 • 21

published a model 7 months ago

oliveyoung-ai/review-positive-cls

0.1B • Updated May 28 • 21

updated a Space 8 months ago

README

👀

published a Space 8 months ago

README

👀

upvoted an article 8 months ago

Article

The 4 Things Qwen-3’s Chat Template Teaches Us

Apr 30

•

upvoted 5 papers over 1 year ago

Automated Design of Agentic Systems

Paper • 2408.08435 • Published Aug 15, 2024 • 40

To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 45

TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17, 2024 • 52

Self-Play Preference Optimization for Language Model Alignment

Paper • 2405.00675 • Published May 1, 2024 • 28

MoDE: CLIP Data Experts via Clustering

Paper • 2404.16030 • Published Apr 24, 2024 • 15

ilho ahn

AI & ML interests

Recent Activity

Organizations

soleaf's activity

The Ultra-Scale Playbook

README

README

The 4 Things Qwen-3’s Chat Template Teaches Us