omkarenator (Omkar Pangarkar)

liked a Space 4 months ago

The Smol Training Playbook

📚

3k

The secrets to building world-class LLMs

liked a dataset 4 months ago

bigcode/the-stack-github-issues

Viewer • Updated Mar 20, 2023 • 31M • 480 • 48

liked a Space 9 months ago

Predict Memory

🧮

106

Calculate and visualize model memory usage from config

liked a dataset 10 months ago

WebOrganizer/Corpus-200B

Preview • Updated Feb 19, 2025 • 9.76k • 11

liked a Space 11 months ago

TxT360: Trillion Extracted Text

📖

132

Explore and download the TxT360 LLM pre‑training dataset

liked a model 12 months ago

mlfoundations/fasttext-oh-eli5

Updated Aug 1, 2024 • 29

liked 2 Spaces about 1 year ago

The Ultra-Scale Playbook

🌌

3.7k

The ultimate guide to training LLM on large GPU Clusters

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

📝

88

Evaluate multilingual models using FineTasks

liked a dataset over 1 year ago

LLM360/TxT360

Updated May 26, 2025 • 14.5k • 248

liked a Space over 1 year ago

FineWeb: decanting the web for the finest text data at scale

🍷

1.3k

Generate a curated web‑text dataset for LLM training

liked 2 datasets over 1 year ago

Trelis/touch-rugby-rules-memorisation

Viewer • Updated Feb 28, 2024 • 363 • 5 • 2

commoncrawl/statistics

Viewer • Updated 25 days ago • 610k • 189 • 25

liked 2 models about 2 years ago

bigcode/starencoder

Updated May 10, 2023 • 27.8k • 57

microsoft/phi-2

Text Generation • 3B • Updated Dec 8, 2025 • 1.45M • 3.43k

liked 5 models over 2 years ago

Omkar Pangarkar

AI & ML interests

Organizations

The Smol Training Playbook

bigcode/the-stack-github-issues

Predict Memory

WebOrganizer/Corpus-200B

TxT360: Trillion Extracted Text

mlfoundations/fasttext-oh-eli5

The Ultra-Scale Playbook

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

LLM360/TxT360

FineWeb: decanting the web for the finest text data at scale

Trelis/touch-rugby-rules-memorisation

commoncrawl/statistics

bigcode/starencoder

microsoft/phi-2

microsoft/phi-1_5

microsoft/phi-1

adept/fuyu-8b

adept/persimmon-8b-base

stanfordnlp/backpack-gpt2

Omkar Pangarkar

AI & ML interests

Organizations

omkarenator's activity

The Smol Training Playbook

Predict Memory

TxT360: Trillion Extracted Text

The Ultra-Scale Playbook

Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks

FineWeb: decanting the web for the finest text data at scale