7 11 490

weijh

imweijh

AI & ML interests

None yet

Recent Activity

liked a model about 19 hours ago

nvidia/MiniMax-M2.7-NVFP4

liked a model 2 days ago

froggeric/Qwen3.6-27B-MTP-GGUF

liked a model 2 days ago

mudler/Qwen3.6-35B-A3B-APEX-GGUF

View all activity

Organizations

None yet

upvoted an article about 2 months ago

Article

Using OCR models with llama.cpp

ggml-org

•

Apr 10

• 29

upvoted 2 articles 4 months ago

Article

New in llama.cpp: Anthropic Messages API

ggml-org

•

Jan 19

• 45

Article

We Got Claude to Build CUDA Kernels and teach open models!

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 156

upvoted an article 6 months ago

Article

New in llama.cpp: Model Management

ggml-org

•

Dec 11, 2025

• 136

upvoted a collection 6 months ago

Moxin-GGUF

Collection

Moxin x llama.cpp Customized Quant for Large MoEs • 9 items • Updated Feb 27 • 3

upvoted 2 articles 9 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante

•

Aug 5, 2025

• 513

Article

Chat Templates: An End to the Silent Performance Killer

Rocketknight1

•

Oct 3, 2023

• 32

upvoted a changelog 10 months ago

Hugging Face Changelog

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Jul 30, 2025

• 203

upvoted a collection 10 months ago

Multimodal GGUFs

Collection

Vision and audio models compatible with llama-server and llama-mtmd-cli • 16 items • Updated Dec 18, 2025 • 21

upvoted an article over 1 year ago

Article

Code a simple RAG from scratch

ngxson

•

Oct 29, 2024

• 335

upvoted a collection over 1 year ago

Qwen 2.5 Coder

Collection

Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. • 35 items • Updated Apr 22 • 40

weijh

AI & ML interests

Recent Activity

Organizations

imweijh's activity

Using OCR models with llama.cpp

New in llama.cpp: Anthropic Messages API

We Got Claude to Build CUDA Kernels and teach open models!

New in llama.cpp: Model Management

Welcome GPT OSS, the new open-source model family from OpenAI!

Chat Templates: An End to the Silent Performance Killer

Introducing HF Jobs: Run scalable compute jobs on Hugging Face

Code a simple RAG from scratch