Tom
TomLucidor
AI & ML interests
None yet
Recent Activity
new activity 1 day ago: z-lab/gemma-4-26B-A4B-it-DFlash: This works with a quantized version?
new activity 1 day ago: z-lab/Qwen3.6-35B-A3B-DFlash: Acceptance rate drop in non-thinking scenarios
This works with a quantized version?
2
#4 opened 3 days ago by khronnuz
Acceptance rate drop in non-thinking scenarios
👍 1
1
#7 opened 17 days ago by lillilyh
DFlash compatibility with AutoRound (W4A16) quantized Qwen3.5-9B in vLLM?
2
#1 opened about 2 months ago by Vishva007
Is there a Qwen3.6 35B A3B FP8 version?
➕ 1
2
#4 opened 18 days ago by whaozl
Qwen3.5 2B/0.5B
1
#1 opened 28 days ago by packetsent
30+ Billion Models?
🚀 3
1
#2 opened 22 days ago by bobbytaylor
Will there be Mini/Nano models as well?
#2 opened 11 days ago by TomLucidor
Open-weights only toggle for Web UI
👍 1
#4 opened about 2 months ago by TomLucidor
Sort by Pass@5 % on web UI
👍 1
#3 opened about 2 months ago by TomLucidor
Can you add in Qwen3.5 and other series of models for testing?
3
#2 opened about 2 months ago by TomLucidor
Some questions on BitNet PTQ
2
#1 opened 3 months ago by TomLucidor
Will there be a base model?
2
#4 opened 3 months ago by zianglih
Has REAM been checked for its resilience to quantization?
4
#1 opened 3 months ago by TomLucidor
How is this different from the other quants?
8
#1 opened 2 months ago by TomLucidor
Will Hybrid Attention RP models get some love?
➕ 1
#7 opened 2 months ago by TomLucidor
ValueError: Model type lfm2_moe not supported.
5
#1 opened 7 months ago by kadirnar
Runs amazingly on the M4 MacBook Air!
1
#1 opened 6 months ago by leo253
Could more benchmarks be done on Instruction Following / Function Calling?
4
#2 opened 3 months ago by TomLucidor
The comparison with the original MTP
👍 1
1
#2 opened 3 months ago by Michalea
Using ZwZ and a better VLM alongside DeepGen
4
#6 opened 3 months ago by TomLucidor