2 3 12

Wang Yuanqiu

dadaniel

DanielDaniel2201

AI & ML interests

None yet

Recent Activity

liked a model about 2 months ago

dx8152/Qwen-Edit-2509-Multiple-angles

liked a model 6 months ago

Dream-org/Dream-v0-Instruct-7B

new activity 6 months ago

taobao-mnn/InternVL2_5-1B-MNN:Guide on converting original internVL into mnn format

View all activity

Organizations

None yet

liked a model about 2 months ago

dx8152/Qwen-Edit-2509-Multiple-angles

Image-to-Image • Updated Nov 28, 2025 • 61k • • 854

liked a model 6 months ago

Dream-org/Dream-v0-Instruct-7B

Text Generation • 8B • Updated Jul 15, 2025 • 41.1k • 146

New activity in taobao-mnn/InternVL2_5-1B-MNN 6 months ago

Guide on converting original internVL into mnn format

#1 opened 6 months ago by

dadaniel

liked 2 models 6 months ago

taobao-mnn/InternVL2_5-1B-MNN

Text Generation • Updated Apr 27, 2025 • 15 • 2

OpenGVLab/InternVL3-1B

Image-Text-to-Text • 0.9B • Updated Sep 11, 2025 • 119k • 77

upvoted a paper 7 months ago

Beyond the 80/20 Rule: High-Entropy Minority Tokens Drive Effective Reinforcement Learning for LLM Reasoning

Paper • 2506.01939 • Published Jun 2, 2025 • 187

liked a Space 7 months ago

Dots Demo

💻

145

Generate text responses to your queries

updated a model 7 months ago

dadaniel/Qwen3-4B-GRPO-Unsloth-16bit

Text Generation • 4B • Updated Jun 5, 2025 • 5

published a model 7 months ago

dadaniel/Qwen3-4B-GRPO-Unsloth-16bit

Text Generation • 4B • Updated Jun 5, 2025 • 5

updated a model 7 months ago

dadaniel/Qwen3-4B-GRPO-Unsloth-LoRA

Updated Jun 5, 2025

published a model 7 months ago

dadaniel/Qwen3-4B-GRPO-Unsloth-LoRA

Updated Jun 5, 2025

upvoted an article 7 months ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

Feb 7, 2025

•

266

upvoted a collection 7 months ago

MiMo-VL

Collection

6 items • Updated 16 days ago • 38

liked 2 models 7 months ago

XiaomiMiMo/MiMo-VL-7B-RL

Image-Text-to-Text • 8B • Updated Jun 7, 2025 • 1.45k • 167

deepseek-ai/DeepSeek-R1-0528

Text Generation • 685B • Updated May 29, 2025 • 352k • • 2.39k

liked a model 8 months ago

Qwen/Qwen3-235B-A22B

Text Generation • 235B • Updated Jul 26, 2025 • 273k • • 1.07k

liked a Space 9 months ago

EasyControl Ghibli

🦀

1.45k

New Ghibli EasyControl model is now released!!

liked a model 10 months ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 489k • • 12.9k

liked a model 11 months ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated Feb 1, 2025 • 67.2k • 3.55k

liked a model almost 2 years ago

keras-io/tab_transformer

Tabular Classification • Updated Jul 9, 2024 • 27 • 41