17 13

Васильев Сергей

miljones2024

AI & ML interests

None yet

Recent Activity

liked a dataset about 13 hours ago

wegrthj/l36l5h-v654-data

liked a model 1 day ago

TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e1-persona-v1-all-tcs-fsx-sm0.1

upvoted a paper 2 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

View all activity

Organizations

None yet

liked a dataset about 13 hours ago

wegrthj/l36l5h-v654-data

Updated less than a minute ago • 20.9k • 3

liked a model 1 day ago

TAUR-dev/rankalign-v6-gemma-2-2b-d0.15-e1-persona-v1-all-tcs-fsx-sm0.1

Text Generation • 3B • Updated 1 day ago • 13 • 1

upvoted a paper 2 days ago

Anti-Self-Distillation for Reasoning RL via Pointwise Mutual Information

Paper • 2605.11609 • Published 11 days ago • 186

liked a model 4 days ago

arepaconcafe/neko-base

Updated about 1 hour ago • 4

liked a dataset 8 days ago

mrmrx/CADS-dataset

Viewer • Updated 1 day ago • 21.8k • 4.97k • 53

upvoted a paper 8 days ago

Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers

Paper • 2605.06169 • Published 16 days ago • 186

upvoted a paper 11 days ago

Repetition over Diversity: High-Signal Data Filtering for Sample-Efficient German Language Modeling

Paper • 2604.28075 • Published 23 days ago • 20

upvoted a paper 15 days ago

MolmoAct2: Action Reasoning Models for Real-world Deployment

Paper • 2605.02881 • Published 19 days ago • 333

liked a model 29 days ago

Bialy17/qwen-finetuned-Reasoning-Socratic-QandA-unsloth

Updated 29 days ago • 1

upvoted a paper 29 days ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published about 1 month ago • 240

liked 2 models about 1 month ago

openbmb/VoxCPM2

Text-to-Speech • Updated Apr 16 • 200k • 1.32k

tencent/HY-Embodied-0.5

Image-Text-to-Text • 4B • Updated Apr 14 • 986 • 906

liked a dataset about 1 month ago

rahuljoy/stack_binary_subset_chat

Viewer • Updated Apr 13 • 2.03k • 12 • 1

liked a model about 1 month ago

mradermacher/Vero-MiMo-7B-i1-GGUF

Reinforcement Learning • 8B • Updated Apr 21 • 552 • 2

upvoted a paper about 1 month ago

GrandCode: Achieving Grandmaster Level in Competitive Programming via Agentic Reinforcement Learning

Paper • 2604.02721 • Published Apr 3 • 629

liked a model about 1 month ago

PhoenixHu/ral_grpo_internvl2_5_how2sign_1b_bleu1_rouge_kl05_temp07_0405_metta

Updated Apr 8 • 1

upvoted 2 papers about 1 month ago

Adam's Law: Textual Frequency Law on Large Language Models

Paper • 2604.02176 • Published Apr 2 • 503

AgentWatcher: A Rule-based Prompt Injection Monitor

Paper • 2604.01194 • Published Apr 1 • 3

liked a model about 2 months ago

Outlier-Ai/Outlier-10B

Text Generation • Updated 17 days ago • 205 • 2

upvoted a paper about 2 months ago

CARLA-Air: Fly Drones Inside a CARLA World -- A Unified Infrastructure for Air-Ground Embodied Intelligence

Paper • 2603.28032 • Published Mar 30 • 342

Васильев Сергей

AI & ML interests

Recent Activity

Organizations

miljones2024's activity