jasonjiang

mikinyaa

jasonjiang8866

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 1 day ago

Mutual Forcing: Dual-Mode Self-Evolution for Fast Autoregressive Audio-Video Character Generation

Paper • 2604.25819 • Published 4 days ago • 16

upvoted 2 papers 2 days ago

SketchVLM: Vision language models can annotate images to explain thoughts and guide users

Paper • 2604.22875 • Published 9 days ago • 31

dWorldEval: Scalable Robotic Policy Evaluation via Discrete Diffusion World Model

Paper • 2604.22152 • Published 8 days ago • 4

upvoted a paper 3 days ago

Recursive Multi-Agent Systems

Paper • 2604.25917 • Published 4 days ago • 233

upvoted 2 papers 4 days ago

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

Paper • 2604.13602 • Published 17 days ago • 31

Agentic World Modeling: Foundations, Capabilities, Laws, and Beyond

Paper • 2604.22748 • Published 8 days ago • 217

upvoted 2 papers 5 days ago

EvoMaster: A Foundational Agent Framework for Building Evolving Autonomous Scientific Agents at Scale

Paper • 2604.17406 • Published 13 days ago • 5

PlayCoder: Making LLM-Generated GUI Code Playable

Paper • 2604.19742 • Published 11 days ago • 26

upvoted 2 papers 9 days ago

Tstars-Tryon 1.0: Robust and Realistic Virtual Try-On for Diverse Fashion Items

Paper • 2604.19748 • Published 11 days ago • 248

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

Paper • 2604.18564 • Published 12 days ago • 43

upvoted an article 10 days ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

16 days ago

upvoted a paper 10 days ago

OneVL: One-Step Latent Reasoning and Planning with Vision-Language Explanation

Paper • 2604.18486 • Published 12 days ago • 89

upvoted 2 papers 11 days ago

Agent-World: Scaling Real-World Environment Synthesis for Evolving General Agent Intelligence

Paper • 2604.18292 • Published 12 days ago • 81

Qwen3.5-Omni Technical Report

Paper • 2604.15804 • Published 15 days ago • 56

upvoted a paper 18 days ago

FP4 Explore, BF16 Train: Diffusion Reinforcement Learning via Efficient Rollout Scaling

Paper • 2604.06916 • Published 24 days ago • 34

upvoted an article 21 days ago

Article

Using OCR models with llama.cpp

21 days ago

upvoted a paper 21 days ago

RAGEN-2: Reasoning Collapse in Agentic RL

Paper • 2604.06268 • Published 25 days ago • 66

upvoted a paper 24 days ago

OpenWorldLib: A Unified Codebase and Definition of Advanced World Models

Paper • 2604.04707 • Published 26 days ago • 203

upvoted a paper 25 days ago

Self-Distilled RLVR

Paper • 2604.03128 • Published 29 days ago • 169

upvoted a paper 26 days ago

Think, Act, Build: An Agentic Framework with Vision Language Models for Zero-Shot 3D Visual Grounding

Paper • 2604.00528 • Published about 1 month ago • 12