harryhsing
/

EchoInk-R1-7B

Model card Files Files and versions

This repository contains the EchoInk-R1-7B model as presented in EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning.

For training and inference, please refer to the Code: https://github.com/HarryHsing/EchoInk

Downloads last month: 3

Safetensors

Model size

9B params

Tensor type

BF16

·

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for harryhsing/EchoInk-R1-7B

Base model

Qwen/Qwen2.5-Omni-7B

Finetuned

(41)

this model

Dataset used to train harryhsing/EchoInk-R1-7B

Paper for harryhsing/EchoInk-R1-7B

EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning

Paper • 2505.04623 • Published May 7, 2025