Plans to include additional models?

#1
by SamuraiBarbi - opened

Hello, I'm just inquiring as to whether there are any plans to further update this benchmark/leaderboard with additional models. Would there be any way for us to request models to be tested/benchmarked?

Flowers AI & CogSci Lab org

Hello! I'm doing my best to maintain the leaderboard with the time I have between other projects. 🙂
Absolutely — feel free to suggest models! Ideally, they should be runnable with vLLM and have a context length of at least ~8k tokens. You’re welcome to post suggestions here or open a new issue.
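As a rough pre-check for the context-length criterion before suggesting a model, something like the sketch below could be run against the model's `config.json`. It is only an illustration: the key names are an assumption about fields commonly found in Hugging Face configs, "~8k" is taken to mean 8192 tokens, and the helper names are hypothetical. It does not check whether vLLM can actually run the model.

```python
from typing import Optional

# Assumption: field names that commonly hold the context length in config.json.
# Not every model uses one of these, so a missing key is treated as "unknown".
CONTEXT_KEYS = ("max_position_embeddings", "n_positions", "max_seq_len", "seq_length")


def max_context_length(config: dict) -> Optional[int]:
    """Return the largest context-length value found in the config, or None."""
    values = [config[key] for key in CONTEXT_KEYS if isinstance(config.get(key), int)]
    return max(values) if values else None


def meets_context_requirement(config: dict, min_tokens: int = 8192) -> bool:
    """True only if a known context-length field is at least min_tokens."""
    length = max_context_length(config)
    return length is not None and length >= min_tokens
```

A config loaded with `json.load` can be passed in directly; a model whose config lacks all of these fields is reported as not meeting the requirement and would need a manual check.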

Flowers AI & CogSci Lab org

The models that were easily runnable with vLLM have been added. On another note, keep in mind that this leaderboard captures only one aspect of role play: the population-level stability of value expression across various contexts.

grg changed discussion status to closed
