AI & ML interests
Video generation Video reasoning
Recent Activity
VMEvalKit π₯π§
A framework to evaluate reasoning capabilities in video generation models at scale.
Invitation to Collaborate π€
VMEvalKit is meant to be a permissively open-source shared playground for everyone. If youβre interested in machine cognition, video models, evaluation, or anything anything π¦β¨, weβd love to build with you:
- π§ͺ Add new reasoning tasks (planning, causality, social, physical, etc.)
- π₯ Plug in new video models (APIs or open-source)
- π Experiment with better evaluation metrics and protocols
- π§± Improve infrastructure, logging, and the web dashboard
- π Use VMEvalKit in your own research and share back configs/scripts
- ππ Or Anything anything π¦β¨
π¬ Join us on Slack to ask questions, propose ideas, or start a collab: Slack Invite π
Research
Here we keep track of papers spinned off from this code infrastructure and some works in progress.
This paper implements our experimental framework and demonstrates that leading video generation models (Sora-2 etc) can perform visual reasoning tasks with >60% success rates. See results.
License
Apache 2.0
Citation
If you find VMEvalKit useful in your research, please cite:
@misc{VMEvalKit,
author = {VMEvalKit Team},
title = {VMEvalKit: A framework for evaluating reasoning abilities in foundational video models},
year = {2025},
howpublished = {\url{https://github.com/Video-Reason/VMEvalKit}}
}