7 1 3

Yansong Shi

nanamma

https://huggingface.co/nanamma

AI & ML interests

multi modality, video understanding, robotics

Recent Activity

upvoted a paper 6 days ago

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

authored a paper 2 months ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

authored a paper 2 months ago

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

View all activity

Organizations

upvoted a paper 6 days ago

InternVideo-Next: Towards General Video Foundation Models without Video-Text Supervision

Paper • 2512.01342 • Published 7 days ago • 14

authored 2 papers 2 months ago

InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding

Paper • 2403.15377 • Published Mar 22, 2024 • 26

TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning

Paper • 2410.19702 • Published Oct 25, 2024 • 1

New activity in qiukingballball/RoboCerebra 4 months ago

how to test

#4 opened 4 months ago by

nanamma

liked a dataset 12 months ago

Mutonix/Vript

Viewer • Updated Jun 11, 2024 • 409k • 6.77k • 24

New activity in Enxin/MovieChat-1K_train about 1 year ago

so many quote '"' in captions in json files

#2 opened about 1 year ago by

nanamma

updated a collection about 1 year ago

VideoChat

Collection

Chat-Centric Video Understanding • 8 items • Updated Sep 28 • 3

updated 2 models about 1 year ago

OpenGVLab/ViCLIP-L-14-hf

0.4B • Updated Sep 17, 2024 • 26k • 1

OpenGVLab/ViCLIP-B-16-hf

0.1B • Updated Sep 17, 2024 • 99 • 1

updated a collection about 1 year ago

InternVid

Collection

A Large-Scale Video-Text Dataset • 7 items • Updated Sep 28

updated a model over 1 year ago

nanamma/umt_0907

Updated Sep 7, 2024

New activity in openbmb/RLHF-V over 1 year ago

key_error "beit3_llava"

#4 opened over 1 year ago by

nanamma

New activity in liuhaotian/LLaVA-Instruct-150K over 1 year ago

请问哪里可以下载 LLaVA-150K 的图片

👍 15

#4 opened about 2 years ago by

flashgoy

liked a Space over 1 year ago

Open LLM Leaderboard

🏆

13.7k

Track, rank and evaluate open LLMs and chatbots

liked a model about 2 years ago

google-t5/t5-base

Translation • 0.2B • Updated Feb 14, 2024 • 2.23M • • 757

Yansong Shi

AI & ML interests

Recent Activity

Organizations

nanamma's activity

how to test

so many quote '"' in captions in json files

key_error "beit3_llava"

请问哪里可以下载 LLaVA-150K 的图片

Open LLM Leaderboard