Multi-modal Multilingual Instruction

university

https://m3-it.github.io

Activity Feed Request to join this org

AI & ML interests

None defined yet.

Recent Activity

yaolily submitted a paper about 1 month ago

TimeChat-Captioner: Scripting Multi-Scene Videos with Time-Aware and Structural Audio-Visual Captions

tobiaslee authored a paper 3 months ago

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

tobiaslee submitted a paper 3 months ago

GroundingME: Exposing the Visual Grounding Gap in MLLMs through Multi-Dimensional Evaluation

View all activity

Collections 1

spaces 1

VL RewardBench

Explore vision-language model performance on VL-RewardBench

models 9

MMInstruction/Qwen2-VL-72B-Video-T3

73B • Updated Dec 23, 2024 • 5

MMInstruction/Giraffe

8B • Updated Dec 17, 2024 • 4 • 2

MMInstruction/LongVA-7B-Video-T3

8B • Updated Oct 26, 2024 • 67

MMInstruction/Qwen-VL-ArXivCap

Text Generation • Updated May 6, 2024 • 4 • 4

MMInstruction/Qwen-VL-ArXivQA

Text Generation • Updated May 6, 2024 • 5 • 4

MMInstruction/Silkie

Text Generation • Updated Dec 20, 2023 • 13 • 12

MMInstruction/YingVLM

Updated Aug 16, 2023 • 13 • 1

MMInstruction/YingVLM-zh

Updated Aug 10, 2023 • 9

MMInstruction/YingVLM-Video

Updated Aug 10, 2023 • 8

datasets 17

MMInstruction/stock_factors

Viewer • Updated Dec 8, 2025 • 48.2M • 66 • 1

MMInstruction/OSWorld-G

Viewer • Updated May 22, 2025 • 510 • 113 • 6

MMInstruction/VL-RewardBench

Viewer • Updated May 19, 2025 • 1.25k • 494 • 14

MMInstruction/Video-T3-QA

Viewer • Updated Feb 24, 2025 • 162k • 115 • 2

MMInstruction/SuperClevr_Val

Viewer • Updated Feb 18, 2025 • 5k • 66 • 1

MMInstruction/Clevr_CoGenT_TrainA_R1

Viewer • Updated Feb 13, 2025 • 37.8k • 340 • 48

MMInstruction/Clevr_CoGenT_TrainA_70K_Complex

Viewer • Updated Feb 5, 2025 • 70k • 647 • 8

MMInstruction/Clevr_CoGenT_ValB

Viewer • Updated Feb 3, 2025 • 5k • 13 • 2

MMInstruction/Clevr_CoGenT_ValA

Viewer • Updated Feb 3, 2025 • 5k • 374 • 1

MMInstruction/Clevr_CoAgent_TrainA_R1

Viewer • Updated Feb 2, 2025 • 2.5k • 9

View 17 datasets