Video-Text-to-Text
Transformers
Safetensors
English
internvl_chat
feature-extraction
multimodal
custom_code
Eval Results (legacy)
Instructions to use OpenGVLab/InternVideo2_5_Chat_8B with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use OpenGVLab/InternVideo2_5_Chat_8B with Transformers:
# Load model directly from transformers import AutoModel model = AutoModel.from_pretrained("OpenGVLab/InternVideo2_5_Chat_8B", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
| { | |
| "</box>": 92552, | |
| "</img>": 92545, | |
| "</quad>": 92548, | |
| "</ref>": 92550, | |
| "<IMG_CONTEXT>": 92546, | |
| "<box>": 92551, | |
| "<box_begin>": 92553, | |
| "<boxes>": 92557, | |
| "<img>": 92544, | |
| "<quad>": 92547, | |
| "<ref>": 92549, | |
| "<temp>": 92558, | |
| "<time_begin>": 92554, | |
| "<track_begin>": 92555, | |
| "<track_box>": 92556, | |
| "<tracking>": 92559 | |
| } | |