Update README.md
README.md CHANGED

@@ -108,12 +108,13 @@ pip install flash-attn --no-build-isolation
 Then you could use our model:
 ```python
 from transformers import AutoModel, AutoTokenizer
+import torch
 
 # model setting
 model_path = 'OpenGVLab/VideoChat-Flash-Qwen2-7B_res224'
 
 tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
-model = AutoModel.from_pretrained(model_path, trust_remote_code=True).
+model = AutoModel.from_pretrained(model_path, trust_remote_code=True).to(torch.bfloat16).cuda()
 image_processor = model.get_vision_tower().image_processor
 
 mm_llm_compress = False # use the global compress or not
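For reference, the loading snippet as it reads after this change is reproduced below (a minimal sketch covering only the lines visible in the hunk; it assumes a CUDA-capable GPU and the flash-attn install from the preceding README step):

```python
from transformers import AutoModel, AutoTokenizer
import torch

# model setting
model_path = 'OpenGVLab/VideoChat-Flash-Qwen2-7B_res224'

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
# change in this commit: cast the weights to bfloat16 and move the model to the GPU
model = AutoModel.from_pretrained(model_path, trust_remote_code=True).to(torch.bfloat16).cuda()
image_processor = model.get_vision_tower().image_processor

mm_llm_compress = False  # use the global compress or not
```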