Update README.md
README.md CHANGED
@@ -109,12 +109,13 @@ pip install flash-attn --no-build-isolation
 Then you could use our model:
 ```python
 from transformers import AutoModel, AutoTokenizer
+import torch
 
 # model setting
 model_path = 'OpenGVLab/VideoChat-Flash-Qwen2-7B_res448'
 
 tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
-model = AutoModel.from_pretrained(model_path, trust_remote_code=True).
+model = AutoModel.from_pretrained(model_path, trust_remote_code=True).to(torch.bfloat16).cuda()
 image_processor = model.get_vision_tower().image_processor
 
 mm_llm_compress = False # use the global compress or not
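A note on the new loading line: chaining `.to(torch.bfloat16).cuda()` converts and moves the weights after they have been fully instantiated. A minimal alternative sketch, assuming the checkpoint's remote code honors the standard `torch_dtype` argument of `from_pretrained` (not verified for this repository), loads the weights in bfloat16 directly:

```python
from transformers import AutoModel, AutoTokenizer
import torch

model_path = 'OpenGVLab/VideoChat-Flash-Qwen2-7B_res448'

tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)

# torch_dtype asks from_pretrained to instantiate the weights in bfloat16,
# so a full-precision copy never has to sit in CPU memory before the move to GPU.
model = AutoModel.from_pretrained(
    model_path,
    trust_remote_code=True,
    torch_dtype=torch.bfloat16,  # assumption: passed through by the model's custom loading code
).cuda()
```

Either form ends with the model in bfloat16 on the GPU, which is what the rest of the quickstart (e.g. `model.get_vision_tower().image_processor`) builds on.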