Add library name and paper link to model card

This PR adds the `library_name` to the metadata section of the model card, clarifying that this model works with the Hugging Face `transformers` library. It also adds a link to the paper on Hugging Face.

Files changed (1) hide show

README.md +11 -5

README.md CHANGED Viewed

@@ -1,11 +1,13 @@
 ---
 license: llama2
 pipeline_tag: video-text-to-text
 ---
 # Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
 **Paper or resources for more information:**
-[[Paper](https://huggingface.co/papers/2311.08046)] [[Code](https://github.com/PKU-YuanGroup/Chat-UniVi)]
 ## License
 Llama 2 is licensed under the LLAMA 2 Community License,
@@ -133,9 +135,11 @@ if __name__ == '__main__':
         cur_prompt = qs
         if model.config.mm_use_im_start_end:
-            qs = DEFAULT_IM_START_TOKEN + DEFAULT_IMAGE_TOKEN * slice_len + DEFAULT_IM_END_TOKEN + '\n' + qs
         else:
-            qs = DEFAULT_IMAGE_TOKEN * slice_len + '\n' + qs
         conv = conv_templates[conv_mode].copy()
         conv.append_message(conv.roles[0], qs)
@@ -224,9 +228,11 @@ if __name__ == '__main__':
     if image_path is not None:
         cur_prompt = qs
         if model.config.mm_use_im_start_end:
-            qs = DEFAULT_IM_START_TOKEN + DEFAULT_IMAGE_TOKEN + DEFAULT_IM_END_TOKEN + '\n' + qs
         else:
-            qs = DEFAULT_IMAGE_TOKEN + '\n' + qs
         conv = conv_templates[conv_mode].copy()
         conv.append_message(conv.roles[0], qs)

 ---
 license: llama2
 pipeline_tag: video-text-to-text
+library_name: transformers
 ---
 # Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
 **Paper or resources for more information:**
+[[Paper](https://huggingface.co/papers/2503.04504)] [[Code](https://github.com/PKU-YuanGroup/Chat-UniVi)]
 ## License
 Llama 2 is licensed under the LLAMA 2 Community License,
         cur_prompt = qs
         if model.config.mm_use_im_start_end:
+            qs = DEFAULT_IM_START_TOKEN + DEFAULT_IMAGE_TOKEN * slice_len + DEFAULT_IM_END_TOKEN + '
+' + qs
         else:
+            qs = DEFAULT_IMAGE_TOKEN * slice_len + '
+' + qs
         conv = conv_templates[conv_mode].copy()
         conv.append_message(conv.roles[0], qs)
     if image_path is not None:
         cur_prompt = qs
         if model.config.mm_use_im_start_end:
+            qs = DEFAULT_IM_START_TOKEN + DEFAULT_IMAGE_TOKEN + DEFAULT_IM_END_TOKEN + '
+' + qs
         else:
+            qs = DEFAULT_IMAGE_TOKEN + '
+' + qs
         conv = conv_templates[conv_mode].copy()
         conv.append_message(conv.roles[0], qs)