OpenGVLab
/

InternVL3_5-1B-HF

@@ -1,18 +1,23 @@
 ---
-license: apache-2.0
-pipeline_tag: image-text-to-text
-library_name: transformers
 base_model:
-  - OpenGVLab/InternVL3_5-1B-MPO
-base_model_relation: finetune
 datasets:
-  - OpenGVLab/MMPR-v1.2
-  - OpenGVLab/MMPR-Tiny
 language:
-  - multilingual
 tags:
-  - internvl
-  - custom_code
 ---
 # InternVL3_5-1B
@@ -494,7 +499,9 @@ image_urls=[
 images = [load_image(img_url) for img_url in image_urls]
 # Numbering images improves multi-image conversations
-response = pipe((f'Image-1: {IMAGE_TOKEN}\nImage-2: {IMAGE_TOKEN}\ndescribe these two images', images))
 print(response.text)
 ```
@@ -596,4 +603,4 @@ If you find this project useful in your research, please consider citing:
   journal={arXiv preprint arXiv:2508.18265},
   year={2025}
 }
-```

 ---
 base_model:
+- OpenGVLab/InternVL3_5-1B-MPO
 datasets:
+- OpenGVLab/MMPR-v1.2
+- OpenGVLab/MMPR-Tiny
 language:
+- multilingual
+library_name: transformers
+license: apache-2.0
+pipeline_tag: image-text-to-text
 tags:
+- internvl
+- custom_code
+- multimodal
+- reasoning
+- agent
+- llm
+- efficiency
+base_model_relation: finetune
 ---
 # InternVL3_5-1B
 images = [load_image(img_url) for img_url in image_urls]
 # Numbering images improves multi-image conversations
+response = pipe((f'Image-1: {IMAGE_TOKEN}
+Image-2: {IMAGE_TOKEN}
+describe these two images', images))
 print(response.text)
 ```
   journal={arXiv preprint arXiv:2508.18265},
   year={2025}
 }
+```