Image-Text-to-Text
Transformers
Safetensors
multilingual
internvl
custom_code
conversational

Add more descriptive tags to InternVL3.5-1B model card

#1
by nielsr HF Staff - opened
Files changed (1) hide show
  1. README.md +19 -12
README.md CHANGED
@@ -1,18 +1,23 @@
1
  ---
2
- license: apache-2.0
3
- pipeline_tag: image-text-to-text
4
- library_name: transformers
5
  base_model:
6
- - OpenGVLab/InternVL3_5-1B-MPO
7
- base_model_relation: finetune
8
  datasets:
9
- - OpenGVLab/MMPR-v1.2
10
- - OpenGVLab/MMPR-Tiny
11
  language:
12
- - multilingual
 
 
 
13
  tags:
14
- - internvl
15
- - custom_code
 
 
 
 
 
 
16
  ---
17
 
18
  # InternVL3_5-1B
@@ -494,7 +499,9 @@ image_urls=[
494
 
495
  images = [load_image(img_url) for img_url in image_urls]
496
  # Numbering images improves multi-image conversations
497
- response = pipe((f'Image-1: {IMAGE_TOKEN}\nImage-2: {IMAGE_TOKEN}\ndescribe these two images', images))
 
 
498
  print(response.text)
499
  ```
500
 
@@ -596,4 +603,4 @@ If you find this project useful in your research, please consider citing:
596
  journal={arXiv preprint arXiv:2508.18265},
597
  year={2025}
598
  }
599
- ```
 
1
  ---
 
 
 
2
  base_model:
3
+ - OpenGVLab/InternVL3_5-1B-MPO
 
4
  datasets:
5
+ - OpenGVLab/MMPR-v1.2
6
+ - OpenGVLab/MMPR-Tiny
7
  language:
8
+ - multilingual
9
+ library_name: transformers
10
+ license: apache-2.0
11
+ pipeline_tag: image-text-to-text
12
  tags:
13
+ - internvl
14
+ - custom_code
15
+ - multimodal
16
+ - reasoning
17
+ - agent
18
+ - llm
19
+ - efficiency
20
+ base_model_relation: finetune
21
  ---
22
 
23
  # InternVL3_5-1B
 
499
 
500
  images = [load_image(img_url) for img_url in image_urls]
501
  # Numbering images improves multi-image conversations
502
+ response = pipe((f'Image-1: {IMAGE_TOKEN}
503
+ Image-2: {IMAGE_TOKEN}
504
+ describe these two images', images))
505
  print(response.text)
506
  ```
507
 
 
603
  journal={arXiv preprint arXiv:2508.18265},
604
  year={2025}
605
  }
606
+ ```