Update: chat template
#3
by
kaki-paper
- opened
README.md
CHANGED
@@ -24,7 +24,7 @@ pipeline_tag: image-text-to-text
|
|
24 |
- **Developed by:** NC Research, Multimodal Generation Team
|
25 |
- **Technical Report:** [VARCO-VISION: Expanding Frontiers in Korean Vision-Language Models](https://arxiv.org/pdf/2411.19103)
|
26 |
- **Blog(Korean):** [VARCO-VISION Technical Report Summary](https://ncsoft.github.io/ncresearch/95ad8712e60063e9ac97538504ac3eea0ac530af)
|
27 |
-
- **Demo Page:**
|
28 |
- **Languages:** Korean, English
|
29 |
- **License:** CC BY-NC 4.0
|
30 |
- **Architecture:** VARCO-VISION-14B follows the architecture of [LLaVA-OneVision](https://arxiv.org/abs/2408.03326).
|
@@ -33,14 +33,11 @@ pipeline_tag: image-text-to-text
|
|
33 |
- **Vision Encoder:** [google/siglip-so400m-patch14-384](https://huggingface.co/google/siglip-so400m-patch14-384)
|
34 |
- **Huggingface Version Model:** [NCSOFT/VARCO-VISION-14B-HF](https://huggingface.co/NCSOFT/VARCO-VISION-14B-HF)
|
35 |
- **Korean VLM Benchmarks:**
|
36 |
-
- You can use the following benchmark datasets in the [LLMs-Eval toolkit](https://github.com/EvolvingLMMs-Lab/lmms-eval).
|
37 |
- [NCSOFT/K-MMBench](https://huggingface.co/datasets/NCSOFT/K-MMBench)
|
38 |
- [NCSOFT/K-SEED](https://huggingface.co/datasets/NCSOFT/K-SEED)
|
39 |
- [NCSOFT/K-MMStar](https://huggingface.co/datasets/NCSOFT/K-MMStar)
|
40 |
- [NCSOFT/K-DTCBench](https://huggingface.co/datasets/NCSOFT/K-DTCBench)
|
41 |
- [NCSOFT/K-LLaVA-W](https://huggingface.co/datasets/NCSOFT/K-LLaVA-W)
|
42 |
-
|
43 |
-
- **you can also evaluate VARCO-VISION-14B in the [VLMEval kit](https://github.com/open-compass/VLMEvalKit)**.
|
44 |
- **This model is for research purposes only. Commercial use is prohibited.**
|
45 |
|
46 |
## Uses
|
|
|
24 |
- **Developed by:** NC Research, Multimodal Generation Team
|
25 |
- **Technical Report:** [VARCO-VISION: Expanding Frontiers in Korean Vision-Language Models](https://arxiv.org/pdf/2411.19103)
|
26 |
- **Blog(Korean):** [VARCO-VISION Technical Report Summary](https://ncsoft.github.io/ncresearch/95ad8712e60063e9ac97538504ac3eea0ac530af)
|
27 |
+
- **Demo Page:** [VARCO-VISION HF Space](https://huggingface.co/spaces/NCSOFT/VARCO-VISION-14B)
|
28 |
- **Languages:** Korean, English
|
29 |
- **License:** CC BY-NC 4.0
|
30 |
- **Architecture:** VARCO-VISION-14B follows the architecture of [LLaVA-OneVision](https://arxiv.org/abs/2408.03326).
|
|
|
33 |
- **Vision Encoder:** [google/siglip-so400m-patch14-384](https://huggingface.co/google/siglip-so400m-patch14-384)
|
34 |
- **Huggingface Version Model:** [NCSOFT/VARCO-VISION-14B-HF](https://huggingface.co/NCSOFT/VARCO-VISION-14B-HF)
|
35 |
- **Korean VLM Benchmarks:**
|
|
|
36 |
- [NCSOFT/K-MMBench](https://huggingface.co/datasets/NCSOFT/K-MMBench)
|
37 |
- [NCSOFT/K-SEED](https://huggingface.co/datasets/NCSOFT/K-SEED)
|
38 |
- [NCSOFT/K-MMStar](https://huggingface.co/datasets/NCSOFT/K-MMStar)
|
39 |
- [NCSOFT/K-DTCBench](https://huggingface.co/datasets/NCSOFT/K-DTCBench)
|
40 |
- [NCSOFT/K-LLaVA-W](https://huggingface.co/datasets/NCSOFT/K-LLaVA-W)
|
|
|
|
|
41 |
- **This model is for research purposes only. Commercial use is prohibited.**
|
42 |
|
43 |
## Uses
|