NCSOFT/VARCO-VISION-14B

Update: chat template

#3

by kaki-paper - opened Dec 12, 2024

base: refs/heads/main ← from: refs/pr/3

README.md +1 -4

@@ -24,7 +24,7 @@ pipeline_tag: image-text-to-text
 - **Developed by:** NC Research, Multimodal Generation Team
 - **Technical Report:** [VARCO-VISION: Expanding Frontiers in Korean Vision-Language Models](https://arxiv.org/pdf/2411.19103)
 - **Blog(Korean):** [VARCO-VISION Technical Report Summary](https://ncsoft.github.io/ncresearch/95ad8712e60063e9ac97538504ac3eea0ac530af)
-- **Demo Page:** *The demo page is no longer available.*
 - **Languages:** Korean, English
 - **License:** CC BY-NC 4.0
 - **Architecture:** VARCO-VISION-14B follows the architecture of [LLaVA-OneVision](https://arxiv.org/abs/2408.03326).
@@ -33,14 +33,11 @@ pipeline_tag: image-text-to-text
   - **Vision Encoder:** [google/siglip-so400m-patch14-384](https://huggingface.co/google/siglip-so400m-patch14-384)
 - **Huggingface Version Model:** [NCSOFT/VARCO-VISION-14B-HF](https://huggingface.co/NCSOFT/VARCO-VISION-14B-HF)
 - **Korean VLM Benchmarks:**
-  - You can use the following benchmark datasets in the [LLMs-Eval toolkit](https://github.com/EvolvingLMMs-Lab/lmms-eval).
   - [NCSOFT/K-MMBench](https://huggingface.co/datasets/NCSOFT/K-MMBench)
   - [NCSOFT/K-SEED](https://huggingface.co/datasets/NCSOFT/K-SEED)
   - [NCSOFT/K-MMStar](https://huggingface.co/datasets/NCSOFT/K-MMStar)
   - [NCSOFT/K-DTCBench](https://huggingface.co/datasets/NCSOFT/K-DTCBench)
   - [NCSOFT/K-LLaVA-W](https://huggingface.co/datasets/NCSOFT/K-LLaVA-W)
-- **you can also evaluate VARCO-VISION-14B in the [VLMEval kit](https://github.com/open-compass/VLMEvalKit)**.
 - **This model is for research purposes only. Commercial use is prohibited.**
 ## Uses

 - **Developed by:** NC Research, Multimodal Generation Team
 - **Technical Report:** [VARCO-VISION: Expanding Frontiers in Korean Vision-Language Models](https://arxiv.org/pdf/2411.19103)
 - **Blog(Korean):** [VARCO-VISION Technical Report Summary](https://ncsoft.github.io/ncresearch/95ad8712e60063e9ac97538504ac3eea0ac530af)
+- **Demo Page:** [VARCO-VISION HF Space](https://huggingface.co/spaces/NCSOFT/VARCO-VISION-14B)
 - **Languages:** Korean, English
 - **License:** CC BY-NC 4.0
 - **Architecture:** VARCO-VISION-14B follows the architecture of [LLaVA-OneVision](https://arxiv.org/abs/2408.03326).
   - **Vision Encoder:** [google/siglip-so400m-patch14-384](https://huggingface.co/google/siglip-so400m-patch14-384)
 - **Huggingface Version Model:** [NCSOFT/VARCO-VISION-14B-HF](https://huggingface.co/NCSOFT/VARCO-VISION-14B-HF)
 - **Korean VLM Benchmarks:**
   - [NCSOFT/K-MMBench](https://huggingface.co/datasets/NCSOFT/K-MMBench)
   - [NCSOFT/K-SEED](https://huggingface.co/datasets/NCSOFT/K-SEED)
   - [NCSOFT/K-MMStar](https://huggingface.co/datasets/NCSOFT/K-MMStar)
   - [NCSOFT/K-DTCBench](https://huggingface.co/datasets/NCSOFT/K-DTCBench)
   - [NCSOFT/K-LLaVA-W](https://huggingface.co/datasets/NCSOFT/K-LLaVA-W)
 - **This model is for research purposes only. Commercial use is prohibited.**
 ## Uses