baichuan-inc/Baichuan-Omni-1d5

fengzi258 committed 14 days ago

Commit 26ea7ac (verified) · 1 parent: 9cc4480

Update README.md

README.md +14 -3

@@ -64,13 +64,24 @@ Baichuan-Omni-1.5-Base is a high-performance foundational omni-modal model in th
 Notably, the model is fully trained end-to-end using NTP loss in the whole pre-training stage.
 - **High-quality Controllable Audio Solution.** Multimodal system prompts have been redesigned to include traditional text system prompts and **speech system prompts** for specifying model sounds. It provides the flexibility to control voice style through text or speech samples at inference time, and supports advanced capabilities such as end-to-end voice cloning and timbre creation.
-**High-quality Medical Image Evaluation Dataset--Openmm-Medical**
+### Open-source Evaluation Datasets
+**OpenMM-Medical**
+To comprehensively evaluate the model's multi-modal medical capabilities, we have constructed OpenMM-Medical, which includes data from 42 publicly available medical image datasets such as ACRIMA (retinal images), BioMediTech (microscope images), and CoronaHack (X-rays), totaling 88,996 images.
+**OpenAudioBench**
+To efficiently assess the model's "IQ" issues, we developed OpenAudioBench, comprising five end-to-end audio understanding sub-datasets: four public benchmarks (Llama Question, WEB QA, TriviaQA, AlpacaEval), and an internally created speech logical reasoning dataset by the Baichuan team, totaling 2,701 entries. This suite reflects the model's comprehensive "IQ" level.
+<!-- **High-quality Medical Image Evaluation Dataset--Openmm-Medical**
 - We have built a more diverse medical evaluation dataset named **Openmm-Medical** to evaluate large models in medical scenarios.
 - The images in Openmm-Medical come from **42 public medical image datasets**, such as ACRIMA (fundus images), BioMediTech (microscope images), and CoronaHack (X-rays).
 - **Openmm-Medical contains a total of 88,996 images**, and each image is designed as a **multiple-choice question to facilitate the evaluation of different large models.**
 - To promote the development of omnimodal large models in the medical field, we will soon **open** this evaluation dataset.
+ -->
 ### Evaluation
@@ -1086,7 +1097,7 @@ We recommend interested scholars to visit our github repo for more details. [**G
 ### License
-The community usage of Baichuan-Omni-1.5/Baichuan-Omni-1.5-base requires adherence to [Apache 2.0](https://github.com/baichuan-inc/Baichuan-Omni-1.5/blob/main/LICENSE) and [Community License for Baichuan-Omni-1.5 Models](https://huggingface.co/baichuan-inc/Baichuan2-7B-Base/resolve/main/Baichuan%202%E6%A8%A1%E5%9E%8B%E7%A4%BE%E5%8C%BA%E8%AE%B8%E5%8F%AF%E5%8D%8F%E8%AE%AE.pdf). The Baichuan-Omni-1.5/Baichuan-Omni-1.5-base models supports commercial use. If you plan to use the Baichuan-Omni-1.5/Baichuan-Omni-1.5-base models or its derivatives for commercial purposes, please ensure that your entity meets the following conditions:
+The community usage of Baichuan-Omni-1.5/Baichuan-Omni-1.5-base requires adherence to [Apache 2.0](https://github.com/baichuan-inc/Baichuan-Omni-1.5/blob/main/LICENSE) and [Community License for Baichuan-Omni-1.5 Models](https://github.com/baichuan-inc/Baichuan-Omni-1.5/blob/main/LICENSE). The Baichuan-Omni-1.5/Baichuan-Omni-1.5-base models supports commercial use. If you plan to use the Baichuan-Omni-1.5/Baichuan-Omni-1.5-base models or its derivatives for commercial purposes, please ensure that your entity meets the following conditions:
   1. The Daily Active Users (DAU) of your or your affiliate's service or product is less than 1 million.
   2. Neither you nor your affiliates are software service providers or cloud service providers.