fengzi258 commited on
Commit
26ea7ac
·
verified ·
1 Parent(s): 9cc4480

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -3
README.md CHANGED
@@ -64,13 +64,24 @@ Baichuan-Omni-1.5-Base is a high-performance foundational omni-modal model in th
64
  Notably, the model is fully trained end-to-end using NTP loss in the whole pre-training stage.
65
  - **High-quality Controllable Audio Solution.** Multimodal system prompts have been redesigned to include traditional text system prompts and **speech system prompts** for specifying model sounds. It provides the flexibility to control voice style through text or speech samples at inference time, and supports advanced capabilities such as end-to-end voice cloning and timbre creation.
66
 
67
- **High-quality Medical Image Evaluation Dataset--Openmm-Medical**
 
 
 
 
 
 
 
 
 
 
 
68
 
69
  - We have built a more diverse medical evaluation dataset named **Openmm-Medical** to evaluate large models in medical scenarios.
70
  - The images in Openmm-Medical come from **42 public medical image datasets**, such as ACRIMA (fundus images), BioMediTech (microscope images), and CoronaHack (X-rays).
71
  - **Openmm-Medical contains a total of 88,996 images**, and each image is designed as a **multiple-choice question to facilitate the evaluation of different large models.**
72
  - To promote the development of omnimodal large models in the medical field, we will soon **open** this evaluation dataset.
73
-
74
 
75
  ### Evaluation
76
 
@@ -1086,7 +1097,7 @@ We recommend interested scholars to visit our github repo for more details. [**G
1086
 
1087
 
1088
  ### License
1089
- The community usage of Baichuan-Omni-1.5/Baichuan-Omni-1.5-base requires adherence to [Apache 2.0](https://github.com/baichuan-inc/Baichuan-Omni-1.5/blob/main/LICENSE) and [Community License for Baichuan-Omni-1.5 Models](https://huggingface.co/baichuan-inc/Baichuan2-7B-Base/resolve/main/Baichuan%202%E6%A8%A1%E5%9E%8B%E7%A4%BE%E5%8C%BA%E8%AE%B8%E5%8F%AF%E5%8D%8F%E8%AE%AE.pdf). The Baichuan-Omni-1.5/Baichuan-Omni-1.5-base models supports commercial use. If you plan to use the Baichuan-Omni-1.5/Baichuan-Omni-1.5-base models or its derivatives for commercial purposes, please ensure that your entity meets the following conditions:
1090
 
1091
  1. The Daily Active Users (DAU) of your or your affiliate's service or product is less than 1 million.
1092
  2. Neither you nor your affiliates are software service providers or cloud service providers.
 
64
  Notably, the model is fully trained end-to-end using NTP loss in the whole pre-training stage.
65
  - **High-quality Controllable Audio Solution.** Multimodal system prompts have been redesigned to include traditional text system prompts and **speech system prompts** for specifying model sounds. It provides the flexibility to control voice style through text or speech samples at inference time, and supports advanced capabilities such as end-to-end voice cloning and timbre creation.
66
 
67
+
68
+ ### Open-source Evaluation Datasets
69
+
70
+ **OpenMM-Medical**
71
+
72
+ To comprehensively evaluate the model's multi-modal medical capabilities, we have constructed OpenMM-Medical, which includes data from 42 publicly available medical image datasets such as ACRIMA (retinal images), BioMediTech (microscope images), and CoronaHack (X-rays), totaling 88,996 images.
73
+
74
+ **OpenAudioBench**
75
+
76
+ To efficiently assess the model's "IQ" issues, we developed OpenAudioBench, comprising five end-to-end audio understanding sub-datasets: four public benchmarks (Llama Question, WEB QA, TriviaQA, AlpacaEval), and an internally created speech logical reasoning dataset by the Baichuan team, totaling 2,701 entries. This suite reflects the model's comprehensive "IQ" level.
77
+
78
+ <!-- **High-quality Medical Image Evaluation Dataset--Openmm-Medical**
79
 
80
  - We have built a more diverse medical evaluation dataset named **Openmm-Medical** to evaluate large models in medical scenarios.
81
  - The images in Openmm-Medical come from **42 public medical image datasets**, such as ACRIMA (fundus images), BioMediTech (microscope images), and CoronaHack (X-rays).
82
  - **Openmm-Medical contains a total of 88,996 images**, and each image is designed as a **multiple-choice question to facilitate the evaluation of different large models.**
83
  - To promote the development of omnimodal large models in the medical field, we will soon **open** this evaluation dataset.
84
+ -->
85
 
86
  ### Evaluation
87
 
 
1097
 
1098
 
1099
  ### License
1100
+ The community usage of Baichuan-Omni-1.5/Baichuan-Omni-1.5-base requires adherence to [Apache 2.0](https://github.com/baichuan-inc/Baichuan-Omni-1.5/blob/main/LICENSE) and [Community License for Baichuan-Omni-1.5 Models](https://github.com/baichuan-inc/Baichuan-Omni-1.5/blob/main/LICENSE). The Baichuan-Omni-1.5/Baichuan-Omni-1.5-base models supports commercial use. If you plan to use the Baichuan-Omni-1.5/Baichuan-Omni-1.5-base models or its derivatives for commercial purposes, please ensure that your entity meets the following conditions:
1101
 
1102
  1. The Daily Active Users (DAU) of your or your affiliate's service or product is less than 1 million.
1103
  2. Neither you nor your affiliates are software service providers or cloud service providers.