Chong Zhang commited on
Commit
d770f8d
·
verified ·
1 Parent(s): f47fe9a
Files changed (1) hide show
  1. README.md +28 -5
README.md CHANGED
@@ -1,3 +1,12 @@
 
 
 
 
 
 
 
 
 
1
  [//]: # (# InspireMusic)
2
  <p align="center">
3
  <a href="https://github.com/FunAudioLLM/InspireMusic" target="_blank">
@@ -18,8 +27,8 @@
18
  <a href="https://github.com/FunAudioLLM/InspireMusic" target="_blank">
19
  <img alt="Code" src="https://img.shields.io/badge/Code%20⭐-InspireMusic?labelColor=%20%237372EB&label=InspireMusic&color=%20%235462eb"></a>
20
 
21
- <a href="https://huggingface.co/FunAudioLLM/InspireMusic-1.5B-Long" target="_blank">
22
- <img alt="Model" src="https://img.shields.io/badge/InspireMusic-Model-orange"></a>
23
 
24
  <a href="https://arxiv.org/abs/" target="_blank">
25
  <img alt="Paper" src="https://img.shields.io/badge/arXiv-Paper-lightgrey"></a>
@@ -76,7 +85,8 @@ InspireMusic is a fundamental AIGC toolkit designed for music, song, and audio g
76
  <a name="What's News"></a>
77
  ## What's New 🔥
78
 
79
- [//]: # (- 2024/12: The [InspireMusic-Base]&#40;https://huggingface.co/FunAudioLLM/InspireMusic-Base&#41; voice understanding model is open-sourced, which offers high quality, diverse text style, music structure, music genre control capability. )
 
80
  - 2024/11: Welcome to preview 👉🏻 [**InspireMusic Demos**](https://iris2c.github.io/InspireMusic) 👈🏻. We're excited to share this with you and are working hard to bring even more features and models soon. Your support and feedback mean a lot to us!
81
  - 2024/11: We are thrilled to announce the open-sourcing of the **InspireMusic** [code repository](https://github.com/FunAudioLLM/InspireMusic) and [demos](https://iris2c.github.io/InspireMusic). **InspireMusic** is a unified framework for music, song, and audio generation, featuring capabilities such as text-to-music conversion, music structure, genre control, and timestamp management. InspireMusic stands out for its exceptional music generation and instruction-following abilities.
82
 
@@ -150,9 +160,23 @@ If you are an expert in this field, and you are only interested in training your
150
  ``` sh
151
  # git模型下载,请确保已安装git lfs
152
  mkdir -p pretrained_models
153
- git clone https://www.modelscope.cn/iic/InspireMusic-1.5B-Long.git pretrained_models/InspireMusic-1.5B-Long
154
  ```
155
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
156
  ## Basic Usage
157
 
158
  At the moment, InspireMusic contains the training code and inference code for [music generation](https://github.com/FunAudioLLM/InspireMusic/tree/main/examples/music_generation). More tasks such as song generation and audio generation will be supported in future.
@@ -255,6 +279,5 @@ for task in 'text-to-music' 'continuation'; do
255
  done
256
  ```
257
 
258
-
259
  ## Disclaimer
260
  The content provided above is for academic purposes only and is intended to demonstrate technical capabilities. Some examples are sourced from the internet. If any content infringes on your rights, please contact us to request its removal.
 
1
+ ---
2
+ license: apache-2.0
3
+ language:
4
+ - en
5
+ pipeline_tag: text-to-audio
6
+ tags:
7
+ - music_generation
8
+ ---
9
+
10
  [//]: # (# InspireMusic)
11
  <p align="center">
12
  <a href="https://github.com/FunAudioLLM/InspireMusic" target="_blank">
 
27
  <a href="https://github.com/FunAudioLLM/InspireMusic" target="_blank">
28
  <img alt="Code" src="https://img.shields.io/badge/Code%20⭐-InspireMusic?labelColor=%20%237372EB&label=InspireMusic&color=%20%235462eb"></a>
29
 
30
+ <a href="https://modelscope.cn/models/iic/InspireMusic-1.5B-Long" target="_blank">
31
+ <img alt="Model" src="https://img.shields.io/badge/InspireMusic-Model-green"></a>
32
 
33
  <a href="https://arxiv.org/abs/" target="_blank">
34
  <img alt="Paper" src="https://img.shields.io/badge/arXiv-Paper-lightgrey"></a>
 
85
  <a name="What's News"></a>
86
  ## What's New 🔥
87
 
88
+ - 2025/01: Open-source [InspireMusic-Base](https://modelscope.cn/models/iic/InspireMusic/summary), [InspireMusic-Base-24kHz](https://modelscope.cn/models/iic/InspireMusic-Base-24kHz/summary), [InspireMusic-1.5B](https://modelscope.cn/models/iic/InspireMusic-1.5B/summary), [InspireMusic-1.5B-24kHz](https://modelscope.cn/models/iic/InspireMusic-1.5B-24kHz/summary), [InspireMusic-1.5B-Long](https://modelscope.cn/models/iic/InspireMusic-1.5B-Long/summary) models for music generation.
89
+ - 2024/12: Support to generate 48kHz audio with super resolution flow matching.
90
  - 2024/11: Welcome to preview 👉🏻 [**InspireMusic Demos**](https://iris2c.github.io/InspireMusic) 👈🏻. We're excited to share this with you and are working hard to bring even more features and models soon. Your support and feedback mean a lot to us!
91
  - 2024/11: We are thrilled to announce the open-sourcing of the **InspireMusic** [code repository](https://github.com/FunAudioLLM/InspireMusic) and [demos](https://iris2c.github.io/InspireMusic). **InspireMusic** is a unified framework for music, song, and audio generation, featuring capabilities such as text-to-music conversion, music structure, genre control, and timestamp management. InspireMusic stands out for its exceptional music generation and instruction-following abilities.
92
 
 
160
  ``` sh
161
  # git模型下载,请确保已安装git lfs
162
  mkdir -p pretrained_models
163
+ git clone https://www.modelscope.cn/iic/InspireMusic-1.5B-Long.git pretrained_models/InspireMusic
164
  ```
165
 
166
+ ### Available Models
167
+ Currently, we open source the music generation models support 24KHz mono and 48KHz stereo audio.
168
+ The table below presents the links to the ModelScope and Huggingface model hub. More models will be available soon.
169
+
170
+ | Model name | Model Links | Remarks |
171
+ |-------------------------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------|
172
+ | InspireMusic-Base-24kHz | [![model](https://img.shields.io/badge/ModelScope-Model-green.svg)](https://modelscope.cn/models/iic/InspireMusic-Base-24kHz/summary) [![model](https://img.shields.io/badge/HuggingFace-Model-green.svg)](https://huggingface.co/FunAudioLLM/InspireMusic-Base-24kHz) | Pre-trained Music Generation Model, 24kHz mono |
173
+ | InspireMusic-Base | [![model](https://img.shields.io/badge/ModelScope-Model-green.svg)](https://modelscope.cn/models/iic/InspireMusic-Base/summary) [![model](https://img.shields.io/badge/HuggingFace-Model-green.svg)](https://huggingface.co/FunAudioLLM/InspireMusic-Base) | Pre-trained Music Generation Model, 48kHz |
174
+ | InspireMusic-1.5B-24kHz | [![model](https://img.shields.io/badge/ModelScope-Model-green.svg)](https://modelscope.cn/models/iic/InspireMusic-1.5B-24kHz/summary) [![model](https://img.shields.io/badge/HuggingFace-Model-green.svg)](https://huggingface.co/FunAudioLLM/InspireMusic-1.5B-24kHz) | Pre-trained Music Generation 1.5B Model, 24kHz mono |
175
+ | InspireMusic-1.5B | [![model](https://img.shields.io/badge/ModelScope-Model-green.svg)](https://modelscope.cn/models/iic/InspireMusic-1.5B/summary) [![model](https://img.shields.io/badge/HuggingFace-Model-green.svg)](https://huggingface.co/FunAudioLLM/InspireMusic-1.5B) | Pre-trained Music Generation 1.5B Model, 48kHz |
176
+ | InspireMusic-1.5B-Long | [![model](https://img.shields.io/badge/ModelScope-Model-green.svg)](https://modelscope.cn/models/iic/InspireMusic-1.5B-Long/summary) [![model](https://img.shields.io/badge/HuggingFace-Model-green.svg)](https://huggingface.co/FunAudioLLM/InspireMusic-1.5B-Long) | Pre-trained Music Generation 1.5B Model, 48kHz, support long audio |
177
+ | InspireSong-1.5B | [![model](https://img.shields.io/badge/ModelScope-Model-lightgrey.svg)]() [![model](https://img.shields.io/badge/HuggingFace-Model-lightgrey.svg)]() | Pre-trained Song Generation 1.5B Model, 48kHz stereo |
178
+ | InspireAudio-1.5B | [![model](https://img.shields.io/badge/ModelScope-Model-lightgrey.svg)]() [![model](https://img.shields.io/badge/HuggingFace-Model-lightgrey.svg)]() | Pre-trained Audio Generation 1.5B Model, 48kHz stereo |
179
+
180
  ## Basic Usage
181
 
182
  At the moment, InspireMusic contains the training code and inference code for [music generation](https://github.com/FunAudioLLM/InspireMusic/tree/main/examples/music_generation). More tasks such as song generation and audio generation will be supported in future.
 
279
  done
280
  ```
281
 
 
282
  ## Disclaimer
283
  The content provided above is for academic purposes only and is intended to demonstrate technical capabilities. Some examples are sourced from the internet. If any content infringes on your rights, please contact us to request its removal.