Update README.md
README.md
CHANGED
@@ -4,4 +4,11 @@ license: apache-2.0
# FUTGA: Fine-grained Music Understanding through Temporally-enhanced Generative Augmentation
-<div align=center><img src="futga.jpg" height="256px" width="256px"/></div>
+<div align=center><img src="futga.jpg" height="256px" width="256px"/></div>
+
+## Overview
+FUTGA is an audio LLM with fine-grained music understanding, learning from generative augmentation with temporal compositions. By leveraging existing music caption datasets and large language models (LLMs), we synthesize detailed music captions with structural descriptions and time boundaries for full-length songs. This synthetic dataset enables FUTGA to identify temporal changes at key transition points and their musical functions, and to generate dense captions for full-length songs.