---
license: openrail
datasets:
- vucinatim/spectrogram-captions
language:
- en
library_name: diffusers
pipeline_tag: text-to-image
---
SDXL 1.0 fine-tuned on vucinatim/spectrogram-captions for 89 epochs (800 steps). It generates spectrograms for simple sounds. It does not yet produce very good sound effects, but I will train the model for longer in the future.
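
A minimal sketch of how a checkpoint like this might be loaded with `diffusers` to generate a spectrogram image from a text prompt. The repository id `your-username/sdxl-spectrogram` is a hypothetical placeholder, since this card does not state where the weights are published. The sketch also assumes the repository holds full SDXL pipeline weights rather than a LoRA adapter. The helper names, step count, and seed are illustrative choices, not settings from training.

```python
# Hypothetical repository id: the model card does not state where the weights live.
MODEL_ID = "your-username/sdxl-spectrogram"


def pick_device_and_dtype(cuda_available: bool) -> tuple[str, str]:
    """Return (device, dtype name): half precision on GPU, full precision on CPU."""
    return ("cuda", "float16") if cuda_available else ("cpu", "float32")


def generate_spectrogram(prompt: str, model_id: str = MODEL_ID, steps: int = 30, seed: int = 0):
    """Generate one spectrogram image (a PIL image) from a text prompt."""
    # Imported lazily so the helper above works without the heavy dependencies.
    import torch
    from diffusers import StableDiffusionXLPipeline

    device, dtype_name = pick_device_and_dtype(torch.cuda.is_available())

    # Assumption: the repository contains a complete SDXL pipeline.
    pipe = StableDiffusionXLPipeline.from_pretrained(
        model_id, torch_dtype=getattr(torch, dtype_name)
    ).to(device)

    # A fixed seed makes the output reproducible for a given prompt.
    generator = torch.Generator(device=device).manual_seed(seed)
    result = pipe(prompt=prompt, num_inference_steps=steps, generator=generator)
    return result.images[0]
```

With the real repository id substituted, `generate_spectrogram("a dog barking").save("spectrogram.png")` would write the generated spectrogram to disk as an image.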