---
base_model:
- facebook/musicgen-small
library_name: transformers
license: cc-by-4.0
pipeline_tag: text-to-audio
tags:
- music
- art
---

# tasty-musicgen-small

[License: CC BY 4.0](https://creativecommons.org/licenses/by/4.0/)
[arXiv:2503.02823](https://arxiv.org/abs/2503.02823)

tasty-musicgen-small is a [musicgen-small](https://huggingface.co/facebook/musicgen-small) model fine-tuned on a [patched version](https://github.com/matteospanio/taste-music-dataset) of the [Taste & Affect Music Database](https://osf.io/2cqa5/). It generates music intended to induce gustatory (taste-related) synesthetic perceptions, building on multimodal perception research. The model produces mono audio at 32 kHz.

## Code and Dataset

The code and the dataset used to train this model are available at https://osf.io/xs5jy/.

## How to use

Here is an example of how to use the model with the Transformers library. Inference is also possible with the AudioCraft library; for a detailed explanation, we suggest reading the [official MusicGen guide](https://huggingface.co/docs/transformers/main/model_doc/musicgen) by Hugging Face.

```python
import scipy.io.wavfile
from transformers import pipeline

# Load the fine-tuned model through the text-to-audio pipeline.
synthesiser = pipeline("text-to-audio", "csc-unipd/tasty-musicgen-small")

# Sampling (do_sample=True) yields more varied generations than greedy decoding.
music = synthesiser("sweet music for fine restaurants", forward_params={"do_sample": True})

# The pipeline returns the waveform and its sampling rate (32 kHz, mono).
scipy.io.wavfile.write("musicgen_out.wav", rate=music["sampling_rate"], data=music["audio"])
```
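The pipeline call above returns floating-point samples in `music["audio"]`, and writing them directly produces a float WAV file, which some players handle poorly. Below is a minimal sketch of converting the waveform to 16-bit PCM before writing; the sine wave stands in for real model output, and the `to_int16` helper is illustrative, not part of any library:

```python
import numpy as np
import scipy.io.wavfile

def to_int16(audio):
    """Clip a float waveform to [-1, 1] and scale it to 16-bit PCM."""
    return (np.clip(audio, -1.0, 1.0) * 32767).astype(np.int16)

# Synthetic stand-in for pipeline output: one second of a 440 Hz tone,
# mono, at the model's 32 kHz sampling rate.
sampling_rate = 32000
t = np.arange(sampling_rate) / sampling_rate
waveform = np.sin(2 * np.pi * 440 * t).astype(np.float32)

pcm = to_int16(waveform)
scipy.io.wavfile.write("musicgen_out.wav", rate=sampling_rate, data=pcm)
```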

## Citation

If you use this model, the code, or the data in your research, please cite the following article:

```bibtex
@misc{spanio2025multimodalsymphonyintegratingtaste,
  title={A Multimodal Symphony: Integrating Taste and Sound through Generative AI},
  author={Matteo Spanio and Massimiliano Zampini and Antonio Rodà and Franco Pierucci},
  year={2025},
  eprint={2503.02823},
  archivePrefix={arXiv},
  primaryClass={cs.SD},
  url={https://arxiv.org/abs/2503.02823},
}
```