Spaces:

arhanv
/

drum-kit-generator

Running on CPU Upgrade

App Files Files Community

arhanv commited on 6 days ago

Commit

f6a3d7e

1 Parent(s): cb93854

created file structure, added dataset

Browse files

Files changed (8) hide show

.gitignore +3 -0
app.py +0 -0
audio_utils.py +0 -0
dataset/README.md +69 -0
dataset/analysis.zip +3 -0
dataset/licenses.txt +0 -0
dataset/one_shot_percussive_sounds.zip +3 -0
inference.py +0 -0

.gitignore ADDED Viewed

	@@ -0,0 +1,3 @@

+/.env
+._*
+/dataset/unzipped

app.py ADDED Viewed

File without changes

audio_utils.py ADDED Viewed

File without changes

dataset/README.md ADDED Viewed

	@@ -0,0 +1,69 @@

+# Freesound One-Shot Percussive Sounds Dataset
+This dataset contains 10254 one-shot (single event) percussive sounds from Freesound.org and the corresponding timbral analysis. These were used to train the generative model for "Neural Percussive Synthesis Parameterised by High-Level Timbral Features".
+## Dataset Construction
+To collect this dataset, the following steps were performed:
+* Freesound was queried with words associated with percussive instruments, such as "percussion", "kick", "wood" or "clave". Only sounds with less than one second of [effective duration](https://essentia.upf.edu/reference/std_EffectiveDuration.html) were selected.
+* This stage retrieved some audio clips that contained multiple sound events or that were of low quality.
+Therefore, we listened to all the retrieved sounds and manually discarded the sounds presenting one of these characteristics. For this, the [percussive-annotator](https://github.com/xavierfav/percussive-annotator) was used.
+* The sounds were then cut or padded to have 1-second length, normalized and downsampled to 16kHz.
+* Finally, the sounds were analyzed with the [AudioCommons Extractor](https://github.com/AudioCommons/ac-audio-extractor), to obtain the AudioCommons timbral descriptors. This information is contained in the 'analysis' folder.
+## Dataset Organisation
+The dataset contains two folders and two files in the root directory:
+* 'one_shot_percussive_sounds' encloses the pre-processed audio files. These are named '<freesound_sound_id>.wav'
+* 'analysis' holds the AudioCommons analysis files for each of the sounds in the dataset. This analysis is stored as a .json file, named '<freesound_sound_id>_analysis.json', with a key for each of the features extracted.
+* Two more files are present in the root directory of the dataset: this 'README' and the 'licenses.json'. The latter one is a '.json' file containing the name, the username of the uploader and the license for each of the sounds in the dataset.
+## Authors and Contact
+This dataset was developed by António Ramires, Pritish Chadna, Xavier Favory, Emilia Gómez and Xavier Serra.
+Any questions related to this dataset please contact:
+António Ramires
+[email protected]
+[email protected]
+## References
+Please cite this paper if you use this dataset:
+```
+@inproceedings{ramires2020,
+    author = "Antonio Ramires and Pritish Chandna and Xavier Favory and Emilia Gómez and Xavier Serra",
+    title = "Neural Percussive Synthesis Parametrerised by High-Level Timbral Features",
+    booktitle = "Proc. of the IEEE Int. Conf. on Acoustics, Speech and Signal Processing (ICASSP)",
+    year = "2020"
+}
+```
+## Acknowledgements
+This work has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No. 765068 (MIP-Frontiers).
+This work has received funding from the European Union's Horizon 2020 research and innovation programme under grant agreement No. 770376 (TROMPA).
+<img src="https://upload.wikimedia.org/wikipedia/commons/b/b7/Flag_of_Europe.svg" height="64" hspace="20">

dataset/analysis.zip ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e27faf24d3650e9541fb2f76c0ec7bd2be79672583c45aa17bc2cb830cb50fd8
+size 5610013

dataset/licenses.txt ADDED Viewed

The diff for this file is too large to render. See raw diff

dataset/one_shot_percussive_sounds.zip ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c45401b3cbdd56606f0d9e5e494a18efbae1ca830f835504dccc316c1934720c
+size 112614838

inference.py ADDED Viewed

File without changes