ilyassmoummad
/

ProtoCLR

Feature Extraction

Model card Files Files and versions Community

ilyassmoummad commited on Nov 4, 2024

Commit

d2a2381

·

verified ·

1 Parent(s): 8e1e081

Update README.md

Files changed (1) hide show

README.md +9 -0

README.md CHANGED Viewed

@@ -1,3 +1,12 @@
 # ProtoCLR
 This repository contains a CvT-13 [Convolutional Vision Transformer](https://arxiv.org/abs/2103.15808) model trained from scratch on the [Xeno-Canto dataset](https://huggingface.co/datasets/ilyassmoummad/Xeno-Canto-6s-16khz), specifically on 6-second audio segments sampled at 16 kHz. The model is trained on Mel spectrograms of bird sounds using ProtoCLR [(Prototypical Contrastive Loss)](https://arxiv.org/abs/2409.08589) for 300 epochs and can be used as a feature extractor for bird audio classification and related tasks.

+---
+license: cc-by-nc-4.0
+datasets:
+- ilyassmoummad/Xeno-Canto-6s-16khz
+pipeline_tag: feature-extraction
+library_name: Pytorch
+tags:
+- Bioacoustics
+---
 # ProtoCLR
 This repository contains a CvT-13 [Convolutional Vision Transformer](https://arxiv.org/abs/2103.15808) model trained from scratch on the [Xeno-Canto dataset](https://huggingface.co/datasets/ilyassmoummad/Xeno-Canto-6s-16khz), specifically on 6-second audio segments sampled at 16 kHz. The model is trained on Mel spectrograms of bird sounds using ProtoCLR [(Prototypical Contrastive Loss)](https://arxiv.org/abs/2409.08589) for 300 epochs and can be used as a feature extractor for bird audio classification and related tasks.