|
# SECAP: Speech Emotion Captioning with Large Language Model |
|
|
|
This repository contains the implementation of the paper "SECap: Speech Emotion Captioning with Large Language Model". It includes the model code, training and testing scripts, and a test dataset. The test dataset consists of 600 wav audio files and their corresponding emotion descriptions. |
|
|
|
Please find more details at the GitHub repo[https://github.com/xuyaoxun/SECaps] |
|
|
|
## Checkpoint |
|
|
|
You can download the model checkpoint in this repo freely and put it in the main folder of SECaps. |
|
|
|
Meanwhile you will need to download the weights folder and also put it in the main folder of SECaps. |
|
|
|
## Citation |
|
|
|
If you use this repository in your research, please kindly cite our paper: |
|
|
|
@article{SECap, |
|
title={SECap: Speech Emotion Captioning with Large Language Model}, |
|
|
|
} |
|
|