Spaces:
Runtime error
Runtime error
title: MultiModal Phi2 | |
emoji: π | |
colorFrom: blue | |
colorTo: red | |
sdk: gradio | |
sdk_version: 3.35.2 | |
app_file: app.py | |
pinned: false | |
license: mit | |
## Phi2 : Multimodal Finetuning | |
### Details | |
1. LLM Backbone: Phi2 | |
2. Vision Tower: clip-vit-large-patch14-336 | |
3. Audio Model: Whisper | |
4. Pretraining Dataset: LAION-CC-SBU dataset with BLIP captions(200k samples) | |
5. Finetuning Dataset: Instruct 150k dataset based on COCO | |
### Design | |
 | |
### Pretraining | |
#### Training Loss Curve | |
 | |
#### Learing Rate | |
 | |
#### Training Logs | |
 | |
### Finetuning | |
#### Training Loss Curve | |
 | |
#### Learing Rate | |
 | |
#### Training Logs | |
 | |
### Results | |
 | |