title: README
emoji: 🦀
colorFrom: blue
colorTo: pink
sdk: static
pinned: false

Neubla Optimized Model Repository

Model categories
- Vision
- NLP

Compression techniques
- Pruning: channel-pruning, 2:4 semi-structured, etc.
- Quantization: W8A8, W4AF16, WF8AF8, etc.
- Parameter-efficient finetuning: PEFT, etc.
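
For readers new to the terms above, here is a minimal NumPy sketch of two of the listed techniques: 2:4 semi-structured pruning (keep the two largest-magnitude values in every group of four weights) and symmetric 8-bit quantization, the kind of scheme that "W8A8" (8-bit weights, 8-bit activations) usually refers to. It is an illustration only and is not taken from this repository's tooling. The function names `prune_2_4`, `quantize_int8`, and `dequantize` are hypothetical, and per-tensor scaling is an assumption made for brevity.

```python
import numpy as np


def prune_2_4(weights: np.ndarray) -> np.ndarray:
    """Zero the two smallest-magnitude values in each group of four (last axis)."""
    if weights.shape[-1] % 4 != 0:
        raise ValueError("last dimension must be a multiple of 4")
    groups = weights.reshape(-1, 4)
    # Column indices of the two smallest |w| within each group of four.
    drop = np.argsort(np.abs(groups), axis=1)[:, :2]
    pruned = groups.copy()
    np.put_along_axis(pruned, drop, 0.0, axis=1)
    return pruned.reshape(weights.shape)


def quantize_int8(x: np.ndarray):
    """Symmetric per-tensor int8 quantization (assumed scheme): x ~= q * scale."""
    max_abs = float(np.max(np.abs(x)))
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    q = np.clip(np.round(x / scale), -127, 127).astype(np.int8)
    return q, scale


def dequantize(q: np.ndarray, scale: float) -> np.ndarray:
    """Map int8 codes back to float32 values."""
    return q.astype(np.float32) * scale


# Toy weight matrix, made up for the example.
w = np.array([[0.9, -0.1, 0.05, -0.7],
              [0.2, 0.3, -0.4, 0.01]], dtype=np.float32)

sparse_w = prune_2_4(w)              # exactly two nonzeros per group of four
q, scale = quantize_int8(sparse_w)   # int8 codes plus one float scale
restored = dequantize(q, scale)      # approximate reconstruction
print(sparse_w)
print(q, scale)
```

The same quantizer can be applied to activations at inference time to complete a W8A8 pipeline; production implementations typically use per-channel or per-group scales and calibration data, which this sketch leaves out.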