metadata
license: mit
datasets:
  - ILSVRC/imagenet-1k

SAK

This repository hosts the checkpoints for our ICLR 2025 paper, "Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning".

Model Details

Model Description

  • Developed by: Yuxiang Lu, Shengcao Cao, Yu-Xiong Wang
  • License: MIT

Model Sources

Uses

This repository currently provides the pre-trained model checkpoints directly. For detailed usage instructions, please refer to our GitHub repository.

The available checkpoints are listed below:

Stage 1

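| Teachers                | Student backbone | Checkpoint |
|-------------------------|------------------|------------|
| DINOv2-B, CLIP-B, SAM-B | ViT-S            | BS_s1.pth  |
| DINOv2-B, CLIP-B, SAM-B | ViT-B            | BB_s1.pth  |
| DINOv2-L, CLIP-L, SAM-L | ViT-B            | LB_s1.pth  |
| DINOv2-L, CLIP-L, SAM-L | ViT-L            | LL_s1.pth  |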
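
As a convenience, the Stage 1 list above can be expressed as a small lookup, with a helper for inspecting a checkpoint once it has been downloaded. This is only a sketch: the names `STAGE1_CHECKPOINTS`, `stage1_checkpoint_name`, and `inspect_checkpoint` are hypothetical and not part of the SAK codebase, and the internal layout of the `.pth` files is not documented here, so the helper only reports top-level keys. For building and running the actual models, the GitHub repository remains the reference.

```python
from pathlib import Path

# Stage 1 checkpoints, transcribed from the list above.
# Key: (teacher size, student backbone) -> checkpoint filename.
STAGE1_CHECKPOINTS = {
    ("B", "ViT-S"): "BS_s1.pth",
    ("B", "ViT-B"): "BB_s1.pth",
    ("L", "ViT-B"): "LB_s1.pth",
    ("L", "ViT-L"): "LL_s1.pth",
}


def stage1_checkpoint_name(teacher_size: str, student_backbone: str) -> str:
    """Return the Stage 1 checkpoint filename for a teacher/student pair.

    teacher_size is "B" or "L" (the size shared by the DINOv2, CLIP and
    SAM teachers); student_backbone is "ViT-S", "ViT-B" or "ViT-L".
    """
    key = (teacher_size, student_backbone)
    if key not in STAGE1_CHECKPOINTS:
        raise KeyError(f"No Stage 1 checkpoint listed for {key}")
    return STAGE1_CHECKPOINTS[key]


def inspect_checkpoint(path):
    """Load a downloaded checkpoint on CPU and list its top-level keys.

    Assumption: the file is a standard PyTorch checkpoint. If it is not
    a dict, the type name is returned instead.
    """
    import torch  # imported lazily so the lookup works without PyTorch

    ckpt = torch.load(Path(path), map_location="cpu")
    if isinstance(ckpt, dict):
        return list(ckpt.keys())
    return type(ckpt).__name__
```

For example, `stage1_checkpoint_name("B", "ViT-B")` returns `"BB_s1.pth"`, the checkpoint used to initialize Stage 2 training. Depending on the PyTorch version and on what the file contains, `torch.load` may require `weights_only=False`; only use that option for files you trust.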

Stage 2

We provide two example checkpoints obtained after Stage 2 training, both initialized from the Stage 1 checkpoint BB_s1.pth:

Citation

@inproceedings{lu2025swiss,
  title={Swiss Army Knife: Synergizing Biases in Knowledge from Vision Foundation Models for Multi-Task Learning},
  author={Yuxiang Lu and Shengcao Cao and Yu-Xiong Wang},
  booktitle={The Thirteenth International Conference on Learning Representations},
  year={2025}
}