Model Description

SkinSAM is on the 12-layer ViT-b model, the mask decoder module of SAM is fine-tuned on a combined dataset of ISIC and PH2 skin lesion images and masks. SkinSAM was trained on an Nvidia Tesla A100 40GB GPU.

Some of the notable results taken:
ISIC Dataset:

IOU 78.25%
Pixel Accuracy 92.18%
F1 Score 87.47%

PH2 Dataset:

IOU 86.68%
Pixel Accuracy 93.33%
F1 Score 93.95%

Downloads last month: 143

Safetensors

Model size

93.7M params

Tensor type

F32

Inference Providers NEW

Mask Generation

This model is not currently available via any of the supported third-party Inference Providers, and the HF Inference API does not support transformers models with pipeline type mask-generation

ahishamm
/

skinsam

Model Description

Dataset used to train ahishamm/skinsam

Space using ahishamm/skinsam 1