ColonGPT (A colonoscopy-specific multimodal Language Model)


The Gradio Web UI allows you to use our examples or upload your images for inference.

📖 Paper | 🏠 Home

These are the weights from the pre-alignment stage of ColonGPT-v1.

ColonGPT is a standard multimodal language model with four basic components: a language tokenizer, a visual encoder (🤗 SigLIP-SO), a multimodal connector, and a language model (🤗 Phi1.5). This Hugging Face page provides a quick start for the convenience of new users. For further details about ColonGPT, we highly recommend visiting our homepage, where you'll find comprehensive usage instructions for our model and the latest advancements in intelligent colonoscopy technology.
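To make the role of the four components concrete, here is a minimal, dependency-free sketch of how data might flow through such a model. It is not the ColonGPT implementation: every function name, the toy vocabulary, and the tiny dimensions are hypothetical placeholders. Modelling the connector as a single linear map and placing visual tokens before text tokens are both assumptions made for illustration.

```python
import random

# Toy sizes chosen only for illustration; they are NOT ColonGPT's real dimensions.
VISION_DIM = 4    # width of each patch feature from the visual encoder
TEXT_DIM = 6      # width of the language model's token embeddings
NUM_PATCHES = 3   # number of patch features produced per image
VOCAB = {"<unk>": 0, "describe": 1, "the": 2, "polyp": 3}  # hypothetical vocabulary

rng = random.Random(0)

# Random stand-ins for learned weights.
CONNECTOR_W = [[rng.random() for _ in range(TEXT_DIM)] for _ in range(VISION_DIM)]
EMBEDDINGS = [[rng.random() for _ in range(TEXT_DIM)] for _ in VOCAB]


def tokenize(prompt):
    """Component 1, language tokenizer: whitespace split plus vocabulary lookup."""
    return [VOCAB.get(word, VOCAB["<unk>"]) for word in prompt.lower().split()]


def encode_image(image):
    """Component 2, visual encoder: NUM_PATCHES vectors of width VISION_DIM.

    The image argument is ignored here; a real encoder would compute features from it.
    """
    return [[rng.random() for _ in range(VISION_DIM)] for _ in range(NUM_PATCHES)]


def connect(patch_features):
    """Component 3, multimodal connector: project VISION_DIM -> TEXT_DIM.

    Assumption: a plain linear map, used here only to show the change of width.
    """
    return [
        [sum(f[i] * CONNECTOR_W[i][j] for i in range(VISION_DIM)) for j in range(TEXT_DIM)]
        for f in patch_features
    ]


def build_lm_input(image, prompt):
    """Assemble the sequence that component 4, the language model, would consume."""
    visual_tokens = connect(encode_image(image))
    text_tokens = [EMBEDDINGS[i] for i in tokenize(prompt)]
    # Assumption: visual tokens are placed before the text tokens.
    return visual_tokens + text_tokens


sequence = build_lm_input(image=None, prompt="Describe the polyp")
print(len(sequence), len(sequence[0]))  # 6 6 (3 visual + 3 text tokens, each of width TEXT_DIM)
```

The point of the sketch is the shape bookkeeping: the connector exists so that visual features end up with the same width as text embeddings, which lets the language model treat both as one sequence.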

