CXR LLaVA

Forked from: https://github.com/ECOFRI/CXR_LLaVA

Multimodal Large Language Model Fine-Tuned for Chest X-ray Images

CXR LLaVA is an open-source, multimodal large language model designed to generate radiologic reports from chest X-ray images.

  • Arxiv Preprint Paper: Explore the detailed scientific background of CXR LLaVA on Arxiv.
  • Demo Website: Experience the model in action at Radiologist App.
| Version | Input CXR resolution | Channels | Vision Encoder | Base LLM | Weight |
|---|---|---|---|---|---|
| v1.0 | 512x512 | RGB | RN50 | LLAMA2-13B-CHAT | Deprecated |
| v2.0 (latest) | 512x512 | Grayscale | ViT-L/16 | LLAMA2-7B-CHAT | Link |
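Per the table above, the v2 model takes 512x512 single-channel (grayscale) chest X-rays. A minimal preprocessing sketch under those assumptions; the helper name and resampling choice are illustrative, not the model's official pipeline:

```python
from PIL import Image


def preprocess_cxr(path_or_image, size=512):
    """Convert a chest X-ray to the 512x512 grayscale format expected by v2.

    Illustrative helper, not the model's official preprocessing code.
    """
    img = (
        path_or_image
        if isinstance(path_or_image, Image.Image)
        else Image.open(path_or_image)
    )
    img = img.convert("L")        # single-channel grayscale
    img = img.resize((size, size))  # Pillow's default bicubic resampling
    return img


# Example with a synthetic image standing in for a real CXR
dummy = Image.new("RGB", (2048, 2500), color=(30, 30, 30))
processed = preprocess_cxr(dummy)
print(processed.size, processed.mode)  # (512, 512) L
```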
Model weights: 7.05B parameters, BF16, Safetensors format.
Note: the model is not available via the supported Inference Providers, because the HF Inference API does not support models that require custom code execution.
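Because the repository ships custom modeling code, loading it locally requires opting into remote code execution. A hedged sketch using the Transformers library; the auto class and loading arguments are assumptions to verify against the repository's custom code:

```python
def load_model(repo_id="jcsagar/CXR-LLAVA-v2"):
    """Load CXR LLaVA with its custom modeling code.

    Sketch only: the correct Auto class depends on the repo's auto_map,
    and the first call downloads the full BF16 weights (~14 GB).
    """
    from transformers import AutoModel  # deferred: heavy dependency

    return AutoModel.from_pretrained(
        repo_id,
        torch_dtype="auto",       # keep the published BF16 precision
        trust_remote_code=True,   # required: runs the repo's modeling code
    )
```

In practice a CUDA GPU is needed for inference at this model size.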
