MuPaD: Multimodal Pathology Diffusion Model
Collection
2 items • Updated
Please run demo.py for full demonstrations. This repo generates 256x256 images.
from diffusers import DiffusionPipeline
import torch
device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
pipeline = DiffusionPipeline.from_pretrained(
"xiangjx/MuPaD-256",
custom_pipeline="xiangjx/MuPaD-256",
trust_remote_code=True,
)
pipeline.to(device)
Generate histopathology images from a text prompt.
# Text-to-Image generation
prompt = "lung adenocarcinoma"
output_t2i = pipeline(
prompt=prompt,
modality="text",
num_images_per_prompt=4,
num_inference_steps=250,
guidance_scale=2.5,
guidance_high=0.75,
guidance_low=0.0,
mode="sde",
path_type="linear",
seed=42
)
for i, img in enumerate(output_t2i["images"]):
img.save(f"text2image_{i}.png")
Generate images conditioned on a reference image.
from PIL import Image
# Load reference image
# Ensure you have a reference image path
raw_image = Image.open("test_image.png").convert("RGB")
output_i2i = pipeline(
image=raw_image,
modality="image",
num_images_per_prompt=4,
num_inference_steps=250,
guidance_scale=2.5,
guidance_high=0.75,
guidance_low=0.0,
mode="sde",
path_type="linear",
seed=42
)
for i, img in enumerate(output_i2i["images"]):
img.save(f"image2image_{i}.png")