metadata
base_model:
- THUDM/CogView4-6B
datasets:
- multimodalart/1920-raider-waite-tarot-public-domain
library_name: diffusers
license: other
license_link: https://huggingface.co/THUDM/CogView4-6B/blob/main/LICENSE
widget:
- text: >-
    TRTCRD a trtcrd of a knight mounting a running horse wearing an armor and
    holding a staff, \"knight of wands\"
  output:
    url: final-5000-0-2-TRTCRD-a-trtcrd-of-a-knig-1741227419.png
- text: >-
    TRTCRD a trtcrd of a woman sitting on a throne, wearing a crown and
    holding a trophee, \"queen of cups\"
  output:
    url: final-5000-1-2-TRTCRD-a-trtcrd-of-a-woma-1741227417.png
- text: >-
    TRTCRD a trtcrd of a person in a red robe holding a scale and giving coins
    to two kneeling figures, surrounded by six pentacles
  output:
    url: final-5000-1-2-TRTCRD-a-trtcrd-of-a-pers-1741227455.png
tags:
- text-to-image
- diffusers-training
- diffusers
- template:sd-lora
- cogview4

This is a LoRA fine-tune of the THUDM/CogView4-6B model, trained on the multimodalart/1920-raider-waite-tarot-public-domain dataset.
Training code: https://github.com/a-r-r-o-w/finetrainers
Inference code:
import torch
from diffusers import CogView4Pipeline

# Load the base model in bfloat16 and move it to the GPU.
pipe = CogView4Pipeline.from_pretrained(
    "THUDM/CogView4-6B", torch_dtype=torch.bfloat16
).to("cuda")

# Attach the LoRA weights under a named adapter and apply them at 0.9 strength.
pipe.load_lora_weights("finetrainers/CogView4-6B-rider-waite-tarot-v0", adapter_name="cogview4-lora")
pipe.set_adapters(["cogview4-lora"], [0.9])

# Replace the placeholder with your own prompt, then save the first generated image.
image = pipe("<my-awesome-prompt>").images[0]
image.save("output.png")
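The example prompts in this card all follow the pattern `TRTCRD a trtcrd of <description>`, optionally followed by a quoted card name. A minimal sketch of a helper that builds prompts in that style is shown below. The helper name `build_tarot_prompt` and its signature are hypothetical, and the prompt pattern is inferred from the examples rather than documented by the authors.

```python
from typing import Optional


def build_tarot_prompt(description: str, card_name: Optional[str] = None) -> str:
    """Build a prompt in the style of this card's example prompts.

    Assumption: the "TRTCRD a trtcrd of ..." prefix is inferred from the
    example prompts; this helper is illustrative and not part of any library.
    """
    prompt = f"TRTCRD a trtcrd of {description.strip()}"
    if card_name:
        # The examples append the card name in double quotes after a comma.
        prompt += f', "{card_name.strip()}"'
    return prompt


prompt = build_tarot_prompt(
    "a knight mounting a running horse wearing an armor and holding a staff",
    "knight of wands",
)
print(prompt)
```

The resulting string can be passed to the pipeline in place of `<my-awesome-prompt>`.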
Training logs are available on WandB.
NOTE: This checkpoint was trained with sigmas logit_normal weighting. For a checkpoint trained with shifted_sigmas logit_normal weighting, see https://huggingface.co/finetrainers/CogView4-6B-rider-waite-tarot-v0-shifted-sigmas
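For readers unfamiliar with the terms in the note, the sketch below illustrates the general idea of logit-normal sigma sampling: draw a standard normal value and pass it through a sigmoid, so that sampled noise levels concentrate around the middle of the (0, 1) range. It also shows one commonly used shift formula from flow-matching models. This is a generic illustration, not the finetrainers implementation; the function names, the default `shift=3.0`, and the exact shift formula are assumptions, and the definitions finetrainers uses for `sigmas` and `shifted_sigmas` may differ.

```python
import math
import random


def sample_logit_normal_sigmas(n, mean=0.0, std=1.0, seed=None):
    """Sample n sigmas in (0, 1) as sigmoid(z), with z ~ Normal(mean, std)."""
    rng = random.Random(seed)
    return [1.0 / (1.0 + math.exp(-rng.gauss(mean, std))) for _ in range(n)]


def shift_sigma(sigma, shift=3.0):
    """Apply a common flow-matching shift (assumed form, not from this card).

    With shift > 1 this moves sigmas toward 1 (noisier); shift == 1 is a no-op.
    """
    return shift * sigma / (1.0 + (shift - 1.0) * sigma)


sigmas = sample_logit_normal_sigmas(5, seed=0)
shifted = [shift_sigma(s) for s in sigmas]
for s, t in zip(sigmas, shifted):
    print(f"sigma={s:.3f} shifted={t:.3f}")
```

Under these assumptions, the two checkpoints would differ only in the distribution of noise levels seen during training, which is why they are published separately for comparison.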