Commit bf174c0 (verified) by a-r-r-o-w (HF staff)
1 Parent(s): 0ea0023

Create README.md

Files changed (1)
  1. README.md +69 -0
README.md ADDED
@@ -0,0 +1,69 @@
+ ---
+ base_model:
+ - THUDM/CogVideoX1.5-5b
+ datasets: finetrainers/crush-smol
+ library_name: diffusers
+ license: other
+ license_link: https://huggingface.co/THUDM/CogVideoX1.5-5b/blob/main/LICENSE
+ widget:
+ - text: >-
+     PIKA_CRUSH A red toy car is being crushed by a large hydraulic press, which is flattening objects as if they were under a hydraulic press.
+   output:
+     url: final-3000-0-2-PIKA_CRUSH-A-red-toy-car-.mp4
+ - text: >-
+     PIKA_CRUSH A large metal cylinder is seen pressing down on a pile of colorful jelly beans, flattening them as if they were under a hydraulic press.
+   output:
+     url: final-3000-0-2-PIKA_CRUSH-A-large-metal-.mp4
+ - text: >-
+     PIKA_CRUSH A large metal cylinder is seen pressing down on a pile of Oreo cookies, flattening them as if they were under a hydraulic press.
+   output:
+     url: final-3000-1-2-PIKA_CRUSH-A-large-metal-.mp4
+ tags:
+ - text-to-video
+ - diffusers-training
+ - diffusers
+ - cogvideox
+ - cogvideox-diffusers
+ - template:sd-lora
+ ---
+
+ <Gallery />
+
+ This is a LoRA fine-tune of the [THUDM/CogVideoX1.5-5b](https://huggingface.co/THUDM/CogVideoX1.5-5b) model on the
+ [finetrainers/crush-smol](https://huggingface.co/datasets/finetrainers/crush-smol) dataset.
+
+ Code: https://github.com/a-r-r-o-w/finetrainers
+
+ > [!IMPORTANT]
+ > This is an experimental checkpoint, and it is known to generalize poorly.
+
+ Inference code:
+
+ ```py
+ from diffusers import DiffusionPipeline
+ from diffusers.utils import export_to_video
+ import torch
+
+ pipeline = DiffusionPipeline.from_pretrained(
+     "THUDM/CogVideoX1.5-5b", torch_dtype=torch.bfloat16
+ ).to("cuda")
+ pipeline.load_lora_weights("finetrainers/CogVideoX-1.5-crush-smol-v0", adapter_name="cogvideox-lora")
+ pipeline.set_adapters("cogvideox-lora", 0.9)  # apply the LoRA at 0.9 strength
+
+ prompt = """
+ PIKA_CRUSH A red toy car is being crushed by a large hydraulic press, which is flattening objects as if they were under a hydraulic press.
+ """
+ negative_prompt = "inconsistent motion, blurry motion, worse quality, degenerate outputs, deformed outputs"
+
+ video = pipeline(
+     prompt=prompt,
+     negative_prompt=negative_prompt,
+     num_frames=81,
+     height=480,
+     width=768,
+     num_inference_steps=50,
+ ).frames[0]
+ export_to_video(video, "output.mp4", fps=25)
+ ```
68
+
69
+ Training logs are available on WandB [here](https://wandb.ai/aryanvs/finetrainers-cogvideox?nw=nwuseraryanvs).
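
The `set_adapters("cogvideox-lora", 0.9)` call in the inference code controls how strongly the LoRA influences the base model. As a rough illustration only, assuming the standard LoRA formulation (effective weight = base weight + scale × B·A) rather than the exact diffusers/PEFT internals, the sketch below applies a scaled low-rank update to a toy 2×2 weight in plain Python. The matrices and the helper names `matmul` and `apply_lora` are hypothetical and exist only for this example.

```python
# Toy illustration of a scaled LoRA update: W_eff = W + scale * (B @ A).
# All values and helper names are hypothetical; real adapters typically also
# fold in an alpha / rank factor, which is omitted here for brevity.

def matmul(x, y):
    """Multiply two matrices given as lists of rows."""
    return [
        [sum(x[i][k] * y[k][j] for k in range(len(y))) for j in range(len(y[0]))]
        for i in range(len(x))
    ]

def apply_lora(w, b, a, scale):
    """Return w + scale * (b @ a) without modifying the inputs."""
    delta = matmul(b, a)
    return [
        [w[i][j] + scale * delta[i][j] for j in range(len(w[0]))]
        for i in range(len(w))
    ]

# Base weight (2x2) and a rank-1 update: B is 2x1, A is 1x2.
W = [[1.0, 0.0], [0.0, 1.0]]
B = [[1.0], [2.0]]
A = [[0.5, 0.5]]

full = apply_lora(W, B, A, scale=1.0)    # adapter at full strength
damped = apply_lora(W, B, A, scale=0.9)  # same scale value as the inference code
off = apply_lora(W, B, A, scale=0.0)     # scale 0 recovers the base weight
```

Lowering the scale toward 0 moves the result back toward the base weight, which is one way to think about trading the strength of the crush effect against the base model's behavior.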