Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,69 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
base_model:
|
3 |
+
- THUDM/CogVideoX1.5-5b
|
4 |
+
datasets: finetrainers/crush-smol
|
5 |
+
library_name: diffusers
|
6 |
+
license: other
|
7 |
+
license_link: https://huggingface.co/THUDM/CogVideoX1.5-5b/blob/main/LICENSE
|
8 |
+
widget:
|
9 |
+
- text: >-
|
10 |
+
PIKA_CRUSH A red toy car is being crushed by a large hydraulic press, which is flattening objects as if they were under a hydraulic press.
|
11 |
+
output:
|
12 |
+
url: final-3000-0-2-PIKA_CRUSH-A-red-toy-car-.mp4
|
13 |
+
- text: >-
|
14 |
+
PIKA_CRUSH A large metal cylinder is seen pressing down on a pile of colorful jelly beans, flattening them as if they were under a hydraulic press.
|
15 |
+
output:
|
16 |
+
url: final-3000-0-2-PIKA_CRUSH-A-large-metal-.mp4
|
17 |
+
- text: >-
|
18 |
+
PIKA_CRUSH A large metal cylinder is seen pressing down on a pile of Oreo cookies, flattening them as if they were under a hydraulic press.
|
19 |
+
output:
|
20 |
+
url: final-3000-1-2-PIKA_CRUSH-A-large-metal-.mp4
|
21 |
+
tags:
|
22 |
+
- text-to-video
|
23 |
+
- diffusers-training
|
24 |
+
- diffusers
|
25 |
+
- cogvideox
|
26 |
+
- cogvideox-diffusers
|
27 |
+
- template:sd-lora
|
28 |
+
---
|
29 |
+
|
30 |
+
<Gallery />
|
31 |
+
|
32 |
+
This is a LoRA fine-tune of the [THUDM/CogVideoX1.5-5b](https://huggingface.co/THUDM/CogVideoX1.5-5b) model on the
|
33 |
+
[finetrainers/crush-smol](https://huggingface.co/datasets/finetrainers/crush-smol) dataset.
|
34 |
+
|
35 |
+
Code: https://github.com/a-r-r-o-w/finetrainers
|
36 |
+
|
37 |
+
> [!IMPORTANT]
|
38 |
+
> This is an experimental checkpoint and its poor generalization is well-known.
|
39 |
+
|
40 |
+
Inference code:
|
41 |
+
|
42 |
+
```py
|
43 |
+
from diffusers import CogVideoXTransformer3DModel, DiffusionPipeline
|
44 |
+
from diffusers.utils import export_to_video
|
45 |
+
import torch
|
46 |
+
|
47 |
+
pipeline = DiffusionPipeline.from_pretrained(
|
48 |
+
"THUDM/CogVideoX1.5-5b", torch_dtype=torch.bfloat16
|
49 |
+
).to("cuda")
|
50 |
+
pipeline.load_lora_weights("finetrainers/CogVideoX-1.5-crush-smol-v0", adapter_name="cogvideox-lora")
|
51 |
+
pipeline.set_adapters("cogvideox-lora", 0.9)
|
52 |
+
|
53 |
+
prompt = """
|
54 |
+
PIKA_CRUSH A red toy car is being crushed by a large hydraulic press, which is flattening objects as if they were under a hydraulic press.
|
55 |
+
"""
|
56 |
+
negative_prompt = "inconsistent motion, blurry motion, worse quality, degenerate outputs, deformed outputs"
|
57 |
+
|
58 |
+
video = pipeline(
|
59 |
+
prompt=prompt,
|
60 |
+
negative_prompt=negative_prompt,
|
61 |
+
num_frames=81,
|
62 |
+
height=480,
|
63 |
+
width=768,
|
64 |
+
num_inference_steps=50
|
65 |
+
).frames[0]
|
66 |
+
export_to_video(video, "output.mp4", fps=25)
|
67 |
+
```
|
68 |
+
|
69 |
+
Training logs are available on WandB [here](https://wandb.ai/aryanvs/finetrainers-cogvideox?nw=nwuseraryanvs).
|