pharaouk Doubiiu commited on
Commit
8069fab
·
verified ·
0 Parent(s):

Duplicate from Doubiiu/DynamiCrafter_1024

Browse files

Co-authored-by: Jinbo Xing <Doubiiu@users.noreply.huggingface.co>

.gitattributes ADDED
@@ -0,0 +1,45 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ *.7z filter=lfs diff=lfs merge=lfs -text
2
+ *.arrow filter=lfs diff=lfs merge=lfs -text
3
+ *.bin filter=lfs diff=lfs merge=lfs -text
4
+ *.bz2 filter=lfs diff=lfs merge=lfs -text
5
+ *.ckpt filter=lfs diff=lfs merge=lfs -text
6
+ *.ftz filter=lfs diff=lfs merge=lfs -text
7
+ *.gz filter=lfs diff=lfs merge=lfs -text
8
+ *.h5 filter=lfs diff=lfs merge=lfs -text
9
+ *.joblib filter=lfs diff=lfs merge=lfs -text
10
+ *.lfs.* filter=lfs diff=lfs merge=lfs -text
11
+ *.mlmodel filter=lfs diff=lfs merge=lfs -text
12
+ *.model filter=lfs diff=lfs merge=lfs -text
13
+ *.msgpack filter=lfs diff=lfs merge=lfs -text
14
+ *.npy filter=lfs diff=lfs merge=lfs -text
15
+ *.npz filter=lfs diff=lfs merge=lfs -text
16
+ *.onnx filter=lfs diff=lfs merge=lfs -text
17
+ *.ot filter=lfs diff=lfs merge=lfs -text
18
+ *.parquet filter=lfs diff=lfs merge=lfs -text
19
+ *.pb filter=lfs diff=lfs merge=lfs -text
20
+ *.pickle filter=lfs diff=lfs merge=lfs -text
21
+ *.pkl filter=lfs diff=lfs merge=lfs -text
22
+ *.pt filter=lfs diff=lfs merge=lfs -text
23
+ *.pth filter=lfs diff=lfs merge=lfs -text
24
+ *.rar filter=lfs diff=lfs merge=lfs -text
25
+ *.safetensors filter=lfs diff=lfs merge=lfs -text
26
+ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
27
+ *.tar.* filter=lfs diff=lfs merge=lfs -text
28
+ *.tar filter=lfs diff=lfs merge=lfs -text
29
+ *.tflite filter=lfs diff=lfs merge=lfs -text
30
+ *.tgz filter=lfs diff=lfs merge=lfs -text
31
+ *.wasm filter=lfs diff=lfs merge=lfs -text
32
+ *.xz filter=lfs diff=lfs merge=lfs -text
33
+ *.zip filter=lfs diff=lfs merge=lfs -text
34
+ *.zst filter=lfs diff=lfs merge=lfs -text
35
+ *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ bike_chineseink.gif filter=lfs diff=lfs merge=lfs -text
37
+ firework03.gif filter=lfs diff=lfs merge=lfs -text
38
+ girl07.gif filter=lfs diff=lfs merge=lfs -text
39
+ isometric.gif filter=lfs diff=lfs merge=lfs -text
40
+ robot01.gif filter=lfs diff=lfs merge=lfs -text
41
+ ship02.gif filter=lfs diff=lfs merge=lfs -text
42
+ DynamiCrafter-1024-2.webp filter=lfs diff=lfs merge=lfs -text
43
+ DynamiCrafter-1024.webp filter=lfs diff=lfs merge=lfs -text
44
+ DynamiCrafter-1024-21.webp filter=lfs diff=lfs merge=lfs -text
45
+ DynamiCrafter-10241.webp filter=lfs diff=lfs merge=lfs -text
DynamiCrafter-1024-21.webp ADDED

Git LFS Details

  • SHA256: 080fea5d83648388cbf1588a26518c7ea4626166a1287deaf0b14a6801a1c99d
  • Pointer size: 132 Bytes
  • Size of remote file: 1.89 MB
DynamiCrafter-10241.webp ADDED

Git LFS Details

  • SHA256: ff1254c9ed1ac932040b7dbedb8ff4be0e8b37f2d43e33a42d8406b70163378d
  • Pointer size: 132 Bytes
  • Size of remote file: 1.69 MB
README.md ADDED
@@ -0,0 +1,66 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ # For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
3
+ # Doc / guide: https://huggingface.co/docs/hub/model-cards
4
+ {}
5
+ ---
6
+
7
+ # DynamiCrafter (576x1024) (text-)Image-to-Video/Image Animation Model Card
8
+ ![row01](DynamiCrafter-1024-21.webp)
9
+ ![row02](DynamiCrafter-10241.webp)
10
+ <!-- Provide a quick summary of what the model is/does. -->
11
+
12
+ DynamiCrafter (576x1024) (Text-)Image-to-Video is a video diffusion model that <br> takes in a still image as a conditioning image and text prompt describing dynamics,<br> and generates videos from it.
13
+
14
+ ## Model Details
15
+
16
+ ### Model Description
17
+
18
+ <!-- Provide a longer summary of what this model is. -->
19
+
20
+ DynamiCrafter, a (Text-)Image-to-Video/Image Animation approach, aims to generate <br>
21
+ short video clips (~2 seconds) from a conditioning image and text prompt.
22
+
23
+ This model was trained to generate 16 video frames at a resolution of 576x1024 <br>
24
+ given a context frame of the same resolution.
25
+
26
+
27
+ - **Developed by:** CUHK & Tencent AI Lab
28
+ - **Funded by:** CUHK & Tencent AI Lab
29
+ - **Model type:** Generative (text-)image-to-video model
30
+ - **Finetuned from model:** DynamiCrafter (320x512)
31
+
32
+ ### Model Sources
33
+
34
+ <!-- Provide the basic links for the model. -->
35
+ For research purpose, we recommend our Github repository (https://github.com/Doubiiu/DynamiCrafter), <br>
36
+ which includes the detailed implementations.
37
+ - **Repository:** https://github.com/Doubiiu/DynamiCrafter
38
+ - **Paper:** https://arxiv.org/abs/2310.12190
39
+ - **Demo1:** https://huggingface.co/spaces/Doubiiu/DynamiCrafter
40
+ - **Demo2:** https://replicate.com/camenduru/dynami-crafter-576x1024
41
+ ## Uses
42
+
43
+ <!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
44
+
45
+ ### Direct Use
46
+
47
+ <!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
48
+
49
+ We develop this repository for RESEARCH purposes, so it can only be used for personal/research/non-commercial purposes.
50
+
51
+
52
+
53
+ ## Limitations
54
+
55
+ <!-- This section is meant to convey both technical and sociotechnical limitations. -->
56
+ - The generated videos are relatively short (2 seconds, FPS=8).
57
+ - The model cannot render legible text.
58
+ - Faces and people in general may not be generated properly.
59
+ - The autoencoding part of the model is lossy, resulting in slight flickering artifacts.
60
+
61
+
62
+
63
+ ## How to Get Started with the Model
64
+
65
+ Check out https://github.com/Doubiiu/DynamiCrafter
66
+
model.ckpt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:97181be1431cc1c08fe31f8d0385a43c2beb1c7f36d25d2df301636f0c4f20f2
3
+ size 10437549158