Duplicate from Doubiiu/DynamiCrafter_1024
Co-authored-by: Jinbo Xing <Doubiiu@users.noreply.huggingface.co>
- .gitattributes +45 -0
- DynamiCrafter-1024-21.webp +3 -0
- DynamiCrafter-10241.webp +3 -0
- README.md +66 -0
- model.ckpt +3 -0
.gitattributes
ADDED
@@ -0,0 +1,45 @@
+*.7z filter=lfs diff=lfs merge=lfs -text
+*.arrow filter=lfs diff=lfs merge=lfs -text
+*.bin filter=lfs diff=lfs merge=lfs -text
+*.bz2 filter=lfs diff=lfs merge=lfs -text
+*.ckpt filter=lfs diff=lfs merge=lfs -text
+*.ftz filter=lfs diff=lfs merge=lfs -text
+*.gz filter=lfs diff=lfs merge=lfs -text
+*.h5 filter=lfs diff=lfs merge=lfs -text
+*.joblib filter=lfs diff=lfs merge=lfs -text
+*.lfs.* filter=lfs diff=lfs merge=lfs -text
+*.mlmodel filter=lfs diff=lfs merge=lfs -text
+*.model filter=lfs diff=lfs merge=lfs -text
+*.msgpack filter=lfs diff=lfs merge=lfs -text
+*.npy filter=lfs diff=lfs merge=lfs -text
+*.npz filter=lfs diff=lfs merge=lfs -text
+*.onnx filter=lfs diff=lfs merge=lfs -text
+*.ot filter=lfs diff=lfs merge=lfs -text
+*.parquet filter=lfs diff=lfs merge=lfs -text
+*.pb filter=lfs diff=lfs merge=lfs -text
+*.pickle filter=lfs diff=lfs merge=lfs -text
+*.pkl filter=lfs diff=lfs merge=lfs -text
+*.pt filter=lfs diff=lfs merge=lfs -text
+*.pth filter=lfs diff=lfs merge=lfs -text
+*.rar filter=lfs diff=lfs merge=lfs -text
+*.safetensors filter=lfs diff=lfs merge=lfs -text
+saved_model/**/* filter=lfs diff=lfs merge=lfs -text
+*.tar.* filter=lfs diff=lfs merge=lfs -text
+*.tar filter=lfs diff=lfs merge=lfs -text
+*.tflite filter=lfs diff=lfs merge=lfs -text
+*.tgz filter=lfs diff=lfs merge=lfs -text
+*.wasm filter=lfs diff=lfs merge=lfs -text
+*.xz filter=lfs diff=lfs merge=lfs -text
+*.zip filter=lfs diff=lfs merge=lfs -text
+*.zst filter=lfs diff=lfs merge=lfs -text
+*tfevents* filter=lfs diff=lfs merge=lfs -text
+bike_chineseink.gif filter=lfs diff=lfs merge=lfs -text
+firework03.gif filter=lfs diff=lfs merge=lfs -text
+girl07.gif filter=lfs diff=lfs merge=lfs -text
+isometric.gif filter=lfs diff=lfs merge=lfs -text
+robot01.gif filter=lfs diff=lfs merge=lfs -text
+ship02.gif filter=lfs diff=lfs merge=lfs -text
+DynamiCrafter-1024-2.webp filter=lfs diff=lfs merge=lfs -text
+DynamiCrafter-1024.webp filter=lfs diff=lfs merge=lfs -text
+DynamiCrafter-1024-21.webp filter=lfs diff=lfs merge=lfs -text
+DynamiCrafter-10241.webp filter=lfs diff=lfs merge=lfs -text
DynamiCrafter-1024-21.webp
ADDED
DynamiCrafter-10241.webp
ADDED
README.md
ADDED
@@ -0,0 +1,66 @@
+---
+# For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
+# Doc / guide: https://huggingface.co/docs/hub/model-cards
+{}
+---
+
+# DynamiCrafter (576x1024) (text-)Image-to-Video/Image Animation Model Card
+
+
+<!-- Provide a quick summary of what the model is/does. -->
+
+DynamiCrafter (576x1024) (Text-)Image-to-Video is a video diffusion model that <br> takes in a still image as a conditioning image and text prompt describing dynamics,<br> and generates videos from it.
+
+## Model Details
+
+### Model Description
+
+<!-- Provide a longer summary of what this model is. -->
+
+DynamiCrafter, a (Text-)Image-to-Video/Image Animation approach, aims to generate <br>
+short video clips (~2 seconds) from a conditioning image and text prompt.
+
+This model was trained to generate 16 video frames at a resolution of 576x1024 <br>
+given a context frame of the same resolution.
+
+
+- **Developed by:** CUHK & Tencent AI Lab
+- **Funded by:** CUHK & Tencent AI Lab
+- **Model type:** Generative (text-)image-to-video model
+- **Finetuned from model:** DynamiCrafter (320x512)
+
+### Model Sources
+
+<!-- Provide the basic links for the model. -->
+For research purpose, we recommend our Github repository (https://github.com/Doubiiu/DynamiCrafter), <br>
+which includes the detailed implementations.
+- **Repository:** https://github.com/Doubiiu/DynamiCrafter
+- **Paper:** https://arxiv.org/abs/2310.12190
+- **Demo1:** https://huggingface.co/spaces/Doubiiu/DynamiCrafter
+- **Demo2:** https://replicate.com/camenduru/dynami-crafter-576x1024
+## Uses
+
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+
+### Direct Use
+
+<!-- This section is for the model use without fine-tuning or plugging into a larger ecosystem/app. -->
+
+We develop this repository for RESEARCH purposes, so it can only be used for personal/research/non-commercial purposes.
+
+
+
+## Limitations
+
+<!-- This section is meant to convey both technical and sociotechnical limitations. -->
+- The generated videos are relatively short (2 seconds, FPS=8).
+- The model cannot render legible text.
+- Faces and people in general may not be generated properly.
+- The autoencoding part of the model is lossy, resulting in slight flickering artifacts.
+
+
+
+## How to Get Started with the Model
+
+Check out https://github.com/Doubiiu/DynamiCrafter
+
model.ckpt
ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:97181be1431cc1c08fe31f8d0385a43c2beb1c7f36d25d2df301636f0c4f20f2
+size 10437549158
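Note on the `model.ckpt` pointer: the three added lines are a Git LFS pointer, not the checkpoint itself. The `oid` is the SHA-256 of the real file and `size` is its length in bytes (about 10.4 GB here). The following is a minimal sketch of how a downloaded checkpoint could be checked against this pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and introduced only for illustration; they are not part of this repository or of any Git LFS tooling.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and digest in `pointer`."""
    path = Path(path)
    # Cheap check first: a size mismatch avoids hashing a multi-gigabyte file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so the whole checkpoint never has to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents exactly as added in this commit.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:97181be1431cc1c08fe31f8d0385a43c2beb1c7f36d25d2df301636f0c4f20f2
size 10437549158
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
```

Assuming the checkpoint has been downloaded to a local path such as `model.ckpt` (the local path is an assumption), `matches_pointer("model.ckpt", pointer)` would report whether the download is complete and uncorrupted.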