Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Feng Liu¹*, Shiwei Zhang², Xiaofeng Wang¹,³, Yujie Wei⁴, Haonan Qiu⁵
Yuzhong Zhao¹, Yingya Zhang², Qixiang Ye¹, Fang Wan¹†

¹University of Chinese Academy of Sciences, ²Alibaba Group
³Institute of Automation, Chinese Academy of Sciences
⁴Fudan University, ⁵Nanyang Technological University

(* Work done during an internship at Alibaba Group. † Corresponding author.)

If you like our project, please give us a star ⭐ on GitHub to get the latest updates.

Latest News 🔥

PRs that add support for other models are welcome. Please star ⭐ our project and stay tuned.
[2024/12/30] 🔥 Support Mochi and LTX-Video for Video Diffusion Models. Support Lumina-T2X for Image Diffusion Models.
[2024/12/27] 🔥 Support FLUX. TeaCache works well for Image Diffusion Models!
[2024/12/26] 🔥 Support ConsisID. Thanks @SHYuanBest. TeaCache can be easily adapted to models based on CogVideoX.
[2024/12/24] 🔥 Support HunyuanVideo.
[2024/12/19] 🔥 Support CogVideoX.
[2024/12/06] 🎉 Release the code of TeaCache. Support Open-Sora, Open-Sora-Plan and Latte.
[2024/11/28] 🎉 Release the paper of TeaCache.

Introduction

We introduce Timestep Embedding Aware Cache (TeaCache), a training-free caching approach that estimates and leverages the fluctuating differences among model outputs across timesteps, thereby accelerating the inference. For more details and visual results, please visit our project page.
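The general idea of threshold-based caching can be sketched as follows. This is a minimal illustration and not the repository's implementation: the `ThresholdCache` class, the `relative_l1` helper, the choice of a relative L1 distance as the change estimate, and the `threshold` value are all assumptions made for the example. The sketch accumulates an estimate of how much a per-timestep signal has changed and recomputes the model output only once that accumulated change reaches the threshold.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


def relative_l1(current: List[float], previous: List[float]) -> float:
    """Sum of absolute changes, relative to the magnitude of the previous signal."""
    num = sum(abs(c - p) for c, p in zip(current, previous))
    den = sum(abs(p) for p in previous)
    return num / den if den > 0 else float("inf")


@dataclass
class ThresholdCache:
    """Hypothetical helper: reuse the last output while accumulated change is small."""

    threshold: float
    accumulated: float = 0.0
    prev_signal: Optional[List[float]] = None
    cached_output: Optional[List[float]] = None
    computed_steps: int = 0

    def step(
        self,
        signal: List[float],
        model_fn: Callable[[List[float]], List[float]],
    ) -> List[float]:
        if self.prev_signal is None or self.cached_output is None:
            # Nothing cached yet: the first step always runs the model.
            must_compute = True
        else:
            # Accumulate the estimated change since the last full computation.
            self.accumulated += relative_l1(signal, self.prev_signal)
            must_compute = self.accumulated >= self.threshold
        self.prev_signal = signal
        if must_compute:
            self.cached_output = model_fn(signal)
            self.accumulated = 0.0
            self.computed_steps += 1
        return self.cached_output


if __name__ == "__main__":
    # Stand-in for an expensive denoising step (assumption for the demo).
    def toy_model(signal: List[float]) -> List[float]:
        return [2 * x for x in signal]

    cache = ThresholdCache(threshold=0.1)
    for signal in ([1.0, 1.0], [1.01, 1.01], [1.02, 1.02], [2.0, 2.0]):
        cache.step(signal, toy_model)
    print("full model calls:", cache.computed_steps)
```

A larger threshold skips more steps and trades output fidelity for speed; a smaller one stays closer to the uncached output.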

TeaCache for HunyuanVideo

Please refer to TeaCache4HunyuanVideo.

TeaCache for ConsisID

Please refer to TeaCache4ConsisID.

TeaCache for FLUX

Please refer to TeaCache4FLUX.

TeaCache for Mochi

Please refer to TeaCache4Mochi.

TeaCache for LTX-Video

Please refer to TeaCache4LTX-Video.

TeaCache for Lumina-T2X

Please refer to TeaCache4Lumina-T2X.

Installation

Prerequisites:

Python >= 3.10
PyTorch >= 1.13 (we recommend a version newer than 2.0)
CUDA >= 11.6

We strongly recommend using Anaconda to create a new environment (Python >= 3.10) to run our examples:

conda create -n teacache python=3.10 -y
conda activate teacache

Install TeaCache:

git clone https://github.com/LiewFeng/TeaCache
cd TeaCache
pip install -e .
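After installation, the prerequisites listed above can be checked from Python. The following is a small sketch and not part of the repository: the helper names `parse_version`, `meets_minimum`, and `check_environment` are hypothetical, and the script only compares version numbers against the stated minimums (Python 3.10, PyTorch 1.13, CUDA 11.6). It assumes PyTorch may or may not be importable and reports `False` for it if the import fails.

```python
import sys
from typing import Dict, Tuple


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn '2.1.0+cu118' into (2, 1, 0); stop at the first non-numeric part."""
    parts = []
    for piece in text.split("+")[0].split("."):
        if not piece.isdigit():
            break
        parts.append(int(piece))
    return tuple(parts)


def meets_minimum(found: str, minimum: str) -> bool:
    """Compare two dotted version strings numerically."""
    return parse_version(found) >= parse_version(minimum)


def check_environment() -> Dict[str, bool]:
    """Report whether each prerequisite from the README is satisfied."""
    report = {"python": sys.version_info[:2] >= (3, 10)}
    try:
        import torch
    except ImportError:
        report["torch"] = False
        report["cuda"] = False
        return report
    report["torch"] = meets_minimum(torch.__version__, "1.13")
    cuda = torch.version.cuda  # None for CPU-only builds
    report["cuda"] = cuda is not None and meets_minimum(cuda, "11.6")
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {'ok' if ok else 'missing or too old'}")
```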

Evaluation of TeaCache

We first generate videos from VBench's prompts, then calculate the VBench score, PSNR, LPIPS and SSIM on the generated videos.

Generate video

cd eval/teacache
python experiments/latte.py
python experiments/opensora.py
python experiments/open_sora_plan.py
python experiments/cogvideox.py

Calculate VBench score

# vbench is calculated independently
# get scores for all metrics
python vbench/run_vbench.py --video_path aaa --save_path bbb
# calculate final score
python vbench/cal_vbench.py --score_dir bbb

Calculate other metrics

# these metrics are calculated against the original model's output
# gt video: the video generated by the original model
# generated video: the video generated by our method
python common_metrics/eval.py --gt_video_dir aa --generated_video_dir bb
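For readers unfamiliar with these reference-based metrics, PSNR is the simplest of the three. The sketch below uses the standard definition, 10 · log10(MAX² / MSE), on flat lists of pixel values. It is an illustration only and not the code in common_metrics/eval.py: the `psnr` function name, the flat-list input, and the default `max_value` of 255 are assumptions made for the example.

```python
import math
from typing import Sequence


def psnr(
    reference: Sequence[float],
    generated: Sequence[float],
    max_value: float = 255.0,
) -> float:
    """Peak signal-to-noise ratio in dB between two equally sized pixel sequences."""
    if len(reference) != len(generated):
        raise ValueError("reference and generated must have the same length")
    if len(reference) == 0:
        raise ValueError("inputs must not be empty")
    # Mean squared error between the original model's frame and the cached model's frame.
    mse = sum((r - g) ** 2 for r, g in zip(reference, generated)) / len(reference)
    if mse == 0:
        return math.inf  # identical inputs
    return 10.0 * math.log10(max_value ** 2 / mse)


if __name__ == "__main__":
    gt_frame = [10.0, 20.0, 30.0, 40.0]    # toy "original model" pixels
    gen_frame = [12.0, 19.0, 30.0, 41.0]   # toy "accelerated model" pixels
    print(f"PSNR: {psnr(gt_frame, gen_frame):.2f} dB")
```

Higher PSNR means the accelerated output is closer to the original model's output; identical videos give an infinite value.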

Acknowledgement

This repository is built on VideoSys, Diffusers, Open-Sora, Open-Sora-Plan, Latte, CogVideoX, HunyuanVideo, ConsisID, FLUX, Mochi, LTX-Video and Lumina-T2X. Thanks for their contributions!

License

The majority of this project is released under the Apache 2.0 license as found in the LICENSE file.
For VideoSys, Diffusers, Open-Sora, Open-Sora-Plan, Latte, CogVideoX, HunyuanVideo, ConsisID, FLUX, Mochi, LTX-Video and Lumina-T2X, please follow their LICENSE.
The service is a research preview. Please contact us if you find any potential violations. ([email protected])

Citation

If you find TeaCache useful in your research or applications, please consider giving us a star 🌟 and citing it with the following BibTeX entry.

@article{liu2024timestep,
  title={Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model},
  author={Liu, Feng and Zhang, Shiwei and Wang, Xiaofeng and Wei, Yujie and Qiu, Haonan and Zhao, Yuzhong and Zhang, Yingya and Ye, Qixiang and Wan, Fang},
  journal={arXiv preprint arXiv:2411.19108},
  year={2024}
}
