This is the official reproduction of WISA, designed to enhance Text-to-Video models by improving their ability to simulate the real world.
WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation
Jing Wang*, Ao Ma*, Ke Cao*, Jun Zheng, Zhanjie Zhang, Jiasong Feng, Shanyuan Liu, Yuhang Ma, Bo Cheng, Dawei Leng‡, Yuhui Yin, Xiaodan Liang‡ (*Equal Contribution, ‡Corresponding Authors)
- [2025.03.12] We have released our paper WISA and created a dedicated project homepage.
We are seeking academic interns in the AIGC field. If interested, please send your resume to maao@360.cn.
@misc{wang2025wisa,
title={WISA: World Simulator Assistant for Physics-Aware Text-to-Video Generation},
author={Jing Wang and Ao Ma and Ke Cao and Jun Zheng and Zhanjie Zhang and Jiasong Feng and Shanyuan Liu and Yuhang Ma and Bo Cheng and Dawei Leng and Yuhui Yin and Xiaodan Liang},
year={2025},
eprint={2503.08153},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2503.08153},
}