Diffuman4D Model

Project Page | Paper | Code | Dataset

The official model repo for Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models.

Diffuman4D enables high-fidelity free-viewpoint rendering of human performances from sparse-view videos.

Usage

See the GitHub repo for detailed usage.

Cite

@inproceedings{jin2025diffuman4d,
  title={Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models},
  author={Jin, Yudong and Peng, Sida and Wang, Xuan and Xie, Tao and Xu, Zhen and Yang, Yifan and Shen, Yujun and Bao, Hujun and Zhou, Xiaowei},
  booktitle={International Conference on Computer Vision (ICCV)},
  year={2025}
}

Downloads last month: 37

Safetensors

Model size

0.8B params

Tensor type

F32

Inference Providers NEW

Video-to-Video

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for krahets/Diffuman4D

Base model

stabilityai/stable-diffusion-2-1-base

Finetuned

(56)

this model

Dataset used to train krahets/Diffuman4D

Paper for krahets/Diffuman4D

Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models

Paper • 2507.13344 • Published Jul 17, 2025 • 59