Novel Human Pose and Viewpoints Renderings
We show the original video(left upper), the reconstructed canonical human model(right upper), the reconstructed scene model(left bottom), and the animated reposed human together with the scene(right bottom). All models are trained with the proposed method.
Seattle
Citron
Parking
Bike
Jogging
Lab
Telegathering
Our method is able to provide telegathering for multiple persons. It only requires a single video of the human, which makes telegathering accessible to anyone with a cellphone.
Handshake
Dance
Comparison with NeuralBody
We apply NeuralBody to our dataset in a monocular setting. NeuralBody overfits to the training observations, and produce poor rendering on the back of the subject, while ours generalize better and can faithfully render the back.
Ablations
We ablate the impacts of error correction network, and background model conditioning. Both help reconstructing a sharper and cleaner canonical NeRF model.