Virtually Being: Customizing Camera-Controllable Video Diffusion Models with Multi-View Performance Captures — arXiv2