4Real-Video-V2: Fused View-Time Attention and Feedforward Reconstruction for 4D Scene Generation — arXiv2