Du$^2$Net: Learning Depth Estimation from Dual-Cameras and Dual-Pixels — arXiv2