Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation — arXiv2