LLM2CLIP: Powerful Language Model Unlocks Richer Cross-Modality Representation — arXiv2