data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language — arXiv2