EdgeViTs: Competing Light-weight CNNs on Mobile Devices with Vision Transformers — arXiv2