M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation — arXiv2