Multi-modal Large Language Models Research Papers