AERMANI-VLM: Structured Reasoning for Aerial Manipulation with Vision Language Models