LLM with Scene Manipulation from Voice Commands