Object Detection and Description using VLMs and LLMs