Semantic-Aware Human Object Interaction Image Generation