Coarse-to-fine Q-attention with Tree Expansion