Bird-Eye Transformers for Text Generation Models