Policy Gradient with Self-Attention for Model-Free Distributed Nonlinear Multi-Agent Games