Discovering Diverse Multi-agent Strategic Behavior via Reward Randomization