A cross-disciplinary big-team science effort to study LLM capability evaluation in the context of persuasion