- A session consists of 100 trials, in which the subject chooses between the two alternatives.
- If the subject chooses a rewarded alternative then she receives a monetary reward in that trial.
- A reward may be assigned to one, two or none of the alternatives in each of the trials
- Feedback is partial - the subject is informed about the outcome of her choice but not about the outcome of the forgone choice.
Your challenge is to construct a reward schedule that maximally biases the choices of the subjects, complying with the global constraint:
- Assign a reward to exactly one-quarter of the trials (25 trials) of each alternative.
We offer two participation tracks:
- In the “static” track, choice designers are invited to propose a reward schedules in the form of a sequence of rewards (see for example here).
- In the “dynamic” track, choice designers are challenged to submit a code that allocates, in every trial t, the reward/s of the next trial (rtbias ,rt¬bias ∈{0, 1}), based on the history of choices (a1, a2,...at-1; ai∈{bias, ¬bias}) and rewards ((r1bias ,r1¬bias ), (r2bias ,r2¬bias ),..., (rt-1bias ,rt-1¬bias ); ribias ,ri¬bias ∈{0, 1}), see for example here.