R/P Distro

This Trial mode-specific tab page was first introduced in Maestro v1.4.0 to support a new experimental paradigm, in which reward delivery is contingent upon the subject's measured behavioral response falling within a reward window defined by the researcher (in additioning to satisfying the usual fixation requirements). The goal of this paradigm -- which (for lack of a better term) we coin "distribution-based reward/penalty protocol", or R/P Distro for short -- is to determine whether or not it is possible to modify the variability in a subject's behavior through motivation.

The protocol occurs in two distinct phases: assessment, followed by modification. In the assessment phase, the subject's behavioral response to a particular visual stimulus is measured across repeated presentations of the stimulus. Maestro accumulates the response samples, building up a nominal distribution that characterizes the inherent variability in the subject's behavior.

The "stimulus" is simply a trial defined to use the R/P Distro special operation. The "response sample" is one of four possible measures related to the subject's eye trajectory: the eye velocity vector magnitude; horizontal eye velocity; vertical eye velocity; and the direction of eye motion. In each of these cases, the response sample is averaged over the course of a specially designated segment in the trial, which is typically very short in duration. After examining the nominal response distribution collected in the assessment phase, the researcher defines a reward window spanning some portion of the nominal distribution and then resumes trial sequencing.

Again, the behavioral response is measured as before, but now the subject receives an enhanced reward ("pass") if the response falls within the reward window and a reduced reward ("fail") otherwise. The differential rewards are a function of the two reward pulses defined in the R/P Distro trial. Reward pulse #1 is the nominal reward delivered at the end of each trial during the assessment phase. Reward pulse #2 is only used during the modification phase and must be significantly shorter than pulse #1. During behavior modification, the "enhanced" reward is really two rewards: the shorter reward pulse #2 delivered immediately after the response is measured, plus the nominal reward pulse #1 delivered at trial's end. When the behavioral response falls outside the reward window, no reward is given after the response is measured, and the shorter reward is given instead at trial's end -- resulting in a "reduced" overall reward. Maestro again accumulates the response samples over many presentations of the trial, forming a modified response distribution. If the variability in the subject's behavior can indeed be altered by this approach, we would expect this second distribution to be skewed toward the defined reward window.

To conduct this kind of an experiment, you must first define a trial set that includes one or more trials employing the R/P Distro special operation and set the reward pulses appropriately for those trials. Maestro can track distribution and reward statistics for any number of such trials simultaneously. For example, one might define R/P Distro trials that measure eye velocity during the initial stage of pursuit of a constant velocity target moving in each of 8 different directions. Once you've entered Trial mode and selected the trial set in the Protocol tab, Maestro will catalogue all trials in the selected set that use the R/P Distro feature. Navigate to the R/P Distro tab page, and note that all such trials are listed in the dropdown combo box labelled Trial. Other controls on the dialog display additional relevant runtime information for the trial currently selected in the combo box; to look at a different R/P Distro trial, simply select its name from the dropdown list. Note that all controls are disabled while a trial sequence is running!

A small "canvas" window offers a graphical depiction of the "current" (white outline) and "previous" (filled in gray) response sample histograms collected for the trial; it will be blank if no response data has been collected. The horizontal extent of the canvas spans the valid response range defined in the numeric edit controls immediately above it. Specifying a reasonable response range may be useful, for example, if you want to ignore trials in which a saccade or an electronics artifact happens to occur during the special segment, resulting in an unreasonable value for the eye velocity. The histograms divide the valid response range into 25 bins. Samples falling outside the response range are still stored -- so if you later widen the range, the histogram will be adjusted to include the omitted samples. In the unnormalized display mode, the height of the histogram bars indicate the #counts in each bin; the vertical axis is scaled by the maximum observed bin count, which is written in the top left corner of the canvas (it is never less than 10). In the normalized display mode, the height of the histogram bars indicate the #counts in each bin relative to the maximum observed over that histogram; in this mode, the vertical axis spans [0.0 .. 1.0]. Simply click on the canvas to toggle between the two modes. Blue bars in the top and bottom margins of the canvas span the behavioral reward window for the trial, if it is enabled. If you want to enlarge this canvas, simply make the mode control panel larger; the canvas's right and bottom edges "stick" to the panel's corresponding edges.

As of Maestro v2.1.1, the experimenter can choose among four different behavioral response measures via the Response Type combo box. The available choices, as mentioned above, are all related to the velocity trajectory of the subject's eye. Note that changing the response type will automatically clear all previously collected response samples. Normally, the response type is selected prior to beginning the experiment.

The "response reward window" for the selected R/P Distro trial is defined by controls in the Reward Window group box. Click the check box to enable or disable the reward window, and use the accompanying numeric edit controls to set the response range covered by the window. Note that reward windows may not extend beyond the valid response range; Maestro will correct invalid entries accordingly.

As of Maestro v1.5.0, the Shift By and UpdIntv parameters were introduced to permit auto-shifting of the reward window based upon the subject's measured responses during the reward phase of the protocol. Here's how it works in the case of a positive shift. After constructing the subject's nominal response distribution in the assessment phase of the experiment, the investigator would set the reward window [Rmin .. Rmax] skewed toward the high side of the distribution, set Shift By to some small positive number δ, and set UpdIntv to the number Nupd of valid response samples collected per window update. After Nupd valid responses are collected, Maestro computes the mean response R. If R > Rmin, then the subject's response has "shifted" in the right direction, and Maestro shifts the reward window to [Rmin+δ .. Rmax+δ]. This process repeats until the investigator stops the trial sequence. Analogously, if the goal is to shift the response distribution to the left, the initial reward window would be skewed to the low side of the nominal distribution and δ would be set to a small negative number. In this case, Maestro would shift the reward window only if the mean response R satisfied R < Rmax.

The Mean/Std Dev control group includes read-only static controls that display the number of valid samples collected (i.e., within the valid response range), the mean, and the standard deviation for the current and previous response sample distributions. Instead of displaying statistics across the entire set of valid response samples, the investigator may choose to include only the most recent samples by setting N to some postive number; if N is zero, all valid samples are included in the calculations. Note that N applies also to the histograms displayed in the canvas window, and that you can specify different values for N for the current and previous response distributions.

The Reward Stats control group displays pass/fail statistics accumulated during the modification phase of the R/P Distro protocol, both for the currently selected trial and across all R/P Distro trials that have been run since Maestro started up.

Clicking the Start New Distrib button will discard the previous distribution, make the current distribution the previous one, then reset the current distribution data. Clicking it twice will effectively discard all response sample data collected thus far.

Given the controls and readouts on the R/P Distro tab page, running an experiment that aims to modify the variability in a subject's behavioral response is straightforward. Switch to Trial mode and select the trial set that contains the R/P Distro trial or trials you wish to present. Before starting the trial sequence, switch to the R/P Distro tab and, for each R/P Distro trial in the set: (1) select the type of behavioral response to be measured; (2) initialize the valid response range; (3) make sure the reward window is disabled; and (4) discard any previously collected response samples by clicking the Start New Distrib button twice. Then start trial sequencing to begin the assessment phase, watching the nominal response sample distribution build up for each relevant trial. When sufficient data has been collected, pause the trial sequence and use the controls in the Reward Window group to set up the response reward window for each trial. Also, click the Start New Distrib pushbutton once for each trial to make the "current" distribution the "previous" one. Then resume trial sequencing to start the modification phase, and keep the R/P Distro dialog in front to monitor the new response distributions and pass/fail statistics as they are collected by Maestro.

When you're done, you will probably want to save a summary of the statistics that have been collected during the experiment. Simply click on the Save Summary pushbutton and select a destination file. Maestro will write a summary of the runtime information collected thus far, in plain text format. As you can see in the example output file at the bottom of this page, the summary includes the overall pass/fail reward statistics and, for each R/P Distro trial:

    • The name of the trial.

    • The type of behavioral response measured (a string that matches the items in the Response Type combo box).

    • The valid response range assigned to the trial.

    • The reward window parameters, if the window is enabled.

    • Pass/fail statistics for the trial.

    • All samples collected for both the current and previous distributions, as well as sample histograms calculated over the valid response range.

Note that there is a utility available, getrpdsummary(), that will read the contents of this file into Matlab.

Sample R/P Distro Summary File

Overall: pass=37, fail=19


Trial name: testRPDistro

Measured response type: Eye Speed

Valid response range: [0.000 to 20.000]

Reward Window: [5.000 to 15.000], shift=0.200, updN=20

#passed = 37, #failed= 19

Current: N = 56 total, 53 valid; mean = 11.130, stdev = 4.472 (over ALL valid samples)

All samples: -1.638 1.224 10.371 7.994 4.343 22.776 22.669 9.808 13.236 16.807

4.899 8.760 14.268 11.469 17.103 3.693 4.912 4.962 12.744 9.481 5.775 17.719

9.474 16.898 19.797 7.940 17.130 9.994 14.955 14.650 8.898 8.175 18.821

7.439 19.096 11.059 12.001 14.021 8.731 18.363 9.403 14.103 12.974 7.336

11.323 9.475 10.347 2.880 10.472 14.915 14.908 8.811 13.020 7.159 11.448

14.285

Sample histogram (over ALL valid samples):

0 [0.000 0.800]; 1 [0.800 1.600]; 0 [1.600 2.400]; 1 [2.400 3.200];

1 [3.200 4.000]; 1 [4.000 4.800]; 3 [4.800 5.600]; 1 [5.600 6.400];

1 [6.400 7.200]; 4 [7.200 8.000]; 3 [8.000 8.800]; 6 [8.800 9.600];

4 [9.600 10.400]; 2 [10.400 11.200]; 3 [11.200 12.000]; 2 [12.000 12.800];

3 [12.800 13.600]; 4 [13.600 14.400]; 4 [14.400 15.200]; 0 [15.200 16.000];

0 [16.000 16.800]; 4 [16.800 17.600]; 2 [17.600 18.400]; 2 [18.400 19.200];

1 [19.200 20.000];

Previous: N = 108 total, 102 valid; mean = 10.409, stdev = 4.464 (over 75 most recent valid samples)

All samples: 5.816 9.139 10.936 18.077 9.116 13.266 7.268 10.971 14.629 16.022

17.653 3.222 10.257 15.101 3.869 13.542 14.358 6.051 11.660 11.028 9.153

8.408 8.522 12.611 -1.155 11.291 9.561 1.766 11.434 11.500 15.520 13.714

9.212 7.012 13.299 9.836 15.826 4.760 14.089 2.964 12.596 6.333 11.627 9.786

5.643 13.858 10.049 5.526 6.753 9.957 19.386 -2.393 1.582 6.179 10.729

11.107 21.984 19.556 18.481 14.040 23.937 10.138 10.524 18.901 11.488 12.342

10.955 15.983 18.752 16.387 5.232 -2.263 18.672 5.954 7.712 10.167 6.162

8.131 14.557 7.735 4.266 10.913 18.773 15.199 16.166 9.793 1.063 3.178 9.568

9.583 13.143 14.540 3.707 13.840 7.900 7.592 7.322 1.788 13.921 11.262 8.121

7.246 8.910 8.864 -1.040 5.679 8.258 5.680

Sample histogram (over 75 most recent valid samples):

0 [0.000 0.800]; 2 [0.800 1.600]; 2 [1.600 2.400]; 2 [2.400 3.200];

3 [3.200 4.000]; 2 [4.000 4.800]; 2 [4.800 5.600]; 9 [5.600 6.400];

2 [6.400 7.200]; 7 [7.200 8.000]; 5 [8.000 8.800]; 9 [8.800 9.600];

8 [9.600 10.400]; 8 [10.400 11.200]; 7 [11.200 12.000]; 3 [12.000 12.800];

4 [12.800 13.600]; 7 [13.600 14.400]; 5 [14.400 15.200]; 3 [15.200 16.000];

3 [16.000 16.800]; 0 [16.800 17.600]; 2 [17.600 18.400]; 5 [18.400 19.200];

2 [19.200 20.000];