I am interested in the perceptual organization of auditory and visual experiences. Knowing how to organize the complex perceptual input we are constantly receiving is crucial for our ability to interact with the world around us. I aim to understand how we make sense of this input, how we deal with cases of overload, and how we know when to combine input from multiple senses into a single percept. I have active and productive collaborations with faculty members and students within several areas of psychology, as well as interdisciplinary partnerships with music and linguistics faculty, which have allowed me to approach these questions in three lines of research.

Rhythm Complexity
In one line of work, I examine the psychological processes associated with perceiving simple and complex auditory patterns. For example, my collaborators and I have proposed a quantitative model of how listeners decide on the starting point of simple auditory patterns (Yu, Getz, & Kubovy, 2015). My main focus has been on understanding more complex rhythm sequences, an area that has received less empirical attention. Specifically, my colleagues and I have examined rhythm complexity in adult musicians and non-musicians using a variety of tasks, including pattern matching, discrimination, and production (Getz, Barton, & Kubovy, 2014). I have also expanded this line of research beyond adults to investigate the understanding of complex rhythms in five-year-old children (in collaboration with Rachel Keen) and in songbirds (in collaboration with Dan Meliza).

Rhythm and Language Overlap
My second line of research addresses the competition between people’s ability to segment rhythms and their ability to perceive whole sentences. Across a number of studies, I have found that sentence processing overwhelms perceptual organization: listeners are told to ignore the words of the sentence and choose the underlying rhythm pattern in which the words are repeating, yet the majority of listeners choose the start of the sentence as the start of the rhythm. This happens even when the sentences are in a non-native language that listeners are beginning to learn (Getz, Salona*, Yu, & Kubovy, 2015). Listeners find it harder to ignore sentences that appear with complex rhythms than sentences that appear with simple rhythms (Getz, Wohltjen*, & Kubovy, in press), and harder to ignore sentences composed of anxiety-related words than sentences composed of neutral words (Gai*, Getz, & Kubovy, in prep).

Audiovisual Cross-modal Correspondences
For my dissertation, I am assessing the replicability of, and top-down influences on, the audiovisual correspondences between auditory pitch and the visual dimensions of size, height, brightness, spatial frequency, and sharpness. To take the pitch-size correspondence as an example, previous research has shown that participants respond faster and more accurately when large objects are paired with low pitches and small objects are paired with high pitches. However, using a variety of direct and conceptual replications, I have been unable to find evidence for an automatic association between pitch and size. Further, when specifically asked to pair in the opposite direction (large/high and small/low), participants do so without any loss in speed or accuracy. Thus, although previous studies have largely assumed a bottom-up association between dimensions, my modified paradigm shows that the direction of the association can often be switched easily through top-down control of attention. I argue that cross-modal correspondences arise at a later decision level that is subject to constraints of language and task instructions, rather than at an early perception-only level.


