The VerBIO dataset is a multimodal bio-behavioral dataset of individuals' affective responses while performing public speaking tasks in real-life and virtual settings. The data were collected as part of a research study (EiF grant 18.02) jointly conducted by the HUBBS Lab and the CIBER Lab at the University of Colorado Boulder and Texas A&M University. The aim of the study is to understand the relationship between bio-behavioral indices and public speaking anxiety in both real-world and virtual learning environments. The study also explores time-continuous detection of stress from multimodal bio-behavioral signals. The dataset contains audio recordings, physiological signals, self-reported measures, and time-continuous stress ratings from 344 public speaking sessions. During these sessions, 55 participants delivered short speeches, on given topics drawn from newspaper articles, in front of a real or virtual audience. More details on the dataset can be found in the following papers:
M. Yadav, M. N. Sakib, E. H. Nirjhar, K. Feng, A. Behzadan, and T. Chaspari, "Exploring individual differences of public speaking anxiety in real-life and virtual presentations," in IEEE Transactions on Affective Computing, vol. 13, no. 3, pp. 1168-1182, 1 July-Sept. 2022, doi: 10.1109/TAFFC.2020.3048299.
E. H. Nirjhar and T. Chaspari, "Modeling Gold Standard Moment-to-Moment Ratings of Perception of Stress from Audio Recordings," in IEEE Transactions on Affective Computing, doi: 10.1109/TAFFC.2024.3435502.
We are releasing an updated version of the VerBIO dataset. This version includes moment-to-moment (i.e., time-continuous) ratings of stress from four annotators, along with their aggregated ratings, which can be used for continuous stress detection.
The first version of the VerBIO dataset, released in 2021, includes the audio recordings, physiological signal time series from wearable sensors (Empatica E4 and Actiwave), and self-reported measures (e.g., trait anxiety, state anxiety, demographics).
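As a quick orientation to working with the files, the sketch below loads one physiological signal with pandas. The file path, folder layout, and file names are assumptions made for illustration only; they are not the dataset's documented schema.

    # Minimal sketch: load one physiological signal time series, assuming it is
    # stored as a CSV file (the path below is hypothetical, not the actual layout).
    import pandas as pd

    # Hypothetical path to an Empatica E4 electrodermal activity (EDA) recording
    # for one participant's TEST session.
    eda = pd.read_csv("Physiological/P001/TEST/EDA.csv")

    print(eda.head())   # inspect the first few samples
    print(len(eda))     # total number of samples

    # The E4 records EDA at 4 Hz, so the approximate recording duration is:
    duration_s = len(eda) / 4.0
    print(f"Approximate duration: {duration_s:.1f} s")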
The dataset is available for academic research only, upon request. To obtain the dataset, please fill out the request submission form after agreeing to the terms and conditions. Please use an email address affiliated with an academic institution to submit the request. Once your request has been verified, we will send a download link for the dataset to the provided email address. If you have any questions or comments, please contact Ehsanul Haque Nirjhar (nirjhar71 at tamu dot edu).
Terms and Conditions:
This dataset will be provided to the requestor for academic research purposes only, after their submitted form has been verified. The dataset cannot be used for commercial purposes. After receiving the data, the requestor may not redistribute or share the data with a third party, or post the data on a public website. If the requestor publishes research work that uses this dataset, the following paper must be cited:
M. Yadav, M. N. Sakib, E. H. Nirjhar, K. Feng, A. Behzadan, and T. Chaspari, "Exploring individual differences of public speaking anxiety in real-life and virtual presentations," in IEEE Transactions on Affective Computing, vol. 13, no. 3, pp. 1168-1182, 1 July-Sept. 2022, doi: 10.1109/TAFFC.2020.3048299.
If you are using the moment-to-moment ratings in your research, please also cite the following paper:
E. H. Nirjhar and T. Chaspari, "Modeling Gold Standard Moment-to-Moment Ratings of Perception of Stress from Audio Recordings," in IEEE Transactions on Affective Computing, doi: 10.1109/TAFFC.2024.3435502.
What’s new in the updated version?
Time-continuous ratings of stress have been added in the Annotation folder and can be used for continuous stress detection (see the sketch after this list). Train-test split information for this use case is also provided.
All features and signals are now provided in .csv format instead of .xlsx format.
Physiological signals for the relaxation and preparation periods are now provided for TEST sessions.
Actiwave features in the Features folders have been updated.
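As a minimal sketch of how the new annotation files might be used, the example below loads one moment-to-moment rating file and computes a frame-wise mean across the four annotators. The file name and column names are assumptions for illustration; the aggregated ratings shipped with the dataset are produced with the gold-standard method described in Nirjhar and Chaspari (2024), so the plain mean here is only a stand-in.

    # Minimal sketch: load a time-continuous stress annotation CSV and form a
    # simple frame-wise average across annotators (column names are hypothetical).
    import pandas as pd

    ratings = pd.read_csv("Annotation/P001_TEST_01.csv")  # hypothetical file name

    annotator_cols = ["annotator_1", "annotator_2", "annotator_3", "annotator_4"]

    # Frame-wise mean across the four annotators; note that the dataset's own
    # aggregated ratings use a gold-standard estimation method, not a plain mean.
    ratings["mean_rating"] = ratings[annotator_cols].mean(axis=1)

    print(ratings["mean_rating"].describe())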
Other Related Publications:
Yang et al., "Deconstructing demographic bias in speech-based machine learning models for digital health," Frontiers in Digital Health 2024
Tutul et al., "Investigating Trust in Human-AI Collaboration for a Speech-Based Data Analytics Task," International Journal of Human-Computer Interaction 2024
Raether et al., "Evaluating Just-In-Time Vibrotactile Feedback for Communication Anxiety," ACM ICMI 2022
Tutul et al., "Investigating Trust in Human-Machine Learning Collaboration: A Pilot Study on Estimating Public Anxiety from Speech," ACM ICMI 2021
Nirjhar et al., "Knowledge- and Data-Driven Models of Multimodal Trajectories of Public Speaking Anxiety in Real and Virtual Settings," ACM ICMI 2021
von Ebers et al., "Predicting the Effectiveness of Systematic Desensitization Through Virtual Reality for Mitigating Public Speaking Anxiety," ACM ICMI 2020
Nirjhar et al., "Exploring Bio-Behavioral Signal Trajectories of State Anxiety During Public Speaking," IEEE ICASSP 2020
Yadav et al., "Virtual reality interfaces and population-specific models to mitigate public speaking anxiety," IEEE ACII 2020 (nominated for best paper award)
Yadav et al., "Speak Up! Studying the interplay of individual and contextual factors to physiological-based models of public speaking anxiety," TransAI 2019 (invited paper)