Georgios Chochlakis, Turab Iqbal, Woohyun Kang, Zhaocheng Huang, "Modality-Agnostic Multimodal Emotion Recognition using a Contrastive Masked Autoencoder", to appear, INTERSPEECH '25, Rotterdam, The Nertherlands.
Nilaksh Das, Saket Dingliwal, Srikanth Ronanki, Rohit Paturi, Zhaocheng Huang, Prashant Mathur, Jie Yuan, Dhanush Bekal, Xing Niu, Sai Muralidhar Jayanthi, Xilai Li, Karel Mundnich, Monica Sunkara, Sundararajan Srinivasan, Daniel Garcia-Romero, Kyu J Han, Katrin Kirchhoff, "SpeechVerse: A Large-scale Generalizable Audio Language Model", 2024. [arvix] [ARR]
Juan Pablo Zuluaga-Gomez, Zhaocheng Huang*, Xing Niu*, Rohit Paturi, Sundararajan Srinavasan, Prashant Mathur, Brian Thompson, Marcello Federico, "End-to-End Single-Channel Speaker-Turn Aware Conversational Speech Transalation", in the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), Singapore. [slides] [arvix] [code]
Sundararajan Srinivasan, Zhaocheng Huang, Katrin Kirchhoff, "Representation Learning through Cross-modal Conditional Teacher-student Training for Speech Emotion Recognition", ICASSP '22, Singapore. [oral presentation] [slides] [arvix]
Brian Stasak, Zhaocheng Huang, Julien Epps, and Dale Joachim, "Depression Classification Using n-Gram Speech Errors from Manual and Automatic Stroop Color Test Transcripts", 2021 IEEE Engineering in Medicine and Biology Conference (EMBC' 21).
Brian Stasak, Zhaocheng Huang, Dale Joachim, and Julien Epps, "Automatic Elicitation Compliance for Short-Duration Speech-based Depression Detection", ICASSP '21, Toronto, Canada, pp. 7283 - 7287, 2021.
Zhaocheng Huang, Julien Epps, Dale Joachim, Brian Stasak, James. R. Williamson, and Thomas. F. Quatieri, "Domain Adaptation for Enhancing Speech-based Depression Detection in Natural Environmental Conditions Using Dilated CNNs", INTERSPEECH '20, Shanghai, China, pp. 4561-4565, 2020.
Sadari Jayawardena, Julien Epps, Zhaocheng Huang, "How Ordinal Are Your Data?", INTERSPEECH '20, Shanghai, China, pp. 1853-1857, 2020.
Zhaocheng Huang, Julien Epps, Dale Joachim, "Exploiting Vocal Tract Coordination using Dilated CNNs for Depression Detection in Natrualistic Environements." ICASSP '20, Barcelona, Spain, pp. 6549–6553, 2020. [oral presentation][slides]
Zhaocheng Huang, Julien Epps, Dale Joachim, "Speech Landmark Bigrams for Depression Detection from Naturalistic Smartphone Speech.", ICASSP '19, Brighton, UK, pp. 5856–5860, 2019. [oral presentation][slides]
Zhaocheng Huang, Julien Epps, Dale Joachim, Michael. C. Chen, "Depression Detection from Short Utterances via Diverse Smartphones in Natural Environmental Conditions." INTERSPEECH '18, Hyderabad, India, pp. 3393–3397, 2018. [poster]
Ting Dang, Brian Stasak, Zhaocheng Huang, Sadari Jayawardena, Mia Atcheson, Munawar Hayat, Phu Le, Vidhyasaharan Sethu, Roland Goecke, and Julien Epps, “Investigating Word Affect Features and Fusion of Probabilistic Predictions Incorporating Uncertainty in AVEC 2017.” In Proceedings of the 7th International Workshop on Audio/Visual Emotion Challenge (AVEC ’17). ACM, 2017, Mountain View, CA USA, pp. 27–35, 2017. [pdf] [oral presentation]
Zhaocheng Huang, and Julien Epps. "An Investigation of Emotion Dynamics and Kalman Filtering for Speech-based Emotion Prediction", INTERSPEECH '17, Stockholm, Sweden, pp. 3301–3305, 2017. [pdf] [poster]
Zhaocheng Huang, and Julien Epps. "A PLLR and Multi-Stage Staircase Regression Framework for Speech-based Emotion Prediction" 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '17), New Orleans, US, pp. 5145–5149, 2017. [pdf] [poster]
Zhaocheng Huang, and Julien Epps. "Time to Embrace Emotion Change: Selecting Emotionally Salient Segments for Speech-based Emotion Prediction" The 16th International Conference on Speech Science and Technology (SST '16), Sydney, Australia, pp. 281–284, 2016. [pdf] [oral presentation] [slides]
Zhaocheng Huang, Brian Stasak, Ting Dang, Kalani Wataraka Gamage, Phu Le, Vidhyasaharan Sethu, Julien Epps. "Staircase Regression in OA RVM, Data Selection and Gender Dependency in AVEC 2016" In Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge (AVEC '16), ACM Multimedia, pp. 19–26, 2016. [pdf] [oral presentation] [slides]
Zhaocheng Huang, and Julien Epps. "Detecting the instant of emotion change from speech using a martingale framework" 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP '16), pp. 5195-5199, 2016. [pdf] [oral presentation] [slides]
Zhaocheng Huang, Ting Dang, Nicholas Cummins, Brian Stasak, Phu Le, Vidhyasaharan Sethu, and Julien Epps. "An Investigation of Annotation Delay Compensation and Output-Associative Fusion for Multimodal Continuous Emotion Prediction." In Proceedings of the 5th International Workshop on Audio/Visual Emotion Challenge (AV+EC '15), ACM Multimedia, pp. 41-48, 2015. [pdf] [oral presentation]
Zhaocheng Huang. "An investigation of emotion changes from speech." 2015 International Conference on Affective Computing and Intelligent Interaction (ACII '15), pp. 733-736, 2015. [pdf] [oral presentation]
Zhaocheng Huang, Julien Epps, and Eliathamby Ambikairajah. "An Investigation of Emotion Change Detection from Speech" The 16th Annual Conference of the International Speech Communication Association (INTERSPEECH '15), pp. 5195-5199, 2015. [pdf] [poster]