Publications

Aanya Pratapneni, Alice Yuan, and TJ Tsai. Estimating the Reliability of Dynamic Time Warping Alignments Using Circumstantial Evidence. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2026. [code]
TJ Tsai, Kavi Dey, Yigitcan Ozer, and Meinard Mueller. Dense-Sparse Dynamic Time Warping for Customizing Piano Concerto Accompaniments. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2025, pp. 1-5. [code][talk]
Arhan Jain, Alec Bunn, Austin Pham, and TJ Tsai. PBSCR: The Piano Bootleg Score Composer Recognition Dataset. Transactions of the International Society for Music Information Retrieval, 7(1): 159-178, 2024.
Jittisa Kraprayoon, Austin Pham, and TJ Tsai. Improving the Robustness of DTW to Global Time Warping Conditions in Audio Synchronization. Applied Sciences, 14(4): 1459, 2024. [code]
Irmak Bukey, Jason Zhang, and TJ Tsai. FlexDTW: Dynamic Time Warping With Flexible Boundary Conditions. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2023, pp. 733-740. [code][talk]
Heidi Lei, Arm Wonghirundacha, Irmak Bukey, and TJ Tsai. Audio Cross Verification Using Dual Alignment Likelihood Ratio Test. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2023, pp. 1-5. [code][talk]
Marcos Acosta, Irmak Bukey, and TJ Tsai. An Exploration of Generating Sheet Music Images. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2022, pp. 701-708. [code] [talk]
Daniel Yang, Thaxter Shaw, and TJ Tsai. A Study of Parallelizable Alternatives to Dynamic Time Warping for Aligning Long Sequences. IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 30, pp. 2117-2127, 2022. [code]
Daniel Yang, Arya Goutam, Kevin Ji, and TJ Tsai. Large-Scale Multimodal Piano Music Identification Using Marketplace Fingerprinting. Algorithms, 15(5): 146, 2022. [code]
Claire Chang, Thaxter Shaw, Arya Goutam, Christina Lau, Mengyi Shan, and TJ Tsai. Parameter-Free Ordered Partial Match Alignment with Hidden State Time Warping. Applied Sciences, 12(8): 3783, 2022. [code]
Daniel Yang and TJ Tsai. Composer Classification With Cross-Modal Transfer Learning and Musically-Informed Augmentation. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2021, pp. 802-809. [code] [short talk] [long talk]
Daniel Yang, Kevin Ji, and TJ Tsai. Aligning Unsynchronized Part Recordings to a Full Mix Using Iterative Subtractive Alignment. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2021, pp. 810-817. [code] [talk]
Kevin Ji, Daniel Yang, and TJ Tsai. Piano Sheet Music Identification Using Marketplace Fingerprinting. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2021, pp. 326-333. [code] [talk]
TJ Tsai. Segmental DTW: A Parallelizable Alternative to Dynamic Time Warping. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021, pp. 106-110. [code] [talk]
Kevin Ji, Daniel Yang, and TJ Tsai. Instrument Classification of Solo Sheet Music Images. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2021, pp. 546-550. [code] [talk]
Mengyi Shan and TJ Tsai. Automatic Generation of Piano Score Following Videos. Transactions of the International Society for Music Information Retrieval, 4(1): 29-41, 2021. [code]
Daniel Yang and TJ Tsai. Piano Sheet Music Identification Using Dynamic N-gram Fingerprinting. Transactions of the International Society for Music Information Retrieval, 4(1): 42-51, 2021. [code]
Daniel Yang, Kevin Ji, and TJ Tsai. A Deeper Look at Sheet Music Composer Classification Using Self-Supervised Pretraining. Applied Sciences, 11(4): 1387, 2021. [code]
Daniel Yang and TJ Tsai. Camera-Based Piano Sheet Music Identification. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2020, pp. 481-488. [code] [talk]
TJ Tsai and Kevin Ji. Composer Style Classification of Piano Sheet Music Images Using Language Model Pretraining. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2020, pp. 176-183. [code] [talk]
Mengyi Shan and TJ Tsai. Improved Handling of Repeats and Jumps in Audio-Sheet Image Synchronization. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2020, pp. 62-69. [code] [talk]
TJ Tsai. Towards Linking the Lakh and IMSLP Datasets. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2020, pp. 546-550. [code] [talk]
TJ Tsai, Daniel Yang, Mengyi Shan, Thitaree Tanprasert, and Teerapat Jenrungrot. Using Cell Phone Pictures of Sheet Music To Retrieve MIDI Passages. IEEE Transactions on Multimedia, 22(5): 3115-3127, 2020. [code] [data]
Daniel Yang, Thitaree Tanprasert, Teerapat Jenrungrot, Mengyi Shan, and TJ Tsai. MIDI Passage Retrieval Using Cell Phone Pictures of Sheet Music. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2019, pp. 916-923. [code] [data] [talk]
Thitaree Tanprasert, Teerapat Jenrungrot, Meinard Müller, and TJ Tsai. MIDI-Sheet Music Alignment Using Bootleg Score Synthesis. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2019, pp. 91-98. [code] [talk]
TJ Tsai, Steven Tjoa, and Meinard Müller. Make Your Own Accompaniment: Adapting Full-Mix Recordings to Match Solo-Only Recordings. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2017, pp. 79-86. [audio samples]
TJ Tsai, Thomas Prätzlich, and Meinard Müller. Known-Artist Live Song Identification Using Audio Hashprints. IEEE Transactions on Multimedia, 19(7): 1569-1582, 2017. [code]
TJ Tsai, Thomas Prätzlich, and Meinard Müller. Known-Artist Live Song ID: A Hashprint Approach. Proceedings of the International Society for Music Information Retrieval Conference (ISMIR), 2016, pp. 427-433.
TJ Tsai and Andreas Stolcke. Robust and Efficient Multiple Alignment of Unsynchronized Meeting Recordings. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 24(5): 833-845, 2016.
TJ Tsai, Andreas Stolcke, and Malcolm Slaney. A Study of Multimodal Addressee Detection in Human-Human-Computer Interaction. IEEE Transactions on Multimedia, 17(9): 1550-1561, 2015.
TJ Tsai and Andreas Stolcke. Aligning Meeting Recordings Via Adaptive Fingerprinting. Proceedings of Interspeech, 2015, pp. 786-790
TJ Tsai. Are You TED Talk Material? Comparing Prosody in Professors and TED Speakers. Proceedings of Interspeech, 2015, pp. 2534-2538.
TJ Tsai, Andreas Stolcke, and Malcolm Slaney. Multimodal Addressee Detection in Multiparty Dialogue Systems. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015, pp. 2314-2318.
TJ Tsai, Gerald Friedland, and Xavier Anguera. An Information-Theoretic Metric of Fingerprint Effectiveness. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2015, pp. 340-344.
TJ Tsai and Adam Janin. Confidence-Based Scoring: A Useful Diagnostic Tool for Detection Tasks. Proceedings of Interspeech, 2013, pp. 737-741.
TJ Tsai and Nelson Morgan. Speech Activity Detection: An Economics Approach. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2013, pp. 6842-6846.
TJ Tsai and Nelson Morgan. Longer Features: They Do a Speech Detector Good. Proceedings of Interspeech, 2012, pp. 1356-1359.

Page updated

Report abuse