Jie Pu

Hi, I am a research scientist at Zoom, working on multilingual speech recognition and language modelling. 

My research covers speech and language AI, including speech recognition (ASR), speech synthesis (TTS), speaker recognition, speaker diarization, and speech separation.

Before joining Zoom, I worked at the University of Cambridge with Prof. Mark Gales. I received my PhD from Imperial College London, under the supervision of Prof. Maja Pantic and Dr. Yannis Panagakis.

Research Topics and Selected Publications 

Multi-stage Large Language Model Correction for Speech Recognition [Paper]

Jie Pu, Thai-Son Nguyen, Sebastian Stüker

Scaling Effect of Self-supervised Speech Models [Paper]

Jie Pu, Yuguang Yang, Ruirui Li, Oguz Elibol, Jasha Droppo

Building Synthetic Speaker Profiles in Text-to-speech Systems [Paper]

Jie Pu, Yixiong Meng, Oguz Elibol

Blind Audio–Visual Localization and Separation via Low-Rank and Sparsity [Paper]

Jie Pu, Yannis Panagakis, Stavros Petridis, Jie Shen, Maja Pantic

Multimodal Video Search by Examples (UK EPSRC funded) [Project]

Research project with the Universities of Cambridge, Surrey, and Ulster, and the BBC

Full publication list on Google Scholar

Contact

Email: jp936 [at] cantab.ac.uk

Social: LinkedIn