Home

Siyuan Feng, Ph.D.

R&D, ByteDance

Postdoctoral researcher, Delft University of Technology (TU Delft), Delft, The Netherlands

Ph.D., The Chinese University of Hong Kong, Hong Kong S.A.R. of China

B.Eng., Tsinghua University, Beijing, China

[Last update: 2023-Sept]

I am currently a speech recognition researcher and engineer at ByteDance.

I was a postdoc researcher in Multimedia Computing Group, Delft University of Technology (TU Delft) since during 2019 ~ 2021, working with Dr. Odette Scharenborg. I had worked in a collaborative research project with a group owned by Prof. Mark Hasegawa-Johnson (University of Illinois Urbana-Champaign) and a group owned by Prof. Najim Dehak (Johns Hopkins University), between 2020 and 2021.

I obtained the Ph.D. degree in Department of Electronic Engineering, The Chinese University of Hong Kong (CUHK), in 2020. My Ph.D advisor is Prof. Tan Lee. I started my Ph.D. Programme since 2015. I obtained the B.Eng. in Department of Electronic Engineering, Tsinghua University, Beijing, China in 2015.

My research interests include automatic speech recognition (ASR), low-resource/zero-resource speech modeling and inclusive ASR.

My CV (not up-to-date).

E-mail: fengsym.ee@gmail.com

ORCID ID: 0000-0003-2531-8480

GitHub

News

10-Sep 2023: My Journal paper "Towards inclusive automatic speech recognition" got accepted by Computer Speech and Language. It is an extended version of the arxiv pre-print: "Quantifying bias in automatic speech recognition".
30-May-2023: Two papers got accepted to INTERSPEECH 2023: "Language-Universal Phonetic Representation in Multilingual Speech Pretraining for Low-Resource Speech Recognition" and "Language-universal Phonetic Encoder for Low-resource Speech Recognition".
2-June 2021: My paper accepted to INTERSPEECH 2021.
22-Apr 2021: My paper accepted for publication in IEEE Open Journal of Signal Processing.
2-Apr 2021: My two papers submitted to INTERSPEECH 2021: one for unsupervised acoustic unit discovery, the other for quantifying bias in ASR.
30-Jan 2021: My paper accepted to ICASSP 2021; A collaborative work with Xinsheng Wang accepted to ICASSP 2021.
25-July 2020: My paper accepted to INTERSPEECH 2020.
6-Mar 2020: I have successfully defended my Ph.D. Thesis.
25-January 2020: The joint work with Zhiyuan Peng on speech representation learning accepted to ICASSP 2020 as a poster presentation.
15-September 2019: I am attending INTERSPEECH 2019 at Graz, Austria, to give two oral presentations.
8-August 2019: My Journal manuscript "Exploiting Cross-Lingual Speaker and Phonetic Diversity for Unsupervised Subword Modeling" accepted for publication in Transactions on Audio, Speech and Language Processing (Trans. ASLP). Preprint available here.
31-July 2019: I will be visiting Prof. Hung-yi Lee's Lab at Department of Electrical Engineering, National Taiwan University, as a research exchange student for 2 weeks.
13-July 2019: I am giving a talk to M.Sc. students in CUHK(SZ). The title of my talk is "Unsupervised acoustic modeling for less-studied languages".
17-Jun 2019: My 2 papers accepted to INTERSPEECH 2019: ArXiv preprint of ZeroSpeech 2019 Challenge in a special session, and unsupervised acoustic modeling as an oral presentation.
12-May 2019: I am attending IEEE ICASSP 2019 at Brighton, UK, to give a poster presentation about language Identification.
25-Mar 2019: Our team achieved the second place (among 19 challenge submissions) in Zero Resource Speech Challenge (ZeroSpeech) 2019, ABX subword discriminability task!
1-Feb 2019: The joint work with Zhiyuan Peng on language identification accepted to IEEE ICASSP 2019.
2-Sep 2018: I am attending INTERSPEECH 2018 at Hyderabad, India, to give an oral and two poster presentations.
8-Aug 2018: The joint work with Yuanyuan Liu and Ying Qin on disordered speech assessment accepted to ISCSLP 2018 as an oral presentation.
7-Aug 2018: The joint work with Man-Ling Sung on unsupervised spoken pattern discovery accepted to APSIPA-ASC 2018 as an oral presentation.
4-Jun 2018: My 2 papers accepted to INTERSPEECH 2018: Unsupervised acoustic modeling as an oral presentation, and cross-lingual acoustic modeling as a poster presentation. The joint work with Ying Qin on Aphasic speech assessment is also accepted as a poster presentation.

Education

2015-2020, Ph.D. in Electronic Engineering

The Chinese University of Hong Kong, Hong Kong S.A.R. of China

2011-2015, B.Eng in Electronics Information Engineering

Tsinghua University, Beijing, China

Work experience

2021-now, Researcher and engineer

ByteDance

2019-2021, Postdoctoral researcher

Delft University of Technology, Delft, The Netherlands

Publications

Or, you may also find them from Google Scholar, or DBLP.

Teaching Experiences

Machine Intelligence - Principles and Applications (TECS 2461, for selected outstanding Hong Kong middle school students). Teaching Assistant with Prof Tan Lee and Prof Hongsheng Li.

18-19 Sem 2

Introduction to Digital Signal Processing (ELEG 3503, for undergraduates of CUHK). Teaching Assistant with Prof. Tan Lee

18-19 Sem 1

Hong Kong Primary & Secondary Schools STEM Robotics Competition 2018. Organizer and Tutor

17-18 Sem 1 & 2

Digital Processing of Speech Signals (ELEG 5741, for postgraduates of CUHK). Teaching Assistant with Prof. Tan Lee

16-17 Sem 1

Signals and Systems (ENGG 2030, for undergraduates of CUHK). Teaching Assistant with Prof. Tan Lee

15-16 Sem 1

16-17 Sem 1

Digital Signal Processing and Applications (ELEG 4501, for undergraduates of CUHK). Teaching Assistant with Prof. Tan Lee

15-16 Sem 2

16-17 Sem 2

Google Sites

Report abuse