Dr. Fei Huang is a Senior Director and Principal Research Scientist of Language Technology Lab, Alibaba DAMO Academy. He leads NLP foundation and machine translation teams, which developed AliNLP platform in AliCloud. The platform supports several hundreds internal and external clients with advanced NLP models, systems and solutions. Before Alibaba, Dr. Huang led the development of machine translation systems at Facebook AML which helps reduce language barriers for 20 Billion people. He was a senior researcher as IBM Watson after gradating from CMU with Ph.D. focusing on language technologies.

Dr. Huang has published 30+ papers and 20+ patents on machine translation, multilingual natural language processing. He has served as Area Chairs at ACL-IJCNLP 2015, NLPCC 2018, Senior PC member of IJCAI/AAAI.

Email: feirhuang(at)gmail.com

EDUCATION

CARNEGIE MELLON UNIVERSITY

LANGUAGE TECHNOLOGY INSTITUTE, SCHOOL OF COMPUTER SCIENCE

Ph.D. in Language and Information Technologies

CARNEGIE MELLON UNIVERSITY

LANGUAGE TECHNOLOGY INSTITUTE, SCHOOL OF COMPUTER SCIENCE

M.S. in Language Technologies

CHINESE ACADEMY OF SCIENCES

M.S. in Pattern Recognition and Intelligent System

TIANJIN UNIVERSITY

B.E. (major) in Electrical Engineering and Automation with High Honors

B.A. (minor) in English for Science and Technology

WORK EXPERIENCE

Alibaba DAMO Academy 2018-present

Senior Director and Principal Research Scientist of Language Technology Lab, DAMO Academy

"Make Language Local, Make Business Global"

He leads the Alibaba DAMO NLP foundational technologies and dialogue teams, which support the business needs inside and outside the Alibaba Group. He lead research teams conducting cutting-edge research in deep language model, multi-modal QA and machine reading comprehension.

The AliceMind pre-training language models built by the team released PLUG, (as of 2021/04) the largest-scale Chinese pre-training model, as well as VQA and machine reading comprehension (MRC) systems which surpass human performance for the first time. The NLP research team has published hundreds of papers at top AI/NLP conferences, won more than 20 international competition champions, and developed the first e-commerce live translation system in the industry. Related technologies and products support hundreds of scenarios in the Alibaba Group, with trillions of service calls per day, and empower multiple industry partners.

Facebook 2014-2018

Senior Tech Leader/Staff Research Scientist in Machine Translation, Applied Machine Learning and Places Data, Local

"Connect the world in everyone's language"

Reduce language barriers for 20 Billion people, building machine translation systems among 45+ languages

IBM T. J. Watson Research Center 2006-2014

Senior Research Staff Member in Statistical Machine Translation

"Make Watson multilingual"

GALE, nFluent, research in machine translation, information extraction and statistical NLP technologies


Publication

https://scholar.google.com/citations?hl=en&user=9r98PpoAAAAJ


He, Wanwei; Dai, Yinpei; Zheng, Yinhe; Wu, Yuchuan; Cao, Zheng; Liu, Dermot; Jiang, Peng; Yang, Min; Huang, Fei; Si, Luo;

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection

arXiv preprint arXiv:2111.14592

2021


Liu, Che; Wang, Rui; Liu, Jinghua; Sun, Jian; Huang, Fei; Si, Luo;

DialogueCSE: Dialogue-based Contrastive Learning of Sentence Embeddings

arXiv preprint arXiv:2109.12599

2021


Chen, Xiang; Zhang, Ningyu; Li, Lei; Xie, Xin; Deng, Shumin; Tan, Chuanqi; Huang, Fei; Si, Luo; Chen, Huajun;

Lightner: A lightweight generative framework with prompt-guided attention for low-resource ner

arXiv preprint arXiv:2109.00720

2021


Luo, Fuli; Wang, Wei; Liu, Jiahao; Liu, Yijia; Bi, Bin; Huang, Songfang; Huang, Fei; Si, Luo;

VECO: Variable and Flexible Cross-lingual Pre-training for Language Understanding and Generation

arXiv preprint arXiv:2010.16046

2020


Xia, Qingrong; Zhang, Bo; Wang, Rui; Li, Zhenghua; Zhang, Yue; Huang, Fei; Si, Luo; Zhang, Min;

A unified span-based approach for opinion mining with syntactic constituents

Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies pp.1795-1804.

2021


Li, Chenliang; Bi, Bin; Yan, Ming; Wang, Wei; Huang, Songfang; Huang, Fei; Si, Luo;

StructuralLM: Structural Pre-training for Form Understanding

arXiv preprint arXiv:2105.11210

2021


Dai, Yinpei; Li, Hangyu; Li, Yongbin; Sun, Jian; Huang, Fei; Si, Luo; Zhu, Xiaodan;

Preview, Attend and Review: Schema-Aware Curriculum Learning for Multi-Domain Dialog State Tracking

arXiv preprint arXiv:2106.00291

2021


Zhang, Ningyu; Chen, Xiang; Xie, Xin; Deng, Shumin; Tan, Chuanqi; Chen, Mosha; Huang, Fei; Si, Luo; Chen, Huajun;

Document-level relation extraction as semantic segmentation

arXiv preprint arXiv:2106.03618

2021


Zhang, Ningyu; Chen, Mosha; Bi, Zhen; Liang, Xiaozhuan; Li, Lei; Shang, Xin; Yin, Kangping; Tan, Chuanqi; Xu, Jian; Huang, Fei;

CBLUE: A Chinese Biomedical Language Understanding Evaluation Benchmark

arXiv preprint arXiv:2106.08087

2021


Sun, Yajing; Shan, Yong; Tang, Chengguang; Hu, Yue; Dai, Yinpei; Yu, Jing; Sun, Jian; Huang, Fei; Si, Luo;

Unsupervised Learning of Deterministic Dialogue Structure with Edge Graph Auto-Encoder

2021


Sun, Yajing; Shan, Yong; Tang, Chengguang; Hu, Yue; Dai, Yinpei; Yu, Jing; Sun, Jian; Huang, Fei; Si, Luo;

Unsupervised Learning of Deterministic Dialogue Structure with Edge-Enhanced Graph Auto-Encoder

Proceedings of the AAAI Conference on Artificial Intelligence

13869-13877

2021


Hui, Binyuan; Geng, Ruiying; Ren, Qiyu; Li, Binhua; Li, Yongbin; Sun, Jian; Huang, Fei; Si, Luo; Zhu, Pengfei; Zhu, Xiaodan;

Dynamic hybrid relation exploration network for cross-domain context-dependent semantic parsing

Proceedings of the AAAI Conference on Artificial Intelligence

13116-13124

2021


Hui, Binyuan; Geng, Ruiying; Ren, Qiyu; Li, Binhua; Li, Yongbin; Sun, Jian; Huang, Fei; Si, Luo; Zhu, Pengfei; Zhu, Xiaodan;

Dynamic hybrid relation network for cross-domain context-dependent semantic parsing

arXiv preprint arXiv:2101.01686

2021


Xu, Haiyang; Yan, Ming; Li, Chenliang; Bi, Bin; Huang, Songfang; Xiao, Wenming; Huang, Fei;

E2E-VLP: End-to-End Vision-Language Pre-training Enhanced by Visual Learning

arXiv preprint arXiv:2106.01804

2021


Xu, Runxin; Luo, Fuli; Zhang, Zhiyuan; Tan, Chuanqi; Chang, Baobao; Huang, Songfang; Huang, Fei;

Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning

arXiv preprint arXiv:2109.05687

2021


Luo, Fuli; Yang, Pengcheng; Li, Shicheng; Ren, Xuancheng; Sun, Xu; Huang, Songfang; Huang, Fei;

Rethinking Denoised Auto-Encoding in Language Pre-Training

Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

pp 2922-2932.

2021


Wang, Ke; Chen, Guandan; Huang, Zhongqiang; Wan, Xiaojun; Huang, Fei;

Bridging the Domain Gap: Improve Informal Language Translation via Counterfactual Domain Adaptation

Proceedings of the AAAI Conference on Artificial Intelligence. pp 13970-13978.

2021


Xu, Runxin; Luo, Fuli; Wang, Chengyu; Chang, Baobao; Huang, Jun; Huang, Songfang; Huang, Fei;

From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression

arXiv preprint arXiv:2112.07198

2021


Zhang, Haoyu; Long, Dingkun; Xu, Guangwei; Zhu, Muhua; Xie, Pengjun; Huang, Fei; Wang, Ji;

Learning with noise: improving distantly-supervised fine-grained entity typing via automatic relabeling

Proceedings of the Twenty-Ninth International Conference on International Joint Conferences on Artificial Intelligence

AAAI 2021. pp 3808-3815

2021


Guo, Zilu; Huang, Zhongqiang; Zhu, Kenny Q; Chen, Guandan; Zhang, Kaibo; Chen, Boxing; Huang, Fei;

Automatically paraphrasing via sentence reconstruction and round-trip translation. IJCAI

2021


Ding, Ning; Wang, Xiaobin; Fu, Yao; Xu, Guangwei; Wang, Rui; Xie, Pengjun; Shen, Ying; Huang, Fei; Zheng, Hai-Tao; Zhang, Rui;

Prototypical representation learning for relation extraction. arXiv preprint arXiv:2103.11647.

2021


Bi, Bin; Li, Chenliang; Wu, Chen; Yan, Ming; Wang, Wei; Huang, Songfang; Huang, Fei; Si, Luo;

Palm: Pre-training an autoencoding&autoregressive language model for context-conditioned generation.

arXiv preprint arXiv:2004.07159.

2020


Xu, Lu; Bing, Lidong; Lu, Wei; Huang, Fei;

Aspect sentiment classification with aspect-specific opinion spans; Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

pp 3561-3567.

2020


Wang, Xinyu; Jiang, Yong; Bach, Nguyen; Wang, Tao; Huang, Zhongqiang; Huang, Fei; Tu, Kewei; More Embeddings, Better Sequence Labelers? EMNLP 2020

Chuanqi Tan, Wei Qiu, Mosha Chen, Rui Wang, Fei Huang, Boundary Enhanced Neural Span Classification for Nested Named Entity Recognition, AAAI2020

Haiyun Peng, Lu Xu, Lidong Bing, Fei Huang, Wei Lu, Luo Si, Knowing What, How and Why: A Near Complete Solution for Aspect-Based Sentiment Analysis, AAAI2020

Nguyen Bach, Fei Huang Noisy BiLSTM-Based Models for Disfluency Detection, Interspeech 2019

Nguyen Bach, Hongjie Chen, Kai Fan, Cheung-Chi Leung, Bo Li, Chongjia Ni, Rong Tong, Pei Zhang, Boxing Chen, Bin Ma, Fei Huang, Alibaba Speech Translation Systems for IWSLT 2018, IWSLT 2018

Yuanhang Su, Kai Fan, Nguyen Bach, C-C Jay Kuo, Fei Huang, Unsupervised multi-modal neural machine translation, CVPR 2018

Chen Li, Zhongyu Wei, Yang Liu, Yang Jin, Fei Huang Using Facebook Public Posts to Enhance Trending News Summarization, In the Proceedings of the COLINGL-2016, Osaka, Japan, 2016.

Boxing Chen, Roland Kuhn, George Foster, Colin Cherry, Fei Huang Bilingual Methods for Adaptive Training Data Selection for Machine Translation, In the Proceedings of AMTA 2016

Boxing Chen and Fei Huang Semi-supervised Convolutional Networks for Translation Adaptation with Tiny Amount of In-domain Data, In the Proceedings of the CoNLL-2016, Berlin, Germany, 2016.

Fei Huang. Improved Arabic Dialect Classification with Social Media Data, In the Proceedings of EMNLP 2015, Lisbon, Portugal, 2015.

Fei Huang, Jian-Ming Xu, Abraham Ittycheriah, Salim Roukos, Adaptive HTER Estimation for Document-Specific MT Post-Editing, In the Proceedings of the ACL 2014, Baltimore, MD, USA, 2014.

Fei Huang and Cezar Pendus, Generalized Reordering Rules for Improved SMT, In the Proceedings of the ACL-2013, Bulgaria, August, 2013.

Qi Li, Haibo Li, Heng Ji, Wen Wang, Jing Zheng, Fei Huang, Joint Bilingual Name Tagging for Parallel Corpora, Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM) pp1727-1731, Hawaii, ISA, Oct. 2012.

Nguyen Bach, Fei Huang and Yaser Al-Onaizan, Goodness: A Method for Measuring Machine Translation Confidence, to appear In the Proceedings of the ACL-HLT 2011, Oregon, USA, June 2011.

Fei Huang and Bing Xiang Feature-Rich Phrase Rescoring for SMT, In the Proceedings of the COLING 2010, Beijing, September 2010.

Araham Ittycheriah, Fei Huang, Salim Roukos and Abhishek Arun, Improved Word Alignment Algorithms for Arabic-English and Chinese-English, book chapter in the book GALE “MT from Text”. 2009

Fei Huang, Confidence Measure for Word Alignment, In the Proceedings of the ACL-IJCNLP 2009, Singapore, July 2009.

Fei Huang, Ahmad Emami and Imed Zitouni, When Harry Mey Harri: Cross-lingual Name Spelling Normalization Hierarchical System Combination for Machine Translation, In the Proceedings of the EMNLP-2008, Hawaii, USA, October 2008.

Fei Huang and Kishore Papineni, Hierarchical System Combination for Machine Translation, In the Proceedings of the EMNLP-CoNLL 2007, Prague, Czech Repiblic, July 2007.

Fei Huang, Multilingual Named Entity Extraction and Translation from Text and Speech, Ph.D. Thesis, Carnegie Mellon University, 2005.

Fei Huang, Cluster-specific Name Transliteration, In the Proceedings of the HLT-EMNLP 2005, Vancouver, BC, Canada, October 2005.

Fei Huang, Ying Zhang and Stephan Vogel, Mining Key Phrase Translations from Web Corpora, In the Proceedings of the HLT-EMNLP 2005, Vancouver, BC, Canada, October 2005.

Fei Huang, Stephan Vogel and Alex Waibel, Clustering and Classifying Names by Origins, in the Proceedings of the 25th Annual Meeting of the American Association of Artificial Intelligence (AAAI-05), Pittsburgh, PA, July 2005.

Ying Zhang, Fei Huang and Stephan Vogel, Mining Translations of OOV Terms from the Web through Cross-lingual Query Expansion, in the Proceedings of the 28th Annual International ACM SIGIR, Salvador, Brazil, August 2005.

Fei Huang, Stephan Vogel and Alex Waibel, Towards Named Entity Extraction and Translation in Spoken Language Translation, in the Proceedings of the International Workshop on Spoken Language Translation, Kyoto, Japan, October 2004.

Fei Huang, Stephan Vogel and Alex Waibel, Improving Named Entity Translation Combining Phonetic and Semantic Similarities, in the Proceedings of the Human Language Technologies Conference (HLT/NAACL2004), Boston, USA, May 2004.

Fei Huang, S. Vogel and A. Waibel, Extracting NE Translingual Equivalence with Limited Resources, ACM Transactions on Asian Language Information Processing, 2(2), 124-129. 2003

Stephan Vogel, Ying Zhang, Fei Huang, Alicia Tribble, Ashish Venogupal, Bing Zhao, Alex Waibel, The CMU Statistical Translation System, in the Proceedings of MT Summit IX, New Orleans, LA, USA, September 2003.

Fei Huang, Stephan Vogel and Alex Waibel, Automatic Extraction of Named Entity Translingual Equivalence Based on Multi-feature Cost Minimization, in the Proceedings of the 41st Annual Conference of the Association for Computational Linguistics (ACL'03), Workshop on Multilingual and Mixed-language Named Entity Recognition, Sapporo, Japan, July 2003.

Fei Huang and Stephan Vogel, Improved Named Entity Translation and Bilingual Named Entity Extraction, in the Proceedings of the 2002 International Conference on Multimodal Interfaces (ICMI '02), Pittsburgh, USA, October 2002.

Fei Huang and Alex Waibel, An Adaptive Approach to Named Entity Extraction for Meeting Applications, in the Proceedings of the 2002 Human Language Technologies Conference (HLT'02), San Diego, USA, April 2002.

Fei Huang, Jie Yang and Alex Waibel, Dialogue Management for Multimodal User Registration, in the Proceedings of the 2000 International Conference on Spoken Language Processing (ICSLP'00), Beijing, China, October 2000.

Fei Huang, Language Model Adaptation for Chinese Speech Recognition, Master Thesis, Chinese Academy of Sciences, 1999.

Fei Huang and Bo Xu, Lexicon and Language Model Adaptation based on Domain Keywords, in the Proceedings of the Joint Conference on Chinese Information Processing, Beijing China, 1999.

PATENTS