Dr. Fei Huang is a Senior Director and Principal Research Scientist of Language Technology Lab, Alibaba DAMO Academy. He leads NLP foundation and machine translation teams, which developed AliNLP platform in AliCloud. The platform supports several hundreds internal and external clients with advanced NLP models, systems and solutions. Before Alibaba, Dr. Huang led the development of machine translation systems at Facebook AML which helps reduce language barriers for 20 Billion people. He was a senior researcher as IBM Watson after gradating from CMU with Ph.D. focusing on language technologies.
Dr. Huang has published 30+ papers and 20+ patents on machine translation, multilingual natural language processing. He has served as Area Chairs at ACL-IJCNLP 2015, NLPCC 2018, Senior PC member of IJCAI/AAAI.
CARNEGIE MELLON UNIVERSITY
LANGUAGE TECHNOLOGY INSTITUTE, SCHOOL OF COMPUTER SCIENCE
Ph.D. in Language and Information Technologies
CARNEGIE MELLON UNIVERSITY
LANGUAGE TECHNOLOGY INSTITUTE, SCHOOL OF COMPUTER SCIENCE
M.S. in Language Technologies
CHINESE ACADEMY OF SCIENCES
M.S. in Pattern Recognition and Intelligent System
B.E. (major) in Electrical Engineering and Automation with High Honors
B.A. (minor) in English for Science and Technology
Senior Director and Principal Research Scientist of Language Technology Lab, DAMO Academy
"Make Language Local, Make Business Global"
Lead multilingual NLP team that develops technologies processing 500 Billion sentences per day, focusing on multimodal translation (informal text, speech, image and video) to make it easy to do business everywhere!
Senior Tech Leader/Staff Research Scientist in Machine Translation, Applied Machine Learning and Places Data, Local
"Connect the world in everyone's language"
Reduce language barriers for 20 Billion people, building machine translation systems among 45+ languages
IBM T. J. Watson Research Center 2006-2014
Senior Research Staff Member in Statistical Machine Translation
"Make Watson multilingual"
GALE, nFluent, research in machine translation, information extraction and statistical NLP technologies
Chuanqi Tan, Wei Qiu, Mosha Chen, Rui Wang, Fei Huang, Boundary Enhanced Neural Span Classification for Nested Named Entity Recognition, AAAI2020
Haiyun Peng, Lu Xu, Lidong Bing, Fei Huang, Wei Lu, Luo Si, Knowing What, How and Why: A Near Complete Solution for Aspect-Based Sentiment Analysis, AAAI2020
Nguyen Bach, Fei Huang Noisy BiLSTM-Based Models for Disfluency Detection, Interspeech 2019
Nguyen Bach, Hongjie Chen, Kai Fan, Cheung-Chi Leung, Bo Li, Chongjia Ni, Rong Tong, Pei Zhang, Boxing Chen, Bin Ma, Fei Huang, Alibaba Speech Translation Systems for IWSLT 2018, IWSLT 2018
Yuanhang Su, Kai Fan, Nguyen Bach, C-C Jay Kuo, Fei Huang, Unsupervised multi-modal neural machine translation, CVPR 2018
Chen Li, Zhongyu Wei, Yang Liu, Yang Jin, Fei Huang Using Facebook Public Posts to Enhance Trending News Summarization, In the Proceedings of the COLINGL-2016, Osaka, Japan, 2016.
Boxing Chen, Roland Kuhn, George Foster, Colin Cherry, Fei Huang Bilingual Methods for Adaptive Training Data Selection for Machine Translation, In the Proceedings of AMTA 2016
Boxing Chen and Fei Huang Semi-supervised Convolutional Networks for Translation Adaptation with Tiny Amount of In-domain Data, In the Proceedings of the CoNLL-2016, Berlin, Germany, 2016.
Fei Huang. Improved Arabic Dialect Classification with Social Media Data, In the Proceedings of EMNLP 2015, Lisbon, Portugal, 2015.
Fei Huang, Jian-Ming Xu, Abraham Ittycheriah, Salim Roukos, Adaptive HTER Estimation for Document-Specific MT Post-Editing, In the Proceedings of the ACL 2014, Baltimore, MD, USA, 2014.
Fei Huang and Cezar Pendus, Generalized Reordering Rules for Improved SMT, In the Proceedings of the ACL-2013, Bulgaria, August, 2013.
Qi Li, Haibo Li, Heng Ji, Wen Wang, Jing Zheng, Fei Huang, Joint Bilingual Name Tagging for Parallel Corpora, Proceedings of the 21st ACM international conference on Information and knowledge management (CIKM) pp1727-1731, Hawaii, ISA, Oct. 2012.
Nguyen Bach, Fei Huang and Yaser Al-Onaizan, Goodness: A Method for Measuring Machine Translation Confidence, to appear In the Proceedings of the ACL-HLT 2011, Oregon, USA, June 2011.
Fei Huang and Bing Xiang Feature-Rich Phrase Rescoring for SMT, In the Proceedings of the COLING 2010, Beijing, September 2010.
Araham Ittycheriah, Fei Huang, Salim Roukos and Abhishek Arun, Improved Word Alignment Algorithms for Arabic-English and Chinese-English, book chapter in the book GALE “MT from Text”. 2009
Fei Huang, Confidence Measure for Word Alignment, In the Proceedings of the ACL-IJCNLP 2009, Singapore, July 2009.
Fei Huang, Ahmad Emami and Imed Zitouni, When Harry Mey Harri: Cross-lingual Name Spelling Normalization Hierarchical System Combination for Machine Translation, In the Proceedings of the EMNLP-2008, Hawaii, USA, October 2008.
Fei Huang and Kishore Papineni, Hierarchical System Combination for Machine Translation, In the Proceedings of the EMNLP-CoNLL 2007, Prague, Czech Repiblic, July 2007.
Fei Huang, Multilingual Named Entity Extraction and Translation from Text and Speech, Ph.D. Thesis, Carnegie Mellon University, 2005.
Fei Huang, Cluster-specific Name Transliteration, In the Proceedings of the HLT-EMNLP 2005, Vancouver, BC, Canada, October 2005.
Fei Huang, Ying Zhang and Stephan Vogel, Mining Key Phrase Translations from Web Corpora, In the Proceedings of the HLT-EMNLP 2005, Vancouver, BC, Canada, October 2005.
Fei Huang, Stephan Vogel and Alex Waibel, Clustering and Classifying Names by Origins, in the Proceedings of the 25th Annual Meeting of the American Association of Artificial Intelligence (AAAI-05), Pittsburgh, PA, July 2005.
Ying Zhang, Fei Huang and Stephan Vogel, Mining Translations of OOV Terms from the Web through Cross-lingual Query Expansion, in the Proceedings of the 28th Annual International ACM SIGIR, Salvador, Brazil, August 2005.
Fei Huang, Stephan Vogel and Alex Waibel, Towards Named Entity Extraction and Translation in Spoken Language Translation, in the Proceedings of the International Workshop on Spoken Language Translation, Kyoto, Japan, October 2004.
Fei Huang, Stephan Vogel and Alex Waibel, Improving Named Entity Translation Combining Phonetic and Semantic Similarities, in the Proceedings of the Human Language Technologies Conference (HLT/NAACL2004), Boston, USA, May 2004.
Fei Huang, S. Vogel and A. Waibel, Extracting NE Translingual Equivalence with Limited Resources, ACM Transactions on Asian Language Information Processing, 2(2), 124-129. 2003
Stephan Vogel, Ying Zhang, Fei Huang, Alicia Tribble, Ashish Venogupal, Bing Zhao, Alex Waibel, The CMU Statistical Translation System, in the Proceedings of MT Summit IX, New Orleans, LA, USA, September 2003.
Fei Huang, Stephan Vogel and Alex Waibel, Automatic Extraction of Named Entity Translingual Equivalence Based on Multi-feature Cost Minimization, in the Proceedings of the 41st Annual Conference of the Association for Computational Linguistics (ACL'03), Workshop on Multilingual and Mixed-language Named Entity Recognition, Sapporo, Japan, July 2003.
Fei Huang and Stephan Vogel, Improved Named Entity Translation and Bilingual Named Entity Extraction, in the Proceedings of the 2002 International Conference on Multimodal Interfaces (ICMI '02), Pittsburgh, USA, October 2002.
Fei Huang and Alex Waibel, An Adaptive Approach to Named Entity Extraction for Meeting Applications, in the Proceedings of the 2002 Human Language Technologies Conference (HLT'02), San Diego, USA, April 2002.
Fei Huang, Jie Yang and Alex Waibel, Dialogue Management for Multimodal User Registration, in the Proceedings of the 2000 International Conference on Spoken Language Processing (ICSLP'00), Beijing, China, October 2000.
Fei Huang, Language Model Adaptation for Chinese Speech Recognition, Master Thesis, Chinese Academy of Sciences, 1999.
Fei Huang and Bo Xu, Lexicon and Language Model Adaptation based on Domain Keywords, in the Proceedings of the Joint Conference on Chinese Information Processing, Beijing China, 1999.
- Machine translation output reranking Patent number: 10067936
- Analyzing language dependency structures. Patent number: 9830404
- Universal translation. Patent number: 10346537. Patent number: 9734142
- Determining trending topics in social media Patent number: 9830386
- OPTIMIZING MACHINE TRANSLATIONS FOR USER ENGAGEMENT Publication number: 20170371868
- Translation confidence scores Patent number: 10133738
- Language independent representations Patent number: 9990361
- Optimizing machine translations for user engagement Patent number: 10114819
- Machine learning dialect identification Patent number: 9477652
- Machine learning dialect identification Patent number: 10410625
- Identifying multiple languages in a content item Patent number: 10180935
- Predicting future translations Patent number: 9747283
- Machine learning dialect identification Patent number: 9899020
- User feedback for low-confidence translations Patent number: 9922029
- Predicting future translations Patent number: 9805029
- Predicting future translations Patent number: 10289681