Boliang Zhang
I am currently a machine learning engineer at Apple AI/ML. I completed my Ph.D. in 2019 at Rensselaer Polytechnic Institute, where I worked with Dr. Heng Ji. Previously, I worked at DiDi Labs.
I work on projects related to Natural Language Processing and Machine Learning. I am interested in building artificial intelligence systems to solve problems in machine translation, information extraction and conversational AI.
Contact:
Email: boliang_zhang@apple.com
News and Highlights
[01/10/2022] I joined Apple as a machine learning engineer to work on machine translation.
[04/20/2021] I gave a talk virtually at the VGS-IT seminar, Brno University of Technology: "End-to-End Task-oriented Dialog Agent Training and Human-Human Dialog Collection" (slides).
[02/08/2021] I gave a talk about our DSTC9 submission "A Hybrid Task-Oriented Dialog System with Domain and Task Adaptive Pretraining" at the DSTC9 workshop of AAAI 2021 (slides).
[10/20/2020] We tied for 1st place in the Multi-domain Task-oriented Dialog Challenge of DSTC9! (paper and code)
[07/01/2020] Our work on filtering noisy sentences from web-crawled parallel data has been published at ACL 2020. Check out our paper and data!
Publications
2021
Boliang Zhang*, Ying Lyu*, Ning Ding, Tianhao Shen, Zhaoyang Jia, Kun Han, Kevin Knight, A Hybrid Task-Oriented Dialog System with Domain and Task Adaptive Pretraining, AAAI 2021, DSTC9 workshop. [slides, code]
Arkady Arkhangorodsky, Amittai Axelrod, Christopher Chu, Scot Fang, Yiqi Huang, Ajay Nagesh, Xing Shi, Boliang Zhang, Kevin Knight, MEEP: An Open-Source Platform for Human-Human Dialog Collection and End-to-End Agent Training, arXiv. [code]
2020
Boliang Zhang, Ajay Nagesh, Kevin Knight. Parallel Corpus Filtering via Pre-trained Language Models, ACL 2020. [data]
2019
Boliang Zhang. Neural Name Tagging for Low-resource Languages, PhD Dissertation, Rensselaer Polytechnic Institute.
2018
Boliang Zhang, Spencer Whitehead, Lifu Huang and Heng Ji. Global Attention for Name Tagging, Proc. CoNLL, 2018
Boliang Zhang, Ying Lin, Xiaoman Pan, Di Lu, Jonathan May, Kevin Knight and Heng Ji. ELISA-EDL: A Cross-lingual Entity Extraction, Linking and Localization System, Proc. NAACL Demo Track, 2018
Qingyun Wang, Xiaoman Pan, Lifu Huang, Boliang Zhang, Zhiying Jiang, Heng Ji and Kevin Knight. Describing a Knowledge Base, Proc. of the 11th International Conference on Natural Language Generation (INLG 2018)
Lifu Huang, Kyunghyun Cho, Boliang Zhang, Heng Ji and Kevin Knight. Multi-lingual Common Semantic Space Construction via Cluster-Consistent Word Embedding, Proc. EMNLP, 2018
Ge Shi, Chong Feng, Lifu Huang, Boliang Zhang, Heng Ji, Lejian Liao and Heyan Huang, Genre Separation Network with Adversarial Training for Cross-genre Relation Extraction, Proc. EMNLP, 2018
Qingyun Wang, Zhihao Zhou, Lifu Huang, Spencer Whitehead, Boliang Zhang, Heng Ji and Kevin Knight, Paper Abstract Writing through Rewriting Mechanism, Proc. ACL 2018
Ying Lin, Cash Costello, Boliang Zhang, Di Lu, Heng Ji, James Mayfield and Paul McNamee. Platforms for Non-speakers Annotating Names in Any Language, Proc. ACL Demo Track, 2018 (Best Demo Paper nomination)
Zhiying Jiang, Lifu Huang, Boliang Zhang and Heng Ji. Chengyu Recommendation, Proc. NAACL Workshop on Innovative Use of NLP for Building Educational Applications, 2018
2017
Boliang Zhang, Xiaoman Pan, Ying Lin and Heng Ji. RPI BLENDER TAC-KBP2017 13 Languages EDL System, Proc. TAC, 2017
Boliang Zhang, Di Lu, Xiaoman Pan, Ying Lin, Halidanmu Abudukelimu, Heng Ji, Kevin Knight. Embracing Non-Traditional Linguistic Resources for Low-resource Language Name Tagging, Proc. IJCNLP, 2017
Xiaoman Pan, Boliang Zhang, Jonathan May, Joel Nothman, Kevin Knight and Heng Ji. Cross-lingual Name Tagging and Linking for 282 Languages, Proc. ACL, 2017
2016
Boliang Zhang, Xiaoman Pan, Tianlu Wang, Ashish Vaswani, Heng Ji, Kevin Knight, and Daniel Marcu. Name Tagging for Low-Resource Incident Languages Based on Expectation-Driven Learning, Proc. NAACL, 2016 (slides)
Dongxu Zhang, Boliang Zhang, Xiaoman Pan, Xiaocheng Feng, Heng Ji, Weiran Xu. Bitext Name Tagging for Cross-lingual Entity Annotation Projection, Proc. COLING, 2016
2015
Boliang Zhang, Hongzhao Huang, Xiaoman Pan, Sujian Li, Chin-Yew Lin, Heng Ji, Kevin Knight, Zhen Wen, Yizhou Sun, Jiawei Han and Bulent Yener. Context-aware Entity Morph Decoding. Proc. ACL, 2015 (poster)
2014
Boliang Zhang, Hongzhao Huang, Xiaoman Pan, Heng Ji, Zhen Wen, Yizhou Sun, Jiawei Han and Bulent Yener. Be Appropriate and Funny: Automatic Entity Morph Encoding. Proc. ACL, 2014 (slides)
Jin Guang Zheng, Daniel Howsmon, Boliang Zhang, Juergen Hahn, Deborah McGuinness, James Hendler, Heng Ji. Entity Linking for Biomedical Literature. Proc. CIKM Workshop on Data and Text Mining in Biomedical Informatics, 2014
System Description Papers:
Leon Cheung, Thamme Gowda, Ulf Hermjakob, Nelson Liu, Jonathan May, Alexandra Mayn, Nima Pourdamghani, Michael Pust, Kevin Knight, Nikolaos Malandrakis, Pavlos Papadopoulos, Anil Ramakrishna, Karan Singla, Victor Martinez, Colin Vaz, Dogan Can, Shrikanth Narayanan, Kenton Murray, Toan Nguyen, David Chiang, Xiaoman Pan, Boliang Zhang, Ying Lin, Di Lu, Lifu Huang, Kevin Blissett, Tongtao Zhang, Heng Ji, Ondrej Glembek, Murali Karthick Baskar, Santosh Kesiraju, Lukas Burget, Karel Benes, Igor Szoke, Karel Vesely, Jan "Honza" Cernocky, Camille Goudeseune, Mark Hasegawa Johnson, Leda Sari, Wenda Chen and Angli Liu, ELISA System Description for LoReHLT 2017. Proc. LoReHLT, 2017
Mohamed Al-Badrashiny, Jason Bolton, Arun Tejasvi Chaganty, Kevin Clark, Craig Harman, Lifu Huang, Matthew Lamm, Jinhao Lei, Di Lu, Xiaoman Pan, Ashwin Paranjape, Ellie Pavlick, Haoruo Peng, Peng Qi, Pushpendre Rastogi, Abigail See, Kai Sun, Max Thomas, Chen-Tse Tsai, Hao Wu, Boliang Zhang, Chris Callison-Burch, Claire Cardie, Heng Ji, Christopher Manning, Smaranda Muresan, Owen C. Rambow, Dan Roth, Mark Sammons, Benjamin Van Durme, TinkerBell: Cross-lingual Cold-Start Knowledge Base Construction, Proc. TAC, 2017
Heng Ji, Xiaoman Pan, Boliang Zhang, Joel Nothman, James Mayfield, Paul McNamee and Cash Costello, Overview of TAC-KBP2017 13 Languages Entity Discovery and Linking, Proc. TAC, 2017
Pavlos Papadopoulos, Ruchir Travadi, Colin Vaz, Nikolaos Malandrakis, Ulf Hermjakob, Nima Pourdamghani, Michael Pust, Boliang Zhang, Xiaoman Pan, Di Lu, Ying Lin, Ondrej Glembek, Murali Karthick B, Martin Karafiat, Lukas Burget, Mark Hasegawa-Johnson, Heng Ji, Jonathan May, Kevin Knight and Shrikanth Narayanan, Team ELISA System for DARPA LORELEI Speech Evaluation 2016, Proc. Interspeech 2017
Ulf Hermjakob, Qiang Li, Daniel Marcu, Jonathan May, Sebastian J. Mielke, Nima Pourdamghani, Michael Pust, Xing Shi, Kevin Knight, Tomer Levinboim, Kenton Murray, David Chiang, Boliang Zhang, Xiaoman Pan, Di Lu, Ying Lin and Heng Ji, Incident-Driven Machine Translation and Name Tagging for Low-resource Languages, Journal Machine Translation, pp 1-31
Dian Yu, Xiaoman Pan, Boliang Zhang, Lifu Huang, Di Lu, Spencer Whitehead and Heng Ji, RPI_BLENDER TAC-KBP2016 System Description, Proc. TAC, 2016
Yu Hong, Xiaobin Wang, Yadong Chen, Jian Wang, Tongtao Zhang, Jin Zheng, Dian Yu, Qi Li, Boliang Zhang, Han Wang, Xiaoman Pan, Heng Ji, RPI BLENDER TAC-KBP2014 Knowledge Base Population System, Proc. TAC, 2014
Services
Conference Area Chair / Senior Program Committee
IJCAI (2021)
Conference Program Committee
COLING (2018-present), NAACL (2018-present), AAAI (2019-present), EMNLP (2019-2020), ACL (2020-present), LREC (2019, 2020), IJCAI (2020-present)
Journal Reviewer
Information Journal
Transactions on Audio, Speech and Language
Neural Computing