I am currently a third-year PhD student at Rensselaer Polytechnic Institute (RPI).
Generally, my research interests lie in Natural Language Processing (NLP) and statistical machine learning, especially Information Extraction and text mining. My thesis topic is "Event Extraction in Social Media".
I have conducted research on several NLP tasks, including Event Extraction, Open Domain Event Discovery, Document Retrieval and Sentiment Analysis: how to identify (first) important event information in news and tweets, how to retrieve similar documents in big data, and how to identify user sentiment in online reviews, tweets and forums. I have experience tackling NLP tasks across different genres, such as news, tweets, forums, clinical documents and product descriptions.
 Yulia Tyshchuk, Hao Li, Heng Ji and William A. Wallace. 2013.
In the Proceedings of ASONAM, 2013, Niagara Falls, Canada.
 Weiwei Guo, Hao Li, Heng Ji and Mona Diab. 2013.
In the Proceedings of ACL, 2013, Sofia, Bulgaria.
 Hongbo Deng, Jiawei Han, Hao Li, Heng Ji, Hongning Wang and Yue Lu. 2013.
Exploring and Inferring User-User Pseudo-Friendship for Sentiment Analysis with Heterogeneous Networks
In the Proceedings of the SIAM International Conference on Data Mining (SDM), 2013, Austin, Texas.
(Selected for the Statistical Analysis and Data Mining special issue “Best of SDM 2013”!)
 Hao Li, Yu Chen, Heng Ji, Smaranda Muresan and Dequan Zheng. 2012.
In the Proceedings of PACLIC, 2012, Bali, Indonesia.
(Best Paper Nomination!)
 Xiang Li, Heng Ji, Faisal Farooq, Hao Li, Wen-Pin Lin and Shipeng Yu. 2012.
Invited Paper for the International Journal On Advances in Intelligent Systems, vol. 5, no. 3&4, 2012.
 Hao Li, Heng Ji, Hongbo Deng and Jiawei Han. 2011.
Exploiting Background Information Networks to Enhance Bilingual Event Extraction Through Topic Modeling
In the Proceedings of the International Conference on Advances in Information Mining and Management (IMMM), 2011, Barcelona, Spain.
 Matthew Snover, Xiang Li, Wen-Pin Lin, Zheng Chen, Suzanne Tamang, Mingmin Ge, Adam Lee, Qi Li, Hao Li, Sam Anzaroot, Heng Ji. 2011.
In the Proceedings of the ACL 2011 Workshop on Building and Using Comparable Corpora.
 Hao Li, Xiang Li, Heng Ji and Yuval Marton. 2010.
In the Proceedings of PACLIC, 2010, Sendai, Japan.
 Ning Hua Zhu, Bang Hong Zhang, Ji Min Wen, Hao Li, Liang Li, Wei Chen, Yan Zhang and Liang Xie. 2008.
In IEEE Photonics Technology Letters, vol. 20, issue 2, pages 138-140, 2008.
Rakuten Institute of Technology, New York
Summer Research Internship May 2013 - Aug. 2013
Mentors: Dr. Masato Hagiwara and Dr. Satoshi Sekine
Unsupervised product attribute extraction of Taiwanese e-commerce data.
Siemens Medical Solutions,
Summer Research Internship June 2011 - Aug. 2011
Mentors: Dr. Faisal Farooq, Dr. Shipeng Yu and Dr. Balaji Krishnapuram
1. Named Entity Recognition for clinical text, detecting several entity types in the medical domain: allergy, symptom, medication, diagnosis and so on.
2. Classification of medications in clinical text into classes based on the time period in which they were taken.
Program Committee Member for Edited Book from ASONAM 2013
Program Committee Member for IMMM 2013
Program Committee Member for IMMM 2012
Reviewer for Edited Book from ASONAM 2013
Reviewer for IMMM 2013
Secondary Reviewer for ASONAM 2013
Reviewer for IMMM 2012
Secondary Reviewer for IJCNLP 2011
Ph.D. Computer Science, Rensselaer Polytechnic Institute (RPI), Aug. 2013 - Present
Ph.D. Computer Science, The Graduate Center, CUNY, Jan. 2011 - July 2013
M.S. Computer Science, Columbia University in the City of New York, Aug. 2008 - May 2010
B.E. Electronic Engineering, Beijing University of Posts and Telecommunications, Sep. 2003 - July 2007