Wei Gao - Senior Lecturer, Victoria University of Wellington


Contact

Room 422, Rutherford House, 23 Lambton Quay, Wellington 6140, New Zealand
School of Information Management
Victoria University of Wellington
PO Box 600
Wellington

Working E-mail: wei.gao@vuw.ac.nz
Personal E-mail: wgao.qcri@gmail.com
Tel.: +64 463 7437


Short Bio

I received Ph.D. of Information Systems in 2010 from the Department of Systems Engineering and Engineering Management, the Chinese University of Hong Kong. I am currently a Senior Lecturer in the School of Information Management, Victoria University of Wellington in New Zealand. I have been affiliated with Qatar Computing Research Institute as a scientist since September 2011 until May 2017. Prior to this appointment, I held positions as research assistant professor in the Chinese University of Hong Kong during August 2010-August 2011 and research fellow at the Institute for Infocomm Research, A*STAR in Singapore during May-August 2010. Early on I was working as an intern in Microsoft Research Asia (with Natural Language Computing Group) for about 15 months starting from July 2006. Before my graduate studies, I had experience in software industry in China, working as software engineer in TRS Information Technology Co. Ltd. and Institute of Computer Science & Technology of Peking University.



Research Interests
  • Information Retrieval and Web Search
  • Social Media Analytics
  • Natural Language Processing
  • Artificial Intelligence, Machine Learning 

Academic Services
  • Program Committee Member: IJCAI 2016, SIGIR 2011-2017, CIKM 2017, WSDM 2015/2017, ACL 2012/2014-2017, EMNLP 2010(best reviewer award)/2011/2013-2017, ASONAM 2015-2017, CoNLL 2015, RANLP 2013/2015, IJCNLP 2013, AIRS 2012/2013, PACLIC 25/26/28/29/30, NLPCC 2015/2017, CCL 2016, BigComp 2016-2017, SocialNLP 2016, SocialSec 2016
  • Journal Reviewer:
    • ACM Transactions on Knowledge Discovery from Data (TKDD)
    • ACM Transactions on Information Systems (TOIS)
    • Computational Linguistics (CL)
    • Information Systems (IS)
    • ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP)
    • ACM Transactions on the Web (TWEB)
    • ACM Transactions on Intelligent Systems and Technology (TIST)
    • ACM Computing Surveys (CSUR)
    • IEEE Transactions on Knowledge and Data Engineering (TKDE)
    • IEEE Transactions on Computer (TOC)
    • IEEE Intelligent Systems (IS)
    • IEEE Transactions on Information Forensics and Security (TIFS)
    • Knowledge and Information Systems (KAIS)
    • Data Mining and Knowledge Discovery (DMKD)
    • Data and Knowledge Engineering (DKE)
    • AI Communications
    • Network Science
    • International Journal of Human-Computer Studies
    • Language Resources and Evaluation (LRE)
    • International Journal of Asian Language Processing (IJALP)
    • Frontier of Computer Science
    • Frontier of Information Technology & Electronic Engineering
  • External ReviewerACL 2011, EMNLP 2010, ACM Transactions on Asian Language Information Processing, Information Processing & Management, Language Resources and Evaluation, PACLIC 23, CIKM 2009, CIKM 2008, SIGIR 2006, AIRS 2006, ACL 2005, CIKM 2005, SIGMOD 2004, AIRS 2004
  • Workshop Co-chair: BigComp 2016
  • Session Chair: ASONAM 2015 (Wikipedia and Collaboration)
  • Area Co-chair: NLPCC 2015
  • Tutorial Co-chair: IJCNLP 2011

Publications

2017

  • From Retweet to Believability: Utilizing Trust to Identify Rumor Spreaders on Twitter
    Bhavtosh Rath, Wei Gao, Jing Ma, and Jaideep Srivastava
    ASONAM 2017: The 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, August 2017, Sydney, Australia

  • Recommendation vs Sentiment Analysis: A Text-Driven Latent Factor Model for Rating Prediction with Cold-Start Awareness [pdf]
    Kaisong Song, Wei Gao, Shi Feng, Daling Wang, Kam-Fai Wong, and Chengqi Zhang
    IJCAI 2017: The 26th International Joint Conference on Artificial Intelligence, August 2017, Melbourne, Australia

  • Detect Rumors in Microblog Posts Using Propagation Structure via Kernel Learning [pdfdataset]
    Jing Ma, Wei Gao, and Kam-Fai Wong
    ACL 2017: The 55th Annual Meeting of the Association for Computational Linguistics, July 30-August 4, Vancouver, Canada

  • Social Media Content Analysis: NLP and Beyond  (forthcoming book)
    Kam-Fai Wong, Wei Gao, Wenjie Li, and Ruifeng Xu
    World Scientific Publishing

2016

  • Topic Extraction from Microblog Posts Using Conversation Structures [pdf, dataset]
    Jing Li, Ming Liao, Wei Gao, Yulan He, and Kam-Fai Wong
    ACL 2016: The 54th Annual Meeting of the Association for Computational Linguistics, August 2016, Berlin, Germany

  • Detecting Rumors from Microblogs with Recurrent Neural Networks [pdf, dataset]
    Jing Ma, Wei Gao, Prasenjit Mitra, Sejeong Kwon, Bernard J. Jansen, Kam-Fai Wong, and Meeyoung Cha
    IJCAI 2016: The 25th International Joint Conference on Artificial Intelligence, July 2016, New York, USA

  • Ordinal Text Quantification [pdfDOIdataset/code]
    Giovanni Da San Martino, Wei Gao, and Fabrizio Sebastiani
    SIGIR 2016The 39th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2016, Pisa, Italy

  • Build Emotion Lexicon from the Mood of Crowd via Topic-Assisted Joint Non-negative Matrix Factorization [pdfDOIdataset]
    Kaisong Song, Wei Gao, Ling Chen, Shi Feng, Daling Wang, and Chengqi Zhang
    SIGIR 2016
    The 39th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2016, Pisa, Italy

  • QCRI at SemEval-2016 Task 4: Probabilistic Methods for Binary and Ordinal Quantification [pdf, dataset/code]
    Giovanni Da San Martino, Wei Gao, and Fabrizio Sebastiani
    SemEval 2016The 10th International Workshop on Semantic Evaluation, June 2016, San Diego, California, USA (1st place in sub-task E of Sentiment Analysis in Twitter)

  • From Classification to Quantification in Tweet Sentiment Analysis [pdfDOIdataset, code]
    Wei Gao and Fabrizio Sebastiani
    SNAM: Social Network Analysis and Mining, Volume:6, Issue:1, Article 19, 2016, Springer

  • PerSentiment: A Personalized Sentiment Classification System for Microblog Users [pdfDOIsytem demo]
    Kaisong Song, Ling Chen, Wei Gao, Shi Feng, Daling Wang, and Chengqi Zhang
    WWW 2016 (demo): The 25th International World Wide Web Conference, April 2016, Montreal, Canada

2015

  • Using Content-level Structures for Summarizing Microblog Repost Trees [pdf, dataset]
    Jing Li, Wei Gao, Zhongyu Wei, Baolin Peng, and Kam-Fai Wong
    EMNLP 2015: The 2015 Conference on Empirical Methods in Natural Language Processing, September 2015, Lisboa, Portugal

  • Detect Rumors Using Time Series of Social Context Information on Microblogging Websites [pdf, DOI]
    Jing Ma, Wei Gao, Zhongyu Wei, Yueming Lu, and Kam-Fai Wong
    CIKM 2015: The 24th ACM International Conference on Information and Knowledge Management, October 2015, Melbourne, Australia

  • Tweet Sentiment: From Classification to Quantification [pdf, DOIdataset]
    Wei Gao and Fabrizio Sebastiani
    ASONAM 2015: The 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, August 2015, Paris, France (Best Paper Runner-Up)

  • Using Tweets to Help Sentence Compression for News Highlights Generation [pdf]
    Zhongyu Wei, Yang Liu, Chen Li, and Wei Gao
    ACL-IJCNLP 2015: The 53rd Annual Meeting of the Association for Computational Linguistics, July 2015, Beijing, China

  • Build Emotion Lexicon from Microblogs by Combining Effects of Seed Words and Emoticons in a Heterogeneous Graph  [pdf, DOIdataset]
    Kaisong Song, Shi Feng, Wei Gao, Daling Wang, Ling Chen, and Chengqi Zhang
    Hypertext 2015: The 26th ACM Conference on Hypertext and Social Media, September 2015, Cyprus

  • Personalized Sentiment Classification Based on Latent Individuality of Microblog Users   [pdf]
    Kaisong Song, Shi Feng, Wei Gao, Daling Wang, Ge Yu, and Kam-Fai Wong
    IJCAI 2015: The 24th International Joint Conference on Artificial Intelligence, July 2015, Buenos Aires, Argentina

  • Gibberish, Assistant, or Master? Using Tweets Linking to News for Extractive Single-Document Summarization   [pdf, DOI, datasetposter]
    Zhongyu Wei and Wei Gao
    SIGIR 2015: The 38th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, August 2015, Santiago, Chile

  • QCRI: Answer Selection for Community Question Answering - Experiment for Arabic and English   [pdf]
    Massimo Nicosia, Simone Filice, Alberto Barron-Cedeno, Iman Saleh, Hamdy Mubarak, Wei Gao, Preslav Nakov, Giovanni Da San Martino, Alessandro Moschitti, Kareem Darwish, Lluis Marquz, Shafiq Joty and Walidy Magdy
    SemEval 2015: The 9th International Workshop on Semantic Evaluation, May 2015, Denver, Colorado, USA 

2014

  • QCRI at TREC 2014: Applying the KISS Principle for the TTG Task in the Microblog Track   [pdf]
    Walid Magdy, Wei Gao, Tarek El-Ganainy, and Zhongyu Wei
    TREC 2014:
    The 23rd Text REtrieval Conference, November 2014, Gaithersburg, Maryland, USA (Microblog track, 2nd place among 13 groups)

  • Utilizing Microblogs for Automatic News Highlights Extraction   [pdf, dataset]
    Zhongyu Wei and Wei Gao
    COLING 2014: The 25th International Conference on Computational Linguistics, August 2014, Dublin, Ireland

  • Detecting Semantic Uncertainty by Learning Hedge Cues in Sentences using an HMM   [pdf]
    Xiujun Li, Wei Gao, and Jude Shavlik
    SMIR 2014: SIGIR 2014 Workshop on Semantic Matching in Information Retrieval, July 2014, Gold Coast, Australia

  • Ranking Model Selection and Fusion for Effective Microblog Search   [pdf, DOI]
    Zhongyu Wei, Wei Gao, Tarek El-Ganainy, Walid Magdy, and Kam-Fai Wong
    SoMeRA 2014: SIGIR 2014 Workshop on Social Media Retrieval and Analysis, July 2014, Gold Coast, Australia

  • Simple Effective Microblog Named Entity Recognition: Arabic as an Example   [pdf]
    Kareem Darwish and Wei Gao
    LREC 2014: The 9th Language Resources and Evaluation Conference, May 2014, Reykjavik, Iceland

  • Information-Theoretic Multi-view Domain Adaptation: A Theoretical and Empirical Study   [pdf, DOIerrata]
    Pei Yang and Wei Gao
    JAIR: Journal of Artificial Intelligence Research, 49(2014):201-525

  • Democracy is Good for Ranking: Towards Multi-view Rank Learning and Adaptation in Web Search   [pdf, DOIslides]
    Wei Gao and Pei Yang
    WSDM 2014: The 7th ACM International Conference on Web Search and Data Mining, February 2014, New York, USA

    2013

    • QCRI at TREC 2013 Microblog Track   [pdfslides]
      Tarek El-Ganainy, Zhongyu Wei, Walid Magdy, and Wei Gao
      TREC 2013: The 22nd Text REtrieval Conference, November 2013, Gaithersburg, Maryland, USA (Microblog track, 2nd place among 65 automatic runs)

    • A Link-Bridged Topic Model for Cross-Domain Document Classification  [pdf, DOI]
      Pei Yang, Wei Gao, Qi Tan, and Kam-Fai Wong
      IP&M: Information Processing and Management, 49(6):1181-1193, 2013, Elsevier

    • An Empirical Study on Uncertainty Identification in Social Media Context   [pdf]
      Zhongyu Wei, Junwen Chen, Wei Gao, Binyang Li, Lanjun Zhou, Yulan He, and Kam-Fai Wong
      ACL 2013: The 51st Annual Meeting of the Association for Computational Linguistics, August 2013, Sofia, Bulgaria

    • Multi-view Discriminant Transfer Learning   [pdf]
      Pei Yang and Wei Gao
      IJCAI 2013: The 23rd International Joint Conference on Artificial Intelligence, August 2013, Beijing, China

    • Mainstream Media Behavior Analysis on Twitter: A Case Study on UK General Election   [pdf, DOI]
      Zhongyu Wei, Yulan He, Wei Gao, Binyang Li, Lanjun Zhou, and Kam-Fai Wong
      Hypertext 2013: The 24th ACM Conference on Hypertext and Social Media, May 2013, Paris, France

    • A Self-Training Framework for Automatic Identification of Exploratory Dialogue
      Zhongyu Wei, Yulan He, Simon Shum, Rebecca Ferguson, Wei Gao, and Kam-Fai Wong
      CICLing 2013: The 14th International Conference on Intelligent Text Processing and Computational Linguistics, March 2013, Samos, Greece

    • Dynamic Joint Sentiment-Topic Model   [pdf, DOI]
      Yulan He, Chenghua Lin, Wei Gao, and Kam-Fai Wong
      TIST: ACM Transactions on Intelligent Systems and Technology -- Special Issue: Social Web Mining, Volume 5, Issue 1, Article 6, 2013

    2012

    • Cross-lingual Identification of Ambiguous Discourse Connectives for Resource-Poor Language   [pdf]
      Lanjun Zhou, Wei Gao, Binyang Li, Zhongyu Wei, and Kam-Fai Wong
      COLING 2012: The 24th International Conference on Computational Linguistics, December 2012, Bombay, India 

    • Microblog Search and Filtering with Time Sensitive Feedback and Thresholding Based on BM25   [pdf]
      Wei Gao, Zhongyu Wei, and Kam-Fai Wong
      TREC 2012: The 21st Text REtrieval Conference, November 2012, Geithersburg, Maryland, USA (Microblog track)

    • Joint Topic Modeling for Event Summarization across News and Social Media Streams   [pdf, DOIslides]
      Wei Gao, Peng Li, and Kareem Darwish
      CIKM 2012: The 21st ACM Conference on Information and Knowledge Management, October 2012, Maui, Hawaii, USA

    • Information-Theoretic Multi-view Domain Adaptation   [pdf]
      Pei Yang, Wei Gao, Qi Tan, and Kam-Fai Wong
      ACL 2012: The 50th Annual Meeting of the Association for Computational Linguistics, July 2012, Jeju Island, Korea

    • Tracking Sentiment and Topic Dynamics from Social Media   [pdf]
      Yulan He, Chenghua Lin, Wei Gao, and Kam-Fai Wong
      ICWSM 2012: The 6th International AAAI Conference on Weblogs and Social Media, June 2012, Dublin, Ireland

    2011

    • Exploring Tweets Normalization and Query Time Sensitivity for Twitter Search   [pdf]
      Zhongyu Wei, Wei Gao, Lanjun Zhou, Binyang Li, and Kam-Fai Wong
      TREC 2011: The 20th Text REtrieval Conference, November 2012, Gaithersburg, Maryland, USA (Microblog Track)

    • An Effective Approach for Topic-Specific Opinion Summarization   [pdf, DOI]
      Binyang Li, Lanjun Zhou, Wei Gao, Kam-Fai Wong and Zhongyu Wei
      AIRS 2011: The 7th Asian Information Retrieval Societies Conference, December 2011, Dubai, UAE

    • Generating Aspect-oriented Multi-Document Summarization with Event-aspect Model   [pdf]
      Peng Li, Yinglin Wang, Wei Gao, and Jing Jiang
      EMNLP 2011: The 2011 Conference on Empirical Methods in Natural Language Processing, July 2011, Edinburgh, Scotland, UK

    • Unsupervised Discovery of Discourse Relations for Eliminating Intra-Sentence Polarity Ambiguities   [pdf]
      Lanjun Zhou, Binyang Li, Wei Gao, Zhongyu Wei, and Kam-Fai Wong
      EMNLP 2011: The 2011 Conference on Empirical Methods in Natural Language Processing, July 2011, Edinburgh, Scotland, UK
    • Relevant Knowledge Helps in Choosing Right Teacher: Active Query Selection for Ranking Adaptation   [pdf, DOIslides]
      Peng Cai, Wei Gao, Aoying Zhou, and Kam-Fai Wong
      SIGIR 2011: The 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2011, Beijing, China
    • Query Weighting for Ranking Model Adaptation   [pdfslides]
      Peng Cai, Wei Gao, Aoying Zhou, and Kam-Fai Wong
      ACL-HLT 2011: The 49th Annual Meeting of the Association for Computational Linguistics, June 2011, Portland, Oregon, USA
    • Weight-based Boosting Model for Cross-Domain Relevance Ranking Adaptation   [pdf, DOIslides]
      Peng Cai, Wei Gao, Kam-Fai Wong, and Aoying Zhou
      ECIR 2011: The 33rd European Conference on Information Retrieval, April 2011, Dublin, Ireland

    2010

    • Extracting Common Emotions from Blogs Based on Fine-grained Sentiment Clustering   [pdf, DOI]
      Shi Feng, Daling Wang, Ge Yu, Wei Gao, and Kam-Fai Wong
      KAIS: Knowledge and Information Systems, 27(2):281-302, 2010
    • Learning to Rank Only Using Training Data from Related Domain   [pdf, DOIslides]
      Wei Gao, Peng Cai, Kam-Fai Wong, and Aoying Zhou
      SIGIR 2010: The 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2010, Geneva, Switzerland
    • Exploiting Query Logs for Cross-Lingual Query Suggestions   [pdf, DOI]
      Wei Gao, Cheng Niu, Jian-Yun Nie, Ming Zhou, Kam-Fai Wong, and Hsiao-Wuen Hon
      TOIS: ACM Transactions on Information Systems, Volume 28, No.2, Article 6, 2010
       

    2009

    • Cross-Language Mining and Retrieval   [DOI]
      Wei Gao and Cheng Niu
      In Ling Liu and Tamer Özsu (Eds.) "Encyclopedia of Database Systems", 523-528, Springer US 2009, ISBN 978-0-387-35544-3,978-0-387-39940-9 
    • Exploiting Bilingual Information to Improve (Monolingual) Web Search   [pdfslides]
      Wei Gao, John Blitzer, Ming Zhou, and Kam-Fai Wong
      ACL-IJCNLP 2009: The 47th Annual Meeting of the Association for Computational Linguistics, August 2009, Singapore
    • Joint Ranking for Multilingual Web Search   [pdf, DOIslides]
      Wei Gao, Cheng Niu, Ming Zhou, and Kam-Fai Wong
      ECIR 2009: The 31st European Conference on Information Retrieval, April 2009, Toulouse, France (Best Student Paper)

    2008

    • Using English Information in Non-English Web Search   [pdf, DOI]
      Wei Gao, John Blitzer, and Ming Zhou
      iNEWS 2008: ACM International Workshop on Improving Non-English Web Searching, October 2008, Nappa Valley, California, USA 

    2007

    • Cross-Lingual Query Suggestion Using Query Logs of Different Languages   [pdf, DOIslides]
      Wei Gao, Cheng Niu, Jian-Yun Nie, Ming Zhou, Jian Hu, Kam-Fai Wong, and Hsiao-Wuen Hon
      SIGIR 2007: The 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, July 2007, Amsterdam, Netherland

    2006
    • Experimental Studies Using Statistical Algorithms on Transliterating Phoneme Sequences for English-Chinese Name Translation  [DOI]
      Wei Gao and Kam-Fai Wong
      IJCPOL: International Journal of Computer Processing of Oriental Languages, 19(1):63-88, 2006, World Scientific Publish Co., Imperial College Press
    • Clique Percolation Method for Finding Naturally Cohesive and Overlapping Document Clusters  [DOI]
      Wei Gao, Kam-Fai Wong, Yunqing Xia, and Ruifeng Xu
      ICCPOL 2006: The 21st International Conference on Computer Processing of Oriental Languages, November 2006, Singapore
    • Natural Document Clustering by Clique Percolation in Random Graphs   [pdf, DOI]
      Wei Gao and Kam-Fai Wong
      AIRS 2006: The 3rd Asia Information Retrieval Symposium, October 2006, Singapore

    2005
    • NIL Is Not Nothing: Recognition of Chinese Network Informal Language Expressions   [pdf]
      Yunqing Xia, Kam-Fai Wong, and Wei Gao
      SIGHAN 2005: The 4th SIGHAN workshop on Chinese Language Processing, October 2005, Jeju Island, Korea
    2004
    • Improving Transliteration with Precise Alignment of Phoneme Chunks and Using Contextual Features   [pdf, DOI]
      Wei Gao, Kam-Fai Wong, and Wai Lam
      AIRS 2004: The First Asia Information Retrieval Symposium, October 2004, Beijing, China
    • Phoneme-based Transliteration of Foreign Names for OOV Problem   [pdf, DOI]
      Wei Gao, Kam-Fai Wong, and Wai Lam
      IJCNLP 2004: The First International Joint Conference on Natural Language Processing, March 2004, Sanya, Hainan Island, China
    2003
    • Design Issues in a Chinese Financial Information Extraction System
      Qingzhong Li, Wei Gao, Wenjie Li, Kam-Fai Wong, Chunfa Yuan, and Yun Wang
      ICCPOL 2003: The 20th International Joint Conference on Computer Processing of Oriental Languages, August 2003, Shenyang, China