Roles and responsibilities
Welcome! I am Director of Research in Computational Linguistics at the Department of Theoretical and Applied Linguistics (DTAL) at the University of Cambridge and co-founder of the Language Technology Laboratory. I am also a Visiting Scientist at the European Bioinformatics Institute (EMBL-EBI), a member of the EPSRC Peer Review College (2015 - present), an elected member of the faculty board in Modern and Medieval Languages (2016-2018) and a study steering committee member on the NIHR DEPEND project (2015-2018). I am a member of the Cambridge Centre for Science and Policy (CSaP), Cambridge Big Data, and Cambridge Language Sciences.
EPSRC SIPHS (EP/M005089/1): I am funded by a 1.2 million, 5-year EPSRC fellowship to lead the Semantic Interpretation of Personal Health messages on the Web (SIPHS) project. This is an international collaborative effort to leverage social media data for digital disease applications such as detecting infectious disease outbreaks and adverse drug reactions.
MRC PheneBank (MR/M025160/1): I am PI on the PheneBank project. This project seeks to develop a new method for identifying and harmonising human phenotypes from the scientific literature, as well as their associations with entities of interest such as diseases, genes and other phenotypes.
I have been actively involved in teaching throughout my career and have taught a range of computational linguistics units both at the Department of Informatics in Sokendai and in the University of Cambridge where I have given frequent guest lectures. I currently teach on the Biomedical Information Processing course at the Computer Lab during the Lent term.
Prospective PhD students
I am delighted to consider PhD project proposals from students with a strong background in computing, linguistics or AI. I do, however, receive a steady stream of such contacts and, in order to save time, request that in your initial message you (a) provide a brief overview of your project idea and - importantly - how it relates to my research interests, and (b) provide an up-to-date CV including overall course grades. If you wish to apply for the PhD course starting in October 2017, please contact me by October/November 2016.
MPhil students on the ACS course: please contact me about project proposals on Biomedical Information Processing.
I am currently:
Prior to joining the University of Cambridge I was an FP7 Marie Curie fellow on the PhenoMiner project at EMBL-EBI (2012-2014) and Associate Professor at the National Institute of Informatics in Tokyo, where I led the Natural Language Processing laboratory. From 2007 to 2012 I served as a technology advisor on the international Global Health Security Action Group technical working group on Risk Management and Communication. I obtained my PhD in computational linguistics at UMIST (now the University of Manchester) in 1996 for my research into the application of neural networks to machine translation.
I am a senior member of the Association for Computing Machinery (1996 - present) and a member of the Association for Computational Linguistics (1996 - present).
Director of Research in Computational Linguistics, and Co-Director of the Language Technology Lab