Public profiles

My Ph.D. work


Dr. Paolo Missier, Lecturer

School of Computing Science,
Newcastle University,
Room 7.04, Claremont Tower
Newcastle upon Tyne, NE1 7RU
United Kingdom

paolo dot missier at newcastle dot ac dot uk
I hope that from this page you don't get the false perception that I am a Computer Scientist. I am really a photographer


I am a Lecturer Information and Knowledge Management with the School of Computing at Newcastle University. My core expertise is in Information Management, with applications mainly to e-science. Specific interests: 

scientific workflow technology and its applications (Taverna, e-Science Central); provenance of data; bioinformatics; data analytics for HAR (Human Activity Recognition).

Current and recently completed research projects:

Bioinformatics: exome-based genetic testing: 

  • PI, Cloud e-Genome (2013-2015). Large-scale cloud-based deployment of 2nd Gen Sequencing analysis pipelines for exome variant analysis and interpretation. Funded by the NIHR/BioMedical Research Council, in collaboration with the Institute of Genetic Medicine at Newcastle University;

Learning and data analytics

  • CO-I, Hazard perception study (2013-2014). Pilot project aimed at developing models of drivers' behaviour using a variety of body and car sensors, with a view to enable personalized interventions for safer driving, aimed specifically at older/more vulnerable drivers.
Pervasive Computing (a much more recent interest):
  • Online learning for Human Activity Recognition (with Mr. Tudor Miu, 3rd year PhD student)

Data provenance:

  •  Generation of large synthetic provenance graphs for benchmarking provenance management systems (with Mr. Hugo Firth, PhD student)
  • Abstraction in provenance: PI, "Trusted Dynamic Coalitions" project (EPSRC/DSTL), 2012-2013, investigating provenance-based policies for information exchanges amongst partners with limited trust.
  • Member, W3C Working Group on Provenance on the Web, 2011-2013.
  • Co-lead, Scientific Workflow and Provenance Working Group, DataONE project (NSF, 2010-2014).

Undergraduate teaching:  Database technology

Postgraduate teaching: Big Data Analytics 

Past history, in brief: 

  • Ph.D. in Computer Science from the University of Manchester, UK (2007), M.Sc. in Computer Science from University of Houston, Tx., USA (1993), B.Sc. and M.Sc. in Computer Science from Universita' di Udine, Italy (1990).

Research Projects ‎[Last updated: Autumn 2013]‎

  • Cloud e-Genome, started Sept. 2013 Cloud-based data analysis and clinical interpretation of high throughput genomics sequences.Sponsored by the Newcastle Biomedical Research Centre and Microsoft Research. In collaboration with the Institute of Genetic Medicine ...
    Posted Sep 9, 2013, 8:32 AM by Paolo Missier
  • Trusted Dynamic Coalitions -- EPSRC funded, 1 year project May 2012 - May 2013.In collaboration with the DSTL.Co-investigators: Jeremy Bryans and John Fitzgerald, NewcastleEssential info from the executive summary:In a virtual organisation, partners need to ...
    Posted May 28, 2013, 12:39 AM by Paolo Missier
  • DataONE summer project: GoldenTrail DataONE is sponsoring two interns to work on a short, 8 weeks project centred on the development of a provenance repository.Project runs June - August, 2011Read here for d ...
    Posted Jun 28, 2011, 6:21 AM by Paolo Missier
  • Wf4Ever This is a 3 year EU/STREP project centred on the preservation of workflows within the context of collaborative science.Official siteWiki (partly public)
    Posted May 28, 2013, 12:40 AM by Paolo Missier
  • Qurator The ESPRC Qurator project ran from 2004 through 2007. Its main goals have been to characterise the notion of personalised Information Quality in e-science, in terms of a conceptual ...
    Posted Feb 20, 2011, 11:36 AM by Paolo Missier
Showing posts 1 - 5 of 7. View more »


Publications seem to be what we live and die for. Not right, and perhaps not inevitable, but since you ask, here they are:
  • my own list. Uses BibBase. Up-to-date, accurate, with bibtex entries
  • The official School of Computing list. Normally mirrors the former, but no bibtex or pdfs. Terribly hard to maintain so the first list above is more authoritative
  • The Google Scholar list - lags behind but has encouraging citation counts. It includes Tech reports, which are not in the lists above.
  • The DBLP list: lags behind and generally incomplete unless you only publish in DBLP space