Content details
Prior knowledge required by attendees
Ability to install and run software on their personal computer. Understanding of biomedical terminological resources (e.g., UMLS Metathesaurus) and text processing will be helpful. No software programming knowledge will be required. Familiarity with the Java programming language is helpful, but not required. Code samples and step-by-step instructions will be provided when needed.
Tutorial material
Slides: An introduction to NLP and information extraction, to clinical text characteristics, and existing NLP resources for clinical text processing will be presented with slides. They can be downloaded (see above).
Clinical text corpus: A collection of synthetic clinical notes from MTSamples will be made available to registered participants, along with annotations.
NLP tools: Several NLP tools will be used for the hands-on exercises during the tutorial. The main one used for exercises will be CLAMP. Preference was given to open source tools.
For active participation in exercises, participants should have a laptop computer. We will have flash drives to distribute the software.
Handout: Will be distributed at the tutorial and be made available to registered participants. So avoid printing large quantities of paper, it combines all slides presented during the tutorial with references about the NLP resources presented and used for the hands-on activities, and all are available through links above.