AMP 2016 Tutorial

This page contains information about my AMP 2016 tutorial. The tutorial is intended for audiences with limited Python experience. Previous programming experience will be helpful but not necessary. The tutorial will focus on the Python skills needed to do basic corpus work, including the following:

    • basic syntax of Python 2

    • loops and conditionals

    • opening files and reading/writing information

    • cleaning text (stripping punctuation, isolating wordforms, etc..)

    • counting things (e.g. frequency, concordance)

If there is time, I will cover one of the following, slightly more advanced topics:

    • scraping text from the Web

    • woking with non-English text (encoding/decoding, Unicode standards, etc..)

    • sampling words/phrases from a frequency distribution

    • multi-agent modeling

The tutorial will use the Jupyter notebook, which is a browser-enabled interactive Python environment. The easiest way to obtain this, and to get a hassle-free SciPy (scientific Python) install is to download the integrated Enthought Canopy environment. So please do that before you come to AMP!

Before you leave home:

When you arrive in LA:

    1. Check back to this website to download the tutorial file and associated text files.

    2. AMP 2016 Jupyter notebook