Literate models for computer vision:

Combining vision, language and reading