file format for the preprocessed lexical data