Datasets, Codes, and Language Models