Glitch Tokens in Real-world Datasets