Owers, J., Simpson, E., & Lewis, M. (2026). As Easy As Rocket Science: Assessing the Ability of Large Language Models to Interpret Negation in Figurative Language. Under review at TACL.
Tong, X., Zhang, Z., Sommerauer, P., Lewis, M., & Shutova, E. (2026). Hummus: A dataset of humorous multimodal metaphor use. Under review at TACL.
Lewis, M., & Mitchell, M. (2025). Evaluating the Robustness of Analogical Reasoning in Large Language Models. Transactions on Machine Learning Research.
Tong, X., Choenni, R., Lewis, M., & Shutova, E. (2024, August). Metaphor understanding challenge dataset for LLMs. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 3517-3536).
Lewis, M., & Mitchell, M. (2024). Using Counterfactual Tasks to Evaluate the Generality of Analogical Reasoning in Large Language Models. In Proceedings of the Annual Meeting of the Cognitive Science Society (Vol. 46).
Tong, X., Shutova, E., & Lewis, M. (2021, June). Recent advances in neural metaphor processing: A linguistic, cognitive and social perspective. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (pp. 4673-4686).
Dankers, V., Rei, M., Lewis, M., & Shutova, E. (2019, November). Modelling the interplay of metaphor and emotion through multitask learning. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP) (pp. 2218-2229).