Research

Welcome to the Montclair State University NLP Lab's Research Page!

You can find our recent publications and their BibTeX files on this page.

Clicking [paper] will direct you to the published paper.

Clicking [bib] will automatically download the BibTeX file for the paper.

Hasan Can Biyik, Patrick Lee, Anna Feldman. 2024.

Turkish Delights: a Dataset on Turkish Euphemisms

This research extends NLP work on potentially euphemistic terms (PETs) to Turkish, introducing the first Turkish PET dataset with both euphemistic and non-euphemistic examples. We describe the dataset and our methodology for listing Turkish euphemisms, collecting example contexts, and annotating them. We also experiment with transformer-based models for Turkish euphemism detection, evaluating them with F1, accuracy, and precision.
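The evaluation metrics named above (accuracy, precision, F1) can be computed directly from binary predictions. A minimal sketch in plain Python; the gold labels and predictions below are invented for illustration, not from the dataset:

```python
def binary_metrics(y_true, y_pred):
    """Accuracy, precision, recall, and F1 for binary labels (1 = euphemistic)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    accuracy = sum(1 for t, p in zip(y_true, y_pred) if t == p) / len(y_true)
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return accuracy, precision, recall, f1

# Toy example: 4 gold labels vs. 4 model predictions
acc, prec, rec, f1 = binary_metrics([1, 0, 1, 1], [1, 0, 0, 1])
```

F1 is the harmonic mean of precision and recall, so a detector cannot score well by over- or under-predicting the euphemistic class.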

Proceedings of the First SIGTURK Workshop, co-located with ACL 2024

[paper, bib]


Patrick Lee, Anna Feldman. 2024.

Report on the Multilingual Euphemism Detection Task

This paper introduces the Multilingual Euphemism Detection Shared Task at FigLang 2024, part of NAACL 2024. The task involved detecting euphemisms in texts from American English, Spanish, Yorùbá, and Mandarin Chinese. We describe the expanded datasets, summarize the methods and findings of participating teams, and discuss implications for future research.

Proceedings of the 4th Workshop on Figurative Language Processing (FigLang 2024)

[paper, bib]


Patrick Lee, Alain Chirino Trujillo, Diana Cuevas Plancarte, Olumide Ebenezer Ojo, Xinyi Liu, Iyanuoluwa Shode, Yuan Zhao, Jing Peng, Anna Feldman. 2024.

MEDs for PETs: Multilingual Euphemism Disambiguation for Potentially Euphemistic Terms

This study explores how euphemisms are processed computationally across languages. We train the multilingual transformer model XLM-RoBERTa to identify potentially euphemistic terms (PETs) in both multilingual and cross-lingual contexts. Our findings show that zero-shot learning occurs and that multilingual models often outperform monolingual ones, highlighting the benefits of multilingual data for understanding euphemisms. We also investigate whether cross-lingual data within the same domain is more valuable than within-language data from other domains.

Findings of the Association for Computational Linguistics: EACL 2024

[paper, bib]


FEED PETs: Further Experimentation and Expansion on the Disambiguation of Potentially Euphemistic Terms 

Transformers are effective for classifying English potentially euphemistic terms (PETs) as euphemistic or non-euphemistic. We expand this task by annotating PETs for vagueness, finding transformers perform better on vague PETs, indicating linguistic differences impact performance. We also introduce euphemism corpora in Yoruba, Spanish, and Mandarin Chinese, using multilingual models mBERT and XLM-RoBERTa for experiments, providing preliminary results for future research.

Proceedings of the 12th Joint Conference on Lexical and Computational Semantics (*SEM 2023)

[paper, bib]


Iyanuoluwa Shode, David Ifeoluwa Adelani, Jing Peng, Anna Feldman. 2023.

NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification 

We create NollySenti, a Nollywood movie review dataset for five Nigerian languages (English, Hausa, Igbo, Nigerian-Pidgin, and Yoruba). Using classical machine learning and pre-trained language models, we evaluate cross-domain adaptation from Twitter and cross-lingual adaptation from English. Results show that English transfer in the same domain improves accuracy by over 5% compared to Twitter transfer in the same language. Using machine translation (MT) from English to Nigerian languages further improves accuracy by 7%. Despite low-quality MT for low-resource languages, human evaluation confirms that most translated sentences retain the original sentiment.

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

[paper, bib]


Libby Barak, Zara Harmon, Naomi H. Feldman, Jan Edwards, Patrick Shafto. 2023.

When Children's Production Deviates From Observed Input: Modeling the Variable Production of the English Past Tense

Children often produce verb forms incorrectly as they learn grammatical rules, such as using bare verbs when past tense is required. This study uses computational modeling to replicate this early stage of rule acquisition in English. Our model shows that these errors arise from a tension between trying to use less frequent forms (past tense) and overusing frequent forms (bare verbs). The model progresses through similar stages as children, eventually mastering past tense, illustrating how these stages can be explained by a single learning mechanism.

Cognitive Science - A Multidisciplinary Journal

[paper, bib]


Levi Corallo, Aparna S Varde. 2023.

Optical Character Recognition and Transcription of Berber Signs from Images in a Low-Resource Language Amazigh

The Berber (Amazigh) language, spoken by 14 million people in North Africa, lacks resources and representation in education and technology, including Google Translate. We propose DaToBS, a supervised method for detecting and transcribing Berber's Tifinagh alphabet from photos. Using a corpus of 1862 annotated character images and CNN-based computer vision, DaToBS achieves over 92% accuracy. This work is among the first to automate Tifinagh transcription using deep learning.

AI4EDU Workshop at AAAI-2023, the 37th AAAI Conference on Artificial Intelligence

[paper, bib]


Bonaventure F. P. Dossou, Atnafu Lambebo Tonja, Oreen Yousuf, Salomey Osei, Abigail Oppong, Iyanuoluwa Shode, Oluwabusayo Olufunke Awoyomi, Chris Chinenye Emezue. 2022.

AfroLM: A Self-Active Learning-based Multilingual Pretrained Language Model for 23 African Languages

In this paper, we leverage the efficiency of active learning when training multilingual pre-trained language models. We trained AfroLM from scratch on ~0.73GB of data from 23 African languages, 14x+ smaller than the data used by baselines such as mBERT, XLM-R, and AfroXLMR-base. On MasakhaNER, AfroLM outperforms mBERT and XLM-R-base, and is highly competitive with AfroXLMR-base. Although AfroLM was trained solely on news data, in OOD/cross-domain experiments on sentiment analysis in the Twitter and movie domains it also performs better, suggesting stronger adaptation and generalization.

Proceedings of The Third Workshop on Simple and Efficient Natural Language Processing (SustaiNLP)

[paper, bib]


Kenna Reagan, Aparna Varde, Lei Xie. 2022.

Evolving Perceptions of Mental Health on Social Media and their Medical Impacts

This research investigates mental health perceptions by analyzing seven years of Twitter data using topic modeling and sentiment analysis. We focus on polarity and subjectivity to understand public sentiments. Significant events like elections and the COVID-19 pandemic have influenced discussions, with a decline in positive sentiment since the pandemic. The findings provide insights for professionals in data science, epidemiology, and psychology on mental health trends from social media data.

2022 IEEE International Conference on Big Data (Big Data)

[paper, bib]


Patrick Lee, Martha Gavidia, Anna Feldman, Jing Peng. 2022.

Searching for PETs: Using Distributional and Sentiment-Based Methods to Find Potentially Euphemistic Terms

This paper introduces a method for identifying potentially euphemistic terms (PETs) using linguistic principles. By leveraging distributional similarities and sentiment-based metrics, we filter and rank phrase candidates from sentences. Our approach, tested on a corpus of euphemisms, effectively detects PETs across various topics and suggests future applications for sentiment-based methods in this area.
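The ranking idea described above (distributional similarity to known euphemisms, combined with a sentiment signal) can be illustrated with a toy scorer. The embeddings, sentiment scores, and the particular discounting rule below are invented for illustration, not the paper's actual features:

```python
import math

def cosine(u, v):
    """Cosine similarity between two dense vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def rank_candidates(candidates, ref_vec):
    """Score each candidate phrase by similarity to a reference euphemism vector,
    discounted by the magnitude of its sentiment (euphemisms tend to be mild)."""
    scored = [(phrase, cosine(vec, ref_vec) * (1 - abs(sentiment)))
              for phrase, vec, sentiment in candidates]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)

# Hypothetical candidates: (phrase, 2-d embedding, sentiment score in [-1, 1])
candidates = [
    ("passed away", [0.9, 0.1], -0.1),
    ("dropped dead", [0.8, 0.2], -0.8),
    ("bought a car", [0.1, 0.9], 0.0),
]
ranking = rank_candidates(candidates, ref_vec=[1.0, 0.0])
```

In this toy setup, "passed away" ranks first: it is both distributionally close to the reference and nearly neutral in sentiment, while the blunt "dropped dead" is penalized for its strong negativity.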

Proceedings of the Second Workshop on Understanding Implicit and Underspecified Language

[paper, bib]


Avery Field, Aparna Varde, Pankaj Lal. 2022.

Sentiment Analysis and Topic Modeling for Public Perceptions of Air Travel: COVID Issues and Policy Amendments

Air travel has been impacted by the COVID pandemic, with airlines and airports using public sector information to enforce health guidelines. Travelers have expressed opinions on these policies through online reviews. This study uses data science to analyze airline and airport reviews since 2017, focusing on COVID-related impacts and policy changes from 2020. VADER sentiment analysis predicts opinion changes, while LDA topic modeling identifies major concerns. Findings reveal that COVID policies have worsened public perceptions of air travel, raising new concerns about economics, environment, and health.

The Legal and Ethical Issues Workshop @LREC2022

[paper, bib]


Brad McNamee, Aparna Varde, Simon Razniewski. 2022.

Correlating Facts and Social Media Trends on Environmental Quantities Leveraging Commonsense Reasoning and Human Sentiments

Climate change opinions fluctuate, evident on social media like Twitter. This paper explores the relationship between Air Quality Index (AQI) data and climate change tweets. We focus on commonsense interpretations for broader appeal, using real AQI data and VADER for sentiment analysis. We find that correlations between climate tweets and air quality vary by year and environmental factors. Our goal is to increase climate change awareness and provide methods for addressing it, making the study openly accessible under a Creative Commons license.

Proceedings of the 2nd Workshop on Sentiment Analysis and Linguistic Linked Data

[paper, bib]


Martha Gavidia, Patrick Lee, Anna Feldman, Jing Peng. 2022.

CATs are Fuzzy PETs: A Corpus and Analysis of Potentially Euphemistic Terms

Euphemisms are often overlooked in natural language processing, despite their role in polite and figurative speech. They pose challenges due to their evolving nature and varying interpretations. To address this, we present a corpus of potentially euphemistic terms (PETs) with examples from the GloWbE corpus, as well as a subcorpus of non-euphemistic uses. Our analyses show that PETs generally reduce negative sentiment, but there is some disagreement in annotating these terms as euphemistic or not, which may be influenced by whether a term is widely accepted.

Proceedings of the Thirteenth Language Resources and Evaluation Conference

[paper, bib]


Iyanuoluwa Shode, David Ifeoluwa Adelani, Anna Feldman. 2022.

YOSM: A New Yoruba Sentiment Corpus For Movie Reviews

Opinions on movies can vary widely, reflecting the complex nature of human emotions. Sentiment analysis, a branch of natural language processing, helps understand these emotions across different contexts, including product reviews and social media. While much research has focused on high-resource languages, low-resource languages like Yoruba have been underexplored. To address this gap, we analyze sentiment in 1500 Yoruba movie reviews from sources like IMDB and Rotten Tomatoes. We use advanced models such as mBERT and AfriBERTa to classify these reviews.

AfricaNLP Workshop @ICLR 2022

[paper, bib]


Levi Corallo, Guanghui Li, Kenna Reagan, Abhishek Saxena, Brandon Wilde, Aparna S. Varde. 2022.

A Framework for German-English Machine Translation with GRU RNN

Machine translation (MT) with Gated Recurrent Units (GRUs) is efficient for handling sequential data compared to Long Short-Term Memory (LSTM) models, especially with smaller datasets. This paper presents a GRU-based Recurrent Neural Network (RNN) using WMT2021’s English-German dataset to translate German news into English. Our framework aims to improve translation efficiency for applications and supports the UN’s Quality Education goal by enhancing remote education and equitable access. It can also be adapted for other language translation tasks.
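The GRU cell behind the model above maintains a hidden state governed by update and reset gates, which is what makes it cheaper than an LSTM (two gates instead of three, no separate cell state). A minimal single-step sketch in NumPy; the weights are random placeholders, not trained translation parameters:

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def gru_step(x, h, params):
    """One GRU time step: gates decide how much of the old hidden state to keep."""
    Wz, Uz, Wr, Ur, Wh, Uh = params
    z = sigmoid(x @ Wz + h @ Uz)              # update gate
    r = sigmoid(x @ Wr + h @ Ur)              # reset gate
    h_tilde = np.tanh(x @ Wh + (r * h) @ Uh)  # candidate state
    return (1 - z) * h + z * h_tilde          # interpolate old and new state

rng = np.random.default_rng(0)
d_in, d_hid = 4, 3
# Six weight matrices: input-to-hidden (d_in x d_hid) and
# hidden-to-hidden (d_hid x d_hid) for each of z, r, and the candidate.
params = tuple(rng.normal(size=s)
               for s in [(d_in, d_hid), (d_hid, d_hid)] * 3)
h = np.zeros(d_hid)
for x in rng.normal(size=(5, d_in)):  # run a toy 5-step input sequence
    h = gru_step(x, h, params)
```

Because the new state is a convex combination of the old state and a tanh-bounded candidate, the hidden activations stay in (-1, 1), one reason GRUs train stably on modest data.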

CEUR Workshop Proceedings (CEUR-WS.org)

[paper, bib]


Azza Abugharsa. 2021.

Sentiment Analysis in Poems in Misurata Sub-dialect

Recent advancements in Arabic natural language processing have improved sentiment analysis for both Modern Standard Arabic (MSA) and various dialects. This study examines sentiment in Misurata Arabic poetry from Libya, using Sklearn with classifiers like Logistic Regression and SVM, and Mazajak's CNN tool. Results indicate traditional classifiers outperform Mazajak's deep learning approach. Further research is needed to explore sentiment in Arabic sub-dialect poetry, particularly regarding figurative language use.

International Journal of Computer and Technology Vol 21 (2021)

[paper, bib]

Zach Dau, Anna Feldman, Jing Peng. 2021.

Computational Analysis of the Coronavirus Pandemic: Response of Tri-State Area Politicians on Twitter

The COVID-19 pandemic has significantly changed life worldwide. In the U.S., nearly 10% of cases are in New York, New Jersey, and Connecticut. We analyzed tweets from prominent politicians in this area over 20 months, including before and during the pandemic. Our study found a significant increase in Twitter activity and used LDA and LSA models to observe topic changes. We also analyzed sentiment and lexical shifts, noting a trend towards more neutral tweet sentiment as politicians increased their engagement with constituents during the pandemic.

EasyChair Preprint no. 5984

[paper, bib]


Martina Ducret, Lauren Kruse, Carlos Martinez, Anna Feldman, Jing Peng. 2020.

You Don’t Say… Linguistic Features in Sarcasm Detection

We explore linguistic features contributing to sarcasm detection, focusing on text and word complexity, as well as stylistic and psychological features. Our experiments with sarcastic tweets, both with and without context, reveal that contextual information is crucial for sarcasm prediction. Notably, sarcastic tweets often show sentiment or emotional incongruence with their context.
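The sentiment-incongruence cue described above can be illustrated with a toy check that compares a tweet's polarity against that of its context. The lexicon, scoring rule, and threshold are all invented for illustration, not the paper's actual features:

```python
# Tiny hand-made polarity lexicon (illustrative only)
LEXICON = {"love": 1.0, "great": 1.0, "wonderful": 1.0,
           "hate": -1.0, "terrible": -1.0, "delayed": -0.5}

def polarity(text):
    """Average polarity of the known words in a text; 0.0 if none are known."""
    hits = [LEXICON[w] for w in text.lower().split() if w in LEXICON]
    return sum(hits) / len(hits) if hits else 0.0

def incongruent(tweet, context, threshold=1.0):
    """Flag a tweet whose sentiment clashes strongly with its context's sentiment."""
    return abs(polarity(tweet) - polarity(context)) >= threshold

# A positive-sounding tweet against a negative context trips the detector
flag = incongruent("i love waiting great",
                   "my flight was delayed for terrible hours")
```

A real detector would use learned sentiment and emotion features rather than a word list, but the core signal is the same: sarcasm often surfaces as a polarity mismatch between utterance and context.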

Proceedings of the Seventh Italian Conference on Computational Linguistics CLiC-it 2020

[paper, bib]