Sachin Kumar

Contact: kumar [dot] 1145 [at] osu [dot] edu

Google Scholar | Twitter | Bluesky | GitHub | LinkedIn

I am an Assistant Professor at The Ohio State University.

My research interests broadly include topics in Machine Learning for Natural Language Processing (NLP). In particular, I am interested in building language technologies that work for all people: language models that can uniformly support diverse languages, domains, populations, and individuals. Some research directions I am currently excited about are:

Multilingual models that can equitably support written and spoken languages.
Continual adaptation of NLP models to support new languages, varieties, domains, and capabilities.
Personalization of language technologies to diverse user preferences.
Evaluation of language models in scenarios that real users care about, especially uses by experts such as scientists and clinicians.

These directions touch various parts of a typical machine learning pipeline, including building new datasets, modeling paradigms and architectures, training and inference algorithms, and evaluation methodologies.

I was a postdoctoral researcher at the Allen Institute for AI (AI2) and obtained my Ph.D. at the Language Technologies Institute at Carnegie Mellon University (CMU) in 2023, with the final two years of my Ph.D. spent visiting the University of Washington in Seattle.

News:

[August 2025] Steering Off Course wins SAC Highlights award at ACL 2025! 🎉

[July 2025] Co-organizing the Tokenization Workshop at ICML 2025! Also (co-)presenting BLAB at the ML4Audio Workshop, Steering Off Course at the Actionable Interp Workshop, and FlexiTokens at TokShop.

[July 2025] New paper out on making tokenization more flexible to adaptation: FlexiTokens.

[July 2025] Personalization survey accepted at COLM 2025!

[July 2025] Three papers now accepted at ACL 2025 (Steering Off Course (Oral [top 8%], Panel [top 0.8%]), TESS 2 (Oral), HybridPref)!

[May 2025] New paper out on benchmarking audio LMs: BLAB!

[April 2025] New survey paper on personalized preference learning.

[April 2025] New paper on brittleness of steering methods: Steering Off Course.

[March 2025] Invited talk at UPenn CLunch Seminar.

[February 2025] New paper on diffusion LMs: TESS 2.

[January 2025] Three papers (ComPO, GroundCocoa, and RewardBench) now accepted at NAACL 2025 (see you in New Mexico)!

[November 2024] Will be in Miami to organize Customizable NLP @ EMNLP 2024! Proceedings now live here.

[October 2024] New paper on personalization: ComPO.

[October 2024] New paper on preference annotation: HybridPref.

[September 2024] MAGNET and WildTeaming now accepted at NeurIPS 2024!

[August 2024] Moved to Columbus and started at OSU!

[August 2024] Dolma won the best resource paper award at ACL 2024! 🎉

[July 2024] The website for the Customizable NLP workshop (to be held at EMNLP 2024) is up, including the CFP. Submit your papers!

[July 2024] New paper: Improving multilingual fairness of language models (MAGNET).

[July 2024] New paper on Contextual Noncompliance!

[June 2024] New paper WildTeaming on arXiv!

[May 2024] Dolma is accepted at ACL 2024!

[May 2024] Presented Gen-Z at ICLR 2024 in Vienna.

[April 2024] Invited talk on language model refusals at MilaNLP.

[March 2024] Two papers accepted at NAACL 2024!

[March 2024] RewardBench paper is on arXiv!

[March 2024] Gave a guest lecture on mitigating societal harms of LLMs at KAIST.

[February 2024] Dolma is on arXiv!

[January 2024] Paper on Generative zero-shot classification accepted at ICLR 2024!

[December 2023] Giving an invited talk at the IndoML symposium on December 22nd, 2023. Come say hi!

[November 2023] I am co-teaching a tutorial on mitigating societal harms in LLMs at EMNLP 2023. See you in Singapore!

[November 2023] New preprint out on zero-shot text classification.

[November 2023] New preprint out on preserving author perspectives in news summarization.

[October 2023] I’m co-organizing a workshop on Customizable NLP at EMNLP 2024! Details forthcoming.

[October 2023] A paper accepted at EMNLP 2023! Updated camera-ready version coming soon.

[August 2023] Successfully defended my thesis 🥳.

[August 2023] Started at AI2 as a Young Investigator.

[July 2023] Our paper Minding Language Models' (Lack of) Theory of Mind: A Plug-and-Play Multi-Character Belief Tracker got outstanding paper awards at ACL 2023 and ICML 2023 Theory of Mind Workshop 🎉.

[July 2023] I will join The Ohio State University as an Assistant Professor in the CS Department in Fall 2024!

[July 2023] I will spend a year as a postdoc at Allen Institute for AI working with Hanna Hajishirzi and Noah Smith!

[May 2023] Two new preprints on arXiv: Do All Languages Cost the Same? Tokenization in the Era of Commercial Language Models and SSD-2: Scaling and Inference-time Fusion of Diffusion Language Models.

[May 2023] 3 papers accepted at ACL 2023! Camera-ready versions out soon.

[May 2023] Vidhisha presented our survey paper at EACL in Croatia!

[April 2023] New preprint out on arXiv: Assessing Language Model Deployment with Risk Cards.

[January 2023] Our survey paper got accepted at EACL 2023! The camera-ready version will be out soon.

[December 2022] New preprint out on arXiv: On the Blind Spots of Model-Based Evaluation Metrics for Text Generation.

[October 2022] New preprint out on arXiv: SSD-LM: Semi-autoregressive Simplex-based Diffusion Language Model for Text Generation and Modular Control.

[October 2022] Passed my thesis proposal. I am now a Ph.D. candidate!

[October 2022] Gave an invited talk about my latest research at Google.

[October 2022] New preprint out on arXiv: "Language Generation Models Can Cause Harm: So What Can We Do About It? An Actionable Survey".

[October 2022] Two papers accepted at EMNLP 2022! arXiv versions out soon.

[June 2022] My research is now funded by Google PhD Fellowship!

[May 2022] New preprint out on arXiv: "Constrained Sampling from Language Models via Langevin Dynamics in Embedding Spaces".

[April 2022] Gave a tutorial at TheWebConf 2022 on "Mitigating Societal Harms of Large Language Models: A Case Study in Language Generation"

[September 2021] Paper on "Controlled Text Generation as Continuous Optimization with Multiple Constraints" accepted at NeurIPS 2021!

[September 2021] Short paper on "Improving the Diversity of Unsupervised Paraphrasing with Embedding Outputs" accepted at MRL@EMNLP 2021!

[May 2021] Short paper on "Machine Translation into Low Resource Language Varieties" accepted at ACL 2021! arXiv preprint coming soon.

[March 2021] Paper on "An Exploration of Data Augmentation Techniques for Improving English to Tigrinya Translation" accepted at AfricaNLP@EACL 2021. Preprint coming soon.

[November 2020] Paper on "End-to-End Differentiable GANs for Text Generation" accepted at the ICBINB@NeurIPS 2020.

[November 2020] Invited Talk on "Language Generation with Continuous Outputs" at G-Research, London.

[August 2020] Teaching Assistant for the brand new course on Multilingual NLP (Fall 2020) at CMU.

[May 2020] Paper on "A Deep Reinforced Model for Cross-Lingual Summarization with Bilingual Semantic Similarity Reward" accepted at WNGT@ACL 2020.

[December 2019] Going to Facebook AI Research, Seattle (virtually) for the summer.

[November 2019] Presented two posters at EMNLP 2019 in Hong Kong

[September 2019] Paper on "A Margin-based Loss with Synthetic Negative Samples for Continuous-output Machine Translation" accepted at EMNLP-WNGT workshop!

[September 2019] Teaching Assistant for Algorithms for NLP (Fall 2019)

[August 2019] Paper on "Topics to Avoid: Demoting Latent Confounds in Text Classification" accepted at EMNLP 2019!

[May 2019] Headed to Facebook for the summer as a research intern in their conversational AI team in Menlo Park.

[December 2018] Paper on "Von Mises-Fisher Loss for Training Sequence to Sequence Models with Continuous Outputs" accepted at ICLR 2019!

[October 2018] Gave my first ever lecture in Algorithms for NLP on Structural Classification

[September 2018] Teaching Assistant for Algorithms for NLP (Fall 2018)

[August 2018] Gave a talk about my research on Machine Translation with Continuous Outputs at LTI's Student Research Symposium

[August 2017] Headed to CMU LTI to start my PhD
