About Me

Hi! I am Vishwa Shah, an ML Research Engineer at Apple working on Intelligent Input Experience. I am passionate about adaptable mechanisms for language-guided AI and studying their robustness across domains.

I recently graduated from LTI, Carnegie Mellon University, where I pursued the MIIS program. At CMU, I worked on enhancing memory in web agents by learning during inference. I also explored projects on analyzing the robustness of LLMs to the position of information in long contexts and on mitigating hallucinations in models through reinforcement learning from automated fine-grained reasoning.

Prior to this, I completed my AI Residency at Meta in July 2023, where I first worked on low-resource multimodal detection of integrity violations in VR and later on inducing personality traits in LLaMA to build AI Characters using reward modeling.

I graduated with a B.E. (Hons.) in Computer Science from BITS Pilani, K K Birla Goa Campus, India in 2022. I have had the opportunity to work as a research intern at MIDAS in the domain of multimodal AI and at APPCAIR on neural algorithmic reasoning.

Find out more about my past research, experiences, and projects.

News

Apr'25 - Exploring the Pre-conditions for Memory-Learning Agents accepted at SSI-FM workshop at ICLR 2025!
Feb'25 - MERLIN: A Testbed for Multilingual Multimodal Entity Recognition and Linking accepted in the TACL Journal
Jan'25 - Started as an ML Research Engineer at Apple!
June'24 - Pre‑Calc: Learning to Use the Calculator Improves Numeracy in Language Models accepted at AI4Math, ICML'24
May'24 - Started my Research Internship at Datology AI!
Mar'24 - AdaPT: A Set of Guidelines for Hyperbolic Multimodal Multilingual NLP accepted at NAACL'24 Findings!
Aug'23 - Will be joining the MIIS program at LTI, CMU for Fall'23
July'22 - Started working as an AI Resident at Meta!
July'22 - Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces accepted at NeSy-IJCLR'22.

Contact me here!

GitHub

Twitter
