Hi! I am Vishwa Shah, an ML Research Engineer at Apple working on Intelligent Input Experience. I am passionate about adaptable mechanisms for language-guided AI and studying their robustness across domains.
I recently graduated from LTI, Carnegie Mellon University, where I pursued the MIIS program. At CMU, I worked on enhancing memory in Web-Agents by learning during inference. I also explored projects on analyzing the robustness of LLMs to the position of information in long contexts and on mitigating hallucinations in models through reinforcement learning from automated fine-grained reasoning.
Prior to this, I completed my AI Residency at Meta in July 2023, where I first worked on low-resource multimodal detection of integrity violations in VR and later on inducing personality traits in LLaMA to build AI Characters using reward modeling.
I graduated with a B.E. (Hons.) in Computer Science from BITS Pilani, K K Birla Goa Campus, India in 2022. I have had the opportunity to work as a research intern at MIDAS in the domain of Multimodal AI and at APPCAIR on neural algorithmic reasoning.
Find out more about my past research, experiences, and projects.
Jan'25 - Started as an ML Research Engineer at Apple!
June'24 - Pre‑Calc: Learning to Use the Calculator Improves Numeracy in Language Models accepted at AI4Math, ICML'24
May'24 - Started my Research Internship at Datology AI!
Mar'24 - AdaPT: A Set of Guidelines for Hyperbolic Multimodal Multilingual NLP accepted at NAACL'24 Findings!
Aug'23 - Will be joining the MIIS program at LTI, CMU for Fall'23
July'22 - Started working as an AI Resident at Meta!
July'22 - Knowledge-based Analogical Reasoning in Neuro-symbolic Latent Spaces accepted at NeSy-IJCLR'22.