(Most articles are available on Google Scholar or arXiv, which are also more up to date; see Google Scholar / DBLP.)

2025 - present (Foundation models / LLMs)


Universal Model Routing for Efficient LLM Inference

W. Jitkrittum, H. Narasimhan, A.S. Rawat, J. Juneja, Z. Wang, C-Y. Lee, P. Shenoy, R. Panigrahy, A.K. Menon, S. Kumar. ICLR 2026.

Also presented at the SCOPE Workshop, ICLR-W 2025.


Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities

Gemini team. arXiv 2025.


Masked Generative Nested Transformers with Decode Time Scaling

S. Goyal, D. Tula, G. Jain, P. Shenoy, P. Jain, S. Paul. ICML 2025.

Also presented at the Delta workshop, ICLR-W 2025.