(Most articles are available on Google Scholar or arXiv, which are also more up to date: Google Scholar / DBLP.)

2025 - present (Foundation models / LLMs)


Gemini 2.5: Pushing the frontier with advanced reasoning, multimodality, long context, and next generation agentic capabilities

Gemini Team. arXiv 2025.


Masked Generative Nested Transformers with Decode Time Scaling

S. Goyal, D. Tula, G. Jain, P. Shenoy, P. Jain, S. Paul. ICML 2025.

Also presented at the Delta Workshop, ICLR-W 2025.


Universal Model Routing for Efficient LLM Inference

W. Jitkrittum, H. Narasimhan, A.S. Rawat, J. Juneja, Z. Wang, C-Y. Lee, P. Shenoy, R. Panigrahy, A.K. Menon, S. Kumar. SCOPE Workshop, ICLR-W 2025.