Publications

2026

Aligning Large Language Models With Human Feedback: Mathematical foundations and algorithm design

IEEE Signal Processing Magazine paper

Siliang Zeng*, Luca Viano*, Chenliang Li*, Jiaxiang Li*, Markus Wulfmeier, Stefano Ermon, Alfredo Garcia, Mingyi Hong

Split the Differences, Pool the Rest: Provably Efficient Multi-Objective Imitation

arxiv

Z. Sheebaelhamd*, L. Viano*, V. Cevher, C. Vernade

Provably avoiding over-optimization in Direct Preference Optimization without knowing the data distribution

arxiv

A. Barla*, E.Nevali*, L. Viano*, V. Cevher

Direct Preference Optimization with Rating Information: Practical Algorithms and Provable Gains

arxiv

L. Viano, R. Zhou, Y. Sun, M. Namazifar, V. Cevher, S. Sabach, M. Ghavamzadeh

Multi-agent imitation learning with function approximation: Linear Markov games and beyond

ICML 2026 paper

L. Viano*, T. Freihaut*, E. Nevali, V. Cevher, M. Geist, G. Ramponi

Rate optimal learning equilibria from data

AISTATS 2026 paper

T. Freihaut*, L. Viano*, E. Nevali, V. Cevher, M. Geist, G. Ramponi

2025

Learning equilibria from data: Provably efficient multi agent imitation learning.

NeurIPS 2025 paper

T. Freihaut*, L. Viano*, V. Cevher, M. Geist, G. Ramponi

Inverse Q-Learning Done Right: Offline Imitation Learning in Qπ MDPs.

NeurIPS 2025 paper

A. Moulin, G. Neu, L. Viano

Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning.

COLT 2025 paper

A. Moulin, G. Neu, L. Viano

IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic

ICML 2025 paper

Stefano Viel*, Luca Viano*, Volkan Cevher

Best of Both Worlds: Regret Minimization versus Minimax Play

ICML 2025 paper

Adrian Mueller*, Jon Schneider*, Stratis Skoulakis*, Luca Viano*, Volkan Cevher

Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees.

paper

Yongtao Wu*, Luca Viano*, Yihang Chen, Zhenyu Zhu, Kimon Antonakopoulos, Quanquan Gu, Volkan Cevher

Adaptive Bilevel Optimization

ACM Journal of Data Science

K. Antonakopolous, S. Sabach, L. Viano, M. Hong, V. Cevher

2024

Polynomial Convergence of Bandit No-Regret Dynamics in Congestion Games.

WINE 2024 paper ( merged with https://arxiv.org/abs/2306.13673 )

L. Dadi*, I. Panageas*, S. Skoulakis*, L. Viano*, V. Cevher

Imitation Learning in Discounted Linear MDP without exploration assumptions.

ICML 2024 paper

L. Viano, S. Skoulakis, V. Cevher

2023

Alternation makes the adversary weaker in two players game.

NeurIPS 2023 (Spotlight) [paper]

V. Cevher, A. Cutkosky, A.Kavis, G. Piliouras, S. Skoulakis, L. Viano

Semi-Bandit Dynamics in Congestion Games: Convergence to Nash Equilibrium and No Regret guarantees.

ICML 2023 (Oral) [paper]

I. Panageas*, S. Skoulakis*, L. Viano*, X. Wang, V. Cevher

What can online reinforcement learning with function approximation benefit from general coverage conditions?

ICML 2023 [paper]

F. Liu, L. Viano, V. Cevher

2022

Proximal Point Imitation Learning

NeurIPS 2022 [paper]

L. Viano, A.Kamoutsi, G. Neu, I. Krawczuk, V. Cevher

Identifiability and Generalizability from multiple experts in Inverse Reinforcement Learning

NeurIPS 2022 [paper]

P. Rolland, L. Viano, Norman Schuerhoff, Boris Nikolov, V. Cevher

Understanding Deep Neural Function Approximation in Reinforcement Learning via epsilon-Greedy Exploration

NeurIPS 2022 [paper]

F. Liu, L. Viano, V. Cevher

A Natural Actor-Critic Framework for Zero-Sum Markov Games

ICML 2022 [paper]

A. Alacaoglu, L. Viano, N.He, V.Cevher

Robust Learning from Observation under Model Misspecification

AAMAS 2022 [paper]

L. Viano, Y.T. Huang, P. Kamalaruban, Craig Innes, S. Ramamoorthy, A. Weller

2021

Robust Inverse Reinforcement Learning under Transition Dynamics Mismatch

Advances in Neural Information Processing Systems 2021 [paper]

L. Viano, Y.T. Huang, P. Kamalaruban, A. Weller, V. Cevher

Neural NID Rules

Physical Reasoning and Inductive Biases for the Real World at NeurIPS 2021 [paper]

L. Viano, J. Brea

Page updated

Google Sites

Report abuse