I research on online learning in non-stationary environments, where the decision maker (DM) could start with partial knowledge on the unknown online model. 

Non-stationarity: Classical research works provide important foundation on online learning under stationarity. In non-stationary worlds, the classical sense of convergence no longer applies. 

How should the DM adapt to drifts, while accumulating online information?

Partial knowledge: At the start of the e-commerce era, learning from scratch is often the norm. Correspondingly, most existing online learning models start with little or no knowledge on the latent model. By contrast, in the current data-rich era, the DM often has non-trivial knowledge on the latent model.

How/when should the DM embed his/her auxiliary knowledge in online learning? What sort of auxiliary knowledge is useful?

My research agenda sheds light on the above in two models: multi-armed bandits (MABs) and online resource allocation.