Efficient Inference of Vision Instruction-Following Models with Elastic Cache 


Zuyan Liu   Benlin Liu   Jiahui Wang   Yuhao Dong   Guangyi Chen   Yongming Rao   Ranjay Krishna   Jiwen Lu

Tsinghua University   University of Washington   Carnegie Mellon University   Mohamed bin Zayed University of Artificial Intelligence   Tencent   Allen Institute for AI

[Paper (arXiv)]      [Code (GitHub)]