Efficient Inference of Vision Instruction-Following Models with Elastic Cache
Zuyan Liu, Benlin Liu, Jiahui Wang, Yuhao Dong,
Guangyi Chen, Yongming Rao, Ranjay Krishna, Jiwen Lu
Tsinghua University, University of Washington, Carnegie Mellon University,
Mohamed bin Zayed University of Artificial Intelligence, Tencent, Allen Institute for AI
[Paper (arXiv)] [Code (GitHub)]