VideoAgent: Long-Form Video Understanding with Large Language Model as Agent
Xiaohan Wang (Stanford University), Yuhui Zhang (Stanford University), Orr Zohar (Stanford University), Serena Yeung-Levy (Stanford University)
Cited by 39
Abstract