VideoAgent: Long-Form Video Understanding with Large Language Model as Agent

Xiaohan Wang (Stanford University), Yuhui Zhang (Stanford University), Orr Zohar (Stanford University), Serena Yeung-Levy (Stanford University)
Lecture Notes in Computer Science
October 25, 2024
Cited by 39

Abstract

