VLM: Task-agnostic Video-Language Model Pre-training for Video Understanding
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 (2021) - Comments
doi: 10.18653/v1/2021.findings-acl.370 

Hu Xu, Gargi Ghosh, Po-Yao Huang, Prahal Arora, Masoumeh Aminzadeh, Christoph Feichtenhofer, Florian Metze, Luke Zettlemoyer