Spatially Aware Multimodal Transformers for TextVQA
arXiv (2020) - Comments
arxiv: 2007.12146  issn: 2331-8422 

Yash Kant, Dhruv Batra, Peter Anderson, Alex Schwing, Devi Parikh, Jiasen Lu, Harsh Agrawal