Abstract: Multimodal Object-Entity Relation Extraction (MORE) is an emerging information extraction task that aims to extract object-entity relational facts from text and image data. Despite ...