Publications

nocaps: novel object captioning at scale

Published in International Conference on Computer Vision (ICCV), 2019

Authors: Harsh Agrawal*, Karan Desai*, Yufei Wang, Xinlei Chen, Rishabh Jain, Mark Johnson, Dhruv Batra, Devi Parikh, Stefan Lee, Peter Anderson

To encourage the development of image captioning models that can learn visual concepts from alternative data sources, such as object detection datasets, we present the first large-scale benchmark for this task. Dubbed nocaps, for novel object captioning at scale, our benchmark consists of 166,100 human-generated captions describing 15,100 images from the Open Images validation and test sets.

Download here

How to best use Syntax in Semantic Role Labelling

Published in 57th Annual Meeting of the Association for Computational Linguistics, 2019

Authors: Yufei Wang, Mark Johnson, Stephen Wan, Yifang Sun, Wei Wang

We evaluate three different ways of encoding syntactic parses and three different ways of injecting them into a state-of-the-art neural ELMo-based SRL sequence labelling model.

Download here

Neural Constituency Parsing of Speech Transcripts

Published in Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), 2019

Authors: Paria Jamshid Lou, Yufei Wang, Mark Johnson

Parsing speech with Transformer.

Download here