Publications

(2023). An Analysis of Visually Grounded Instructions in Embodied AI Tasks. Proceedings of the 9th Italian Conference on Computational Linguistics, Venice, Italy, November 30 - December 2, 2023.
(2023). 'What are you referring to?' Evaluating the Ability of Multi-Modal Dialogue Models to Process Clarificational Exchanges. CoRR.
(2023). 'What are you referring to?' Evaluating the Ability of Multi-Modal Dialogue Models to Process Clarificational Exchanges. Proceedings of the 24th Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2023, Prague, Czechia, September 11 - 15, 2023.
(2022). Task Formulation Matters When Learning Continually: A Case Study in Visual Question Answering. CoRR.
(2022). Going for GOAL: A Resource for Grounded Football Commentaries. CoRR.
(2022). Exploring Multi-Modal Representations for Ambiguity Detection & Coreference Resolution in the SIMMC 2.0 Challenge. CoRR.
(2022). Demonstrating EMMA: Embodied MultiModal Agent for Language-guided Action Execution in 3D Simulated Environments. Proceedings of the 23rd Annual Meeting of the Special Interest Group on Discourse and Dialogue, SIGDIAL 2022, Edinburgh, UK, 07-09 September 2022.
(2022). Combine to Describe: Evaluating Compositional Generalization in Image Captioning. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics: Student Research Workshop, ACL 2022, Dublin, Ireland, May 22-27, 2022.
(2022). ACT-Thor: A Controlled Benchmark for Embodied Action Understanding in Simulated Environments. Proceedings of the 29th International Conference on Computational Linguistics, COLING 2022, Gyeongju, Republic of Korea, October 12-17, 2022.
(2021). Embodied BERT: A Transformer Model for Embodied, Language-guided Visual Task Completion. CoRR.