Publications

(2024). Learning To See But Forgetting To Follow: Visual Instruction Tuning Makes LLMs More Prone To Jailbreak Attacks. CoRR.
(2024). Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks. CoRR.
(2024). Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024.
(2024). Human - Large Language Model Interaction: The dawn of a new era or the end of it all?. Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, HRI 2024, Boulder, CO, USA, March 11-15, 2024.
(2024). Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation. CoRR.
(2024). CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts. CoRR.
(2024). AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding. CoRR.
(2024). AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding. Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, USA, November 12-16, 2024.
(2023). Multitask Multimodal Prompted Training for Interactive Embodied Task Completion. CoRR.