AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding

Jan 1, 2024ยท
Alessandro Suglia
,
Claudio Greco
,
Katie Baker
,
Jose L. Part
,
Ioannis Papaioannou
,
Arash Eshghi
,
Ioannis Konstas
,
Oliver Lemon
ยท 0 min read
Type
Publication
Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, USA, November 12-16, 2024