Dr Alessandro Suglia ☕️

Assistant Professor

About Me

Alessandro Suglia is an Assistant Professor at Heriot-Watt University (HWU) and co-lead of the “Generative AI for Robotics” theme at the National Robotarium. He is also a member of the ELLIS network and the academic liaison between HWU and the Alan Turing Institute.

Interests
  • Generative AI for Robotics and Embodied AI
  • Multimodal Learning
  • Conversational AI
Education
  • PhD in Robotics and Autonomous Systems

    Heriot-Watt University & University of Edinburgh

  • MRes in Robotics and Autonomous Systems

    Heriot-Watt University & University of Edinburgh

  • MSc in Computer Science

    University of Bari, Aldo Moro

  • BSc in Computer Science

    University of Bari, Aldo Moro

📚 My Research

Alessandro’s research focuses on designing artificial agents that learn language by leveraging sensory information derived from interacting with the world and with other agents. During his PhD, he was one of the main developers of Alana, the Heriot-Watt conversational AI that ranked 3rd in the Amazon Alexa Prize challenge in 2018. As Assistant Professor at HWU, he led the HWU team “EMMA”, the only non-American university team among the finalists of the Amazon Simbot Challenge, the first Amazon competition to push the boundaries of Embodied Conversational AI. Alongside several academic collaborations, he has completed research collaborations with Amazon Alexa AI, Meta AI, and the European Space Agency, focused on developing innovative Multimodal Generative AI models for embodied and situated human-robot interaction tasks.


Recent Publications
(2024). AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding. Findings of the Association for Computational Linguistics: EMNLP 2024, Miami, Florida, USA, November 12-16, 2024.
(2024). AlanaVLM: A Multimodal Embodied AI Foundation Model for Egocentric Video Understanding. CoRR.
(2024). CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts. CoRR.
(2024). Enhancing Continual Learning in Visual Question Answering with Modality-Aware Feature Distillation. CoRR.
(2024). Human - Large Language Model Interaction: The dawn of a new era or the end of it all? Companion of the 2024 ACM/IEEE International Conference on Human-Robot Interaction, HRI 2024, Boulder, CO, USA, March 11-15, 2024.