Publications

Ethan Smyth, Alessandro Suglia (2026). VoyagerVision: Investigating the Role of Multi-modal Information for Open-ended Learning Systems. Advances in Intelligent Systems and Computing ((AISC,volume 1468)).

PDF Cite Code Project

Rohit Saxena, Alessandro Suglia, Pasquale Minervini (2026). VLM-RobustBench: A Comprehensive Benchmark for Robustness of Vision-Language Models. ICML 2026.

Cite DOI URL

Farooq Ahmad Wani, Alessandro Suglia, Rohit Saxena, Aryo Pradipta Gema, Wai-Chung Kwan, Fazl Barez, Maria Sofia Bucarelli, Fabrizio Silvestri, Pasquale Minervini (2026). Same Answer, Different Representations: Hidden instability in VLMs. CoRR.

Cite DOI URL

Georgios Pantazopoulos, Malvina Nikandrou, Ioannis Konstas, Alessandro Suglia (2026). Retrievit: In-context Retrieval Capabilities of Transformers, State Space Models, and Hybrid Architectures. CoRR.

Cite DOI URL

Hamza Mooraj, Georgios Pantazopoulos, Alessandro Suglia (2026). AgriPath: A Systematic Exploration of Architectural Trade-offs for Crop Disease Classification. CoRR.

Cite DOI URL

Malvina Nikandrou, Georgios Pantazopoulos, Nikolas Vitsakis, Ioannis Konstas, Alessandro Suglia (2025). CROPE: Evaluating In-Context Adaptation of Vision and Language Models to Culture-Specific Concepts. NAACL 2025.

Cite DOI URL

Filippo Momentè, Alessandro Suglia, Mario Giulianelli, Ambra Ferrari, Alexander Koller, Oliver Lemon, David Schlangen, Raquel Fernández, Raffaella Bernardi (2025). Triangulating LLM Progress through Benchmarks, Games, and Cognitive Tests. Findings of the Association for Computational Linguistics: EMNLP 2025, Suzhou, China, November 4-9, 2025.

Cite URL

Nicola Horst, Davide Mazzaccara, Antonia Schmidt, Michael Sullivan, Filippo Momentè, Luca Franceschetti, Philipp Sadler, Sherzod Hakimov, Alberto Testoni, Raffaella Bernardi, Raquel Fernández, Alexander Koller, Oliver Lemon, David Schlangen, Mario Giulianelli, Alessandro Suglia (2025). Playpen: An Environment for Exploring Learning From Dialogue Game Feedback. Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, EMNLP 2025, Suzhou, China, November 4-9, 2025.

Cite DOI URL

Emmanouil Zaranis, António Farinhas, Saul Santos, Beatriz Canaverde, Miguel Moura Ramos, Aditya K Surikuchi, André Viveiros, Baohao Liao, Elena Bueno-Benito, Nithin Sivakumaran, Others (2025). Movie Facts and Fibs (MF $^ 2$): A Benchmark for Long Movie Understanding. arXiv preprint arXiv:2506.06275.

Cite

Alessandro Suglia, Ioannis Konstas, Oliver Lemon (2024). Visually Grounded Language Learning: A Review of Language Games, Datasets, Tasks, and Models. J. Artif. Intell. Res..

Cite DOI URL