Federico Girella

Research Scientist intern at Canva, PhD at University of Verona, Italy

I am currently a Research Scientist intern for Canva. I obtained my Ph.D. at the University of Verona (IT), and I am a member of IntelligoLabs supervised by Prof. Marco Cristani. My research focuses on Deep Learning models for the Vision and Language multi-modal domain. I work with (Large) Vision and Language Models and Text-to-Image Generative Models. My main body of work revolves around the Fashion domain and the use of Generative Models for image generation.

Google Scholar

Linkedin

Github

News

26 Jan 2026

Joined Canva as Research Scientist intern 🎨
1 Nov 2025

Invited Speaker at 'Best of ICCV25' 🎊
23 Oct 2025

Presenting LOTS at ICCV25 in Honolulu 🌺
25 Jul 2025

🏆 LOTS accepted as ORAL at ICCV25
11 Jun 2025

Presenting Fashion-ACT at CVPR25 in Nashville 🎶

Publications

LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing

Federico Girella, Davide Talon, Ziyue Liu, Zanxi Ruan, Yiming Wang, Marco Cristani

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Seeing the Abstract: Translating the Abstract Language for Vision Language Models

Davide Talon, Federico Girella, Ziyue Liu, Marco Cristani, Yiming Wang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Leveraging Latent Diffusion Models for Training-Free in-Distribution Data Augmentation for Surface Defect Detection

Federico Girella, Ziyue Liu, Franco Fummi, Francesco Setti, Marco Cristani, Luigi Capogrosso

International Conference on Content-Based Multimedia Indexing (CBMI), 2024

See more

Education

Visiting Ph.D. Student

University of Surrey, United Kingdom

Apr 2025 - Sep 2025

Supervised by Prof. Yi-Zhe SonG.

Ph.D., Computer Science

University of Verona

Oct 2022 - March 2026

Computer Science, Artificial Intelligence.
AI for Vision and Language, Generative Models.

Artificial Intelligence. AI for Vision and Language, Generative Models.

M.Sc. Computer Science, Visual Computing

University of Verona

Oct 2019 - May 2022

Thesis title: Multi-Task Learning: Pose Estimation for Video Analytics.
Final grade: 110/110 Cum Laude

B.Sc. Computer Science, Computer Science and Engineering

University of Verona

Oct 2019 - May 2022

Thesis title: Study on Augmented Reality Functionalities.
Final grade: 110/110

Skills

Languages

Coding

Interests