profile photo

Federico Girella

Research Scientist intern at Canva, PhD at University of Verona, Italy

I am currently a Research Scientist intern for Canva. I obtained my Ph.D. at the University of Verona (IT), and I am a member of IntelligoLabs supervised by Prof. Marco Cristani. My research focuses on Deep Learning models for the Vision and Language multi-modal domain. I work with (Large) Vision and Language Models and Text-to-Image Generative Models. My main body of work revolves around the Fashion domain and the use of Generative Models for image generation.

News

  • Joined Canva as Research Scientist intern 🎨
  • Invited Speaker at 'Best of ICCV25' 🎊
  • Presenting LOTS at ICCV25 in Honolulu 🌺
  • 🏆 LOTS accepted as ORAL at ICCV25
  • Presenting Fashion-ACT at CVPR25 in Nashville 🎶

Publications

LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing

Federico Girella, Davide Talon, Ziyue Liu, Zanxi Ruan, Yiming Wang, Marco Cristani

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Seeing the Abstract: Translating the Abstract Language for Vision Language Models

Davide Talon*, Federico Girella*, Ziyue Liu, Marco Cristani, Yiming Wang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Leveraging Latent Diffusion Models for Training-Free in-Distribution Data Augmentation for Surface Defect Detection

Federico Girella, Ziyue Liu, Franco Fummi, Francesco Setti, Marco Cristani, Luigi Capogrosso

International Conference on Content-Based Multimedia Indexing (CBMI), 2024

Education

Visiting Ph.D. Student

University of Surrey, United Kingdom

Apr 2025 - Sep 2025

  • Supervised by Prof. Yi-Zhe SonG.

Ph.D., Computer Science

University of Verona

Oct 2022 - March 2026

  • Computer Science, Artificial Intelligence.
  • AI for Vision and Language, Generative Models.

Artificial Intelligence. AI for Vision and Language, Generative Models.

M.Sc. Computer Science, Visual Computing

University of Verona

Oct 2019 - May 2022

  • Thesis title: Multi-Task Learning: Pose Estimation for Video Analytics.
  • Final grade: 110/110 Cum Laude

B.Sc. Computer Science, Computer Science and Engineering

University of Verona

Oct 2019 - May 2022

  • Thesis title: Study on Augmented Reality Functionalities.
  • Final grade: 110/110

Skills

Languages

Coding

Interests