profile photo

Federico Girella

PhD Student, University of Verona, Italy

I am a third year Ph.D. student at the University of Verona (IT), and member of IntelligoLabs supervised by Prof. Marco Cristani. My research focuses on Deep Learning models for the Vision and Language multi-modal domain. I work with (Large) Vision and Language Models and Text-to-Image Generative Models. My main body of work revolves around the Fashion domain and the use of Generative Models for image generation. I have collaborated for a year with Humatics SYS-DAT developing AI tools for the fashion industry.
Currently looking for a Full-Time position as AI Research Scientist!

News

  • 🏆 LOTS accepted as ORAL at ICCV25
  • Presenting Fashion-ACT at CVPR25 in Nashville
  • Visiting Period at the University of Surrey w/ Prof. Yi-Zhe SonG
  • 🏆 Fashion-ACT accepted at CVPR25
  • Attending ECCV24 in Milan

Publications

LOTS of Fashion! Multi-Conditioning for Image Generation via Sketch-Text Pairing

Federico Girella, Davide Talon, Ziyue Liu, Zanxi Ruan, Yiming Wang, Marco Cristani

Proceedings of the IEEE/CVF International Conference on Computer Vision, 2025

Seeing the Abstract: Translating the Abstract Language for Vision Language Models

Davide Talon*, Federico Girella*, Ziyue Liu, Marco Cristani, Yiming Wang

Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2025

Leveraging Latent Diffusion Models for Training-Free in-Distribution Data Augmentation for Surface Defect Detection

Federico Girella, Ziyue Liu, Franco Fummi, Francesco Setti, Marco Cristani, Luigi Capogrosso

International Conference on Content-Based Multimedia Indexing (CBMI), 2024

Education

Visiting Ph.D. Student

University of Surrey, United Kingdom

Apr 2025 - Sep 2025

  • Supervised by Prof. Yi-Zhe SonG.

Ph.D. Student, Computer Science

University of Verona

Oct 2022 - May 2026

  • Computer Science, Artificial Intelligence.
  • AI for Vision and Language, Generative Models.

Artificial Intelligence. AI for Vision and Language, Generative Models.

M.Sc. Computer Science, Visual Computing

University of Verona

Oct 2019 - May 2022

  • Thesis title: Multi-Task Learning: Pose Estimation for Video Analytics.
  • Final grade: 110/110 Cum Laude

B.Sc. Computer Science, Computer Science and Engineering

University of Verona

Oct 2019 - May 2022

  • Thesis title: Study on Augmented Reality Functionalities.
  • Final grade: 110/110

Skills

Languages

Coding

Interests