Mert Bulent Sariyildiz's Personal Website

Mert Bulent Sariyildiz

I was a doctoral researcher at NAVER LABS Europe and Inria Grenoble (the THOTH Team) in France, under the supervision of Yannis Kalantidis, Diane Larlus and Karteek Alahari. My PhD focused on learning general-purpose visual representations from images.

I received my M.Sc. degree from the Computer Engineering Department at Bilkent University in Türkiye, where I worked with Gokberk Cinbis on learning data-efficient visual classification models. Before that, I received my B.Sc. from the Electrical and Electronics Engineering Department at Anadolu (now Eskisehir Technical) University in Türkiye.

Email / CV / Google Scholar / Twitter / GitHub

News

[2025-02] - DUNE is accepted to CVPR 2025!
[2024-09] - Selected as an outstanding reviewer at ECCV 2024!
[2024-08] - UNIC is accepted to ECCV 2024!
[2024-07] - Gave two lectures (on transfer learning and self-supervised learning) at the BAYOYO Machine Learning Summer School in Ankara.
[2024-01] - Ret4Loc is accepted to ICLR 2024!
[2023-07] - Joined NAVER LABS Europe as a research scientist!
[2023-06] - Successfully defended my PhD thesis!
[2023-02] - ImageNet-SD is accepted to CVPR 2023!
[2023-01] - t-ReX is accepted to ICLR 2023 as a spotlight presentation!
[2022-12] - New pre-print on arXiv: Fake it till you make it: Learning(s) from a synthetic ImageNet clone
[2022-10] - Selected as an outstanding reviewer at ECCV 2022.
[2022-06] - Our work Improving the Generalization of Supervised Models is on arXiv.
Pretrained weights for t-ReX and t-ReX* are available on our project webpage.
[2021-08] - The ImageNet-CoG benchmark is accepted to ICCV 2021!
We have a newer version of the manuscript, and our code is out!
[2021-07] - Selected as an outstanding reviewer at CVPR 2021.
[2020-12] - The pre-print of the ImageNet-CoG Benchmark is on arXiv.
[2020-09] - I officially started my PhD at the University of Grenoble!
[2020-09] - MoCHi is accepted to NeurIPS 2020!
[2020-08] - ICMLM is accepted to ECCV 2020! Check out our demo here (it is very cool!).
[2020-08] - "Key protected classification for collaborative learning" is accepted to Pattern Recognition!
[2019-09] - I joined NAVER LABS Europe as a researcher.
[2019-09] - I defended my master's thesis, yay!
[2019-05] - GMN is accepted to CVPR 2019 (oral presentation)!

Research

I'm broadly interested in computer vision problems. If I have to be more specific, I like working on learning visual representations from imagery data with different forms of supervision (including no supervision at all!) so that they are useful for a range of vision tasks.

	DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers Mert Bulent Sariyildiz, Philippe Weinzaepfel, Thomas Lucas, Pau de Jorge, Diane Larlus, Yannis Kalantidis CVPR 2025 In this paper, we define and investigate the problem of heterogeneous teacher distillation, or co-distillation, a challenging multi-teacher distillation scenario where teacher models vary significantly in both (a) their design objectives and (b) the data they were trained on. code & models, poster
	UNIC: Universal Classification Models via Multi-teacher Distillation Mert Bulent Sariyildiz, Philippe Weinzaepfel, Thomas Lucas, Diane Larlus, Yannis Kalantidis ECCV 2024 We distill multiple pretrained models into a single, unified encoder. Beginning with an in-depth analysis of standard distillation techniques using multiple complementary teachers, we introduce a series of enhancements tailored for multi-teacher distillation. Our approach produces student models that match the capacity of the original teachers while matching or surpassing the performance of the best teacher on each task. code & models, poster
	Ret4Loc | Tailoring Retrieval Representations to Long-term Visual Localization Yannis Kalantidis, Mert Bulent Sariyildiz, Rafael S. Rezende, Philippe Weinzaepfel, Diane Larlus and Gabriela Csurka ICLR 2024 Visual localization methods generally rely on a first image retrieval step whose role is crucial. In this paper, we improve this retrieval step and tailor it to the final localization task. We propose to synthesize variants of the training set images, obtained from generative text-to-image models, in order to automatically expand the training set towards a number of nameable variations that particularly hurt visual localization.
	ImageNet-SD | Fake it till you make it: Learning(s) from a synthetic ImageNet clone Mert Bulent Sariyildiz, Karteek Alahari, Diane Larlus and Yannis Kalantidis CVPR 2023 Recent text-to-image generative models generate fairly realistic images. Could such models render real images obsolete for training image prediction models? We answer part of this provocative question by questioning the need for real images when training models for ImageNet-1K classification. We show that models trained on synthetic images exhibit strong generalization properties and perform on par with models trained on real data.
	t-ReX | No reason for no supervision: Improving the generalization of supervised models Mert Bulent Sariyildiz, Yannis Kalantidis, Karteek Alahari and Diane Larlus ICLR 2023 We revisit supervised learning on ImageNet-1K and propose a training setup which improves transfer learning performance of supervised models. project website, OpenReview
	ImageNet-CoG | Concept Generalization in Visual Representation Learning Mert Bulent Sariyildiz, Yannis Kalantidis, Diane Larlus and Karteek Alahari ICCV 2021 We propose a benchmark tailored for measuring concept generalization capabilities of models trained on ImageNet-1K. project website, code, poster, presentation (PDF), presentation (PPT), video
	MoCHi | Hard Negative Mixing for Contrastive Learning Yannis Kalantidis, Mert Bulent Sariyildiz, Noe Pion, Philippe Weinzaepfel and Diane Larlus NeurIPS 2020 For contrastive learning, sampling more or harder negatives often improves performance. We propose two ways to synthesize more negatives using the MoCo framework. project website
	ICMLM | Learning Visual Representations with Caption Annotations Mert Bulent Sariyildiz, Julien Perez and Diane Larlus ECCV 2020 Images often come with accompanying text describing the scene in images. We propose a method to learn visual representations using (image, caption) pairs. project website, demo
	Key protected classification for collaborative learning Mert Bulent Sariyildiz, Ramazan Gokberk Cinbis and Erman Ayday Pattern Recognition, Vol. 104, August 2020 Vanilla collaborative learning frameworks are vulnerable to an active adversary that runs a generative adversarial network attack. We propose a classification model that is resilient against such attacks by design. code repo
	GMN | Gradient Matching Generative Networks for Zero-Shot Learning Mert Bulent Sariyildiz and Ramazan Gokberk Cinbis CVPR 2019, oral presentation Zero-shot learning models may suffer from domain shift due to the difference between the data distributions of seen and unseen concepts. We propose a generative model to synthesize samples for unseen concepts given their visual attributes and use these samples for training a classifier for both seen and unseen concepts. code repo

Huge thanks to Jon Barron, who provided the template for this website.