Yavuz, Mehmet Can and Yanıkoğlu, Berrin (2022) VCL-PL: semi-supervised learning from noisy web data with variational contrastive learning. In: 26th International Conference on Pattern Recognition (ICPR), Montreal, Canada
PDF
EK-7-ICPR_VCL_PL.pdf
Download (527kB)
EK-7-ICPR_VCL_PL.pdf
Download (527kB)
Official URL: http://dx.doi.org/10.1109/ICPR56361.2022.9956152
Abstract
We address the problem of web supervised learning, in particular for face attribute classification. Web data suffers from image set noise, due to unrelated images that may be retrieved in response to the query. We propose a semi-supervised pseudo-labeling approach where the embedding space distribution
is learnt via variational contrastive learning. We use 40 Gaussian sampling heads for the 40 attributes in the CelebA dataset and apply supervised contrastive learning over a limited amount of labelled data, to address the multi-label face attribute classification problem. Soft pseudo-labeling is then used to label the unlabelled data at attribute level, followed by two-stage domain adaptation. We show that the proposed method using noisy web data
brings improvements in accuracy over supervised multi-label face attribute classification in all experimental settings (over 2% points for very low-data setting). We suggest that learning the embedding distribution and the subsequent soft pseudo-labeling according to the nearest neighbors help in overcoming the noise in the unlabeled data.
Item Type: | Papers in Conference Proceedings |
---|---|
Divisions: | Faculty of Engineering and Natural Sciences |
Depositing User: | Berrin Yanıkoğlu |
Date Deposited: | 28 Sep 2022 14:34 |
Last Modified: | 10 Apr 2023 15:56 |
URI: | https://research.sabanciuniv.edu/id/eprint/44638 |