GitHub - mahdihosseini/KMCL: Kernel based Multilabel Contrastive Learning

End-to-End Supervised Multilabel Contrastive Learning

A. Sajedi*, S. Khaki*, K. N. Plataniotis, M. S. Hosseini, 'End-to-End Supervised Multilabel Contrastive Learning,' in review.
*: equal contribution
arXiv preprint from here

Abstract

Multilabel representation learning is recognized as a challenging problem that can be associated with either label dependencies between object categories or data related issues such as the inherent imbalance of positive/negative samples. Recent advances address these challenges from model- and data-centric viewpoints. In model-centric, the label correlation is obtained by an external model designs (e.g., graph CNN) to incorporate an inductive bias for training. However, they fail to design an end-to-end training framework, leading to high computational complexity. On the contrary, in data-centric, the realistic nature of the dataset is considered for improving the classification while ignoring the label dependencies. In this paper, we propose a new end-to-end training framework–dubbed KMCL (Kernel-based Mutlilabel Contrastive Learning)–to address the shortcomings of both model- and data-centric designs. The KMCL first transforms the embedded features into a mixture of exponential kernels in Gaussian RKHS. It is then followed by encoding an objective loss that is comprised of (a) reconstruction loss to reconstruct kernel representation, (b) asymmetric classification loss to address the inherent imbalance problem, and (c) contrastive loss to capture label correlation. The KMCL models the uncertainty of the feature encoder while maintaining a low computational footprint. Extensive experiments are conducted on both computer vision and medical image classification tasks to showcase the consistent improvements of KMCL over the SOTA methods.

PyTorch Implementation for KMCL

In this PyTorch file, we provide the KMCL implementations which augments the base ASL loss function.

For the multi-label case (sigmoids), the implementation is :

class KMCL_Loss(nn.Module)

This class leverages three separate loss formulations:

ASL
Reconstruction Loss
KMCL Loss The losses and generic class take keyword arguments to specify the loss case (either "isotropic" or "anisotropic") as well as the distance measure (Bhattacharya, Mahalonbis, RBF).

Running either train.py or train_dist.py will execute the end-to-end contrastive learning for the dataset of interest. The latter runs the script under a fully distributed framework enabling our method to scale easily to larger image resolutions, architectures, and batch size configurations.

Pretrained Models & Results

We provide our pre-trained models and results on several datasets across both the TResNet-M and TResNet-L architectures.

Backbone	Input Size	Dataset	mAP
KMCL TResNet_M	224	MS-COCO	82.1
KMCL TResNet_L	448	MS-COCO	88.6
KMCL TResNet_M	224	PASCAL-VOC	95.2
KMCL TResNet_L	448	PASCAL-VOC	96.2
KMCL TResNet_M	224	Chest-Xray14	30.0
KMCL TResNet_L	448	Chest-Xray14	31.5
KMCL TResNet_M	224	ADP	95.1
KMCL TResNet_L	448	ADP	96.5

This project source code is based on https://github.com/Alibaba-MIIL/ASL.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
src		src
Kmcl_Class.py		Kmcl_Class.py
MODELS.md		MODELS.md
README.md		README.md
meters.py		meters.py
train.py		train.py
train_dist.py		train_dist.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

End-to-End Supervised Multilabel Contrastive Learning

PyTorch Implementation for KMCL

Pretrained Models & Results

About

Releases

Packages

Contributors 2

Languages

mahdihosseini/KMCL

Folders and files

Latest commit

History

Repository files navigation

End-to-End Supervised Multilabel Contrastive Learning

PyTorch Implementation for KMCL

Pretrained Models & Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages