GitHub - dimitri009/Clustering-with-DNN: Implementation of the DCN

DCN: Deep Clustering Network

This repository contains a PyTorch implementation of the paper:

"Towards K-means-friendly Spaces: Simultaneous Deep Learning and Clustering" Jianbo Yang, Devin M. Kaufman, Jinsung Yoon, and Mihaela van der Schaar, original paper

The code is originally from : @xuyxu and @guenthereder

I have merged the code in a jupyter notebook and added some minor changes.

Overview

Deep Clustering Network (DCN) is a method that jointly optimizes a deep autoencoder and K-means clustering objective. The goal is to learn a feature space where K-means performs well, combining representation learning and clustering in a unified framework.

image credits.

This implementation includes:

A configurable deep autoencoder
Joint training with K-means loss
Evaluation metrics: NMI and ARI
Comparisons with vanilla K-means (on raw data and autoencoder features)

Install requirements

Scikit-learn:

pip install -U scikit-learn

Pytorch:

pip install torch torchvision (without CUDA) or

pip install torch torchvision --index-url https://download.pytorch.org/whl/cu126 (with CUDA 12.6)

Pandas and Matplot:

pip install pandas , pip install matplotlib

Experiment

Dataset

The dataset used for the experiments is the mnist dataset:

Pre-training

The reconstruction loss:

Training

The ARI and NMI scores during the training:

Test

The ARI, NMI, ACC scores on the test set:

NMI	ARI	ACC
84.22	75.76	83.34

On the original paper:

NMI	ARI	ACC
81.-	75.-	83.-

The ARI, NMI, ACC scores of the vanilla Kmeans:

NMI	ARI	ACC
43.01	39.89	49.00

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
fig		fig
.gitignore		.gitignore
DCN.py		DCN.py
DCN_mnist.ipynb		DCN_mnist.ipynb
LICENSE		LICENSE
README.md		README.md
autoencoder.py		autoencoder.py
kmeans.py		kmeans.py
meanshift.py		meanshift.py
mnist.py		mnist.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DCN: Deep Clustering Network

Overview

Install requirements

Experiment

Dataset

Pre-training

Training

Test

Visualisation of the feature latent space

About

Uh oh!

Releases

Packages

Languages

License

dimitri009/Clustering-with-DNN

Folders and files

Latest commit

History

Repository files navigation

DCN: Deep Clustering Network

Overview

Install requirements

Experiment

Dataset

Pre-training

Training

Test

Visualisation of the feature latent space

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages