GitHub - Koren-Ben-Ezra/kmeans-algorithm: k-means algorithm implemented in C and provided as a Python module.

📚 Academic Course Project

As part of an academic course, I implemented the K-means++ algorithm in C and exposed it to Python as an extension module.

🚀 Practical Applications of K-means++ Algorithm

The K-means++ algorithm has many real-world uses. In marketing, it helps companies group customers by their buying habits for better targeting. In image processing, it compresses images by grouping similar pixel colors together, which saves storage space at a small cost in quality. In social networks, it identifies communities by grouping users with similar interactions, which is useful for targeted content and network analysis.

🛠️ How to Use the Project

1. Provide arguments

K - the number of required clusters
iter - the maximum number of iterations
eps - the convergence threshold

2. Data Preparation

Combine the input files with an inner join, using the first column as the key, and sort the resulting data points by that key in ascending order.
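
The join-and-sort step can be sketched as follows. This is a minimal illustration using only the standard library, not the code in kmeans_pp.py. It assumes comma-separated inputs without a header row and a numeric key in the first column; the function name inner_join_sorted and the sample data are hypothetical.

```python
import csv
import io

def inner_join_sorted(file_a, file_b):
    """Inner-join two CSV streams on their first column, sorted by that key."""
    def load(stream):
        # Map key -> list of float coordinates taken from the remaining columns.
        table = {}
        for row in csv.reader(stream):
            if row:
                table[float(row[0])] = [float(x) for x in row[1:]]
        return table

    a, b = load(file_a), load(file_b)
    # Keep only keys present in both inputs, in ascending key order.
    return [(key, a[key] + b[key]) for key in sorted(a.keys() & b.keys())]

# Hypothetical in-memory inputs standing in for the two input files.
file_1 = io.StringIO("2,0.5,1.5\n0,1.0,2.0\n1,3.0,4.0\n")
file_2 = io.StringIO("1,9.0\n3,7.0\n0,8.0\n")
points = inner_join_sorted(file_1, file_2)
```

Keys 2 and 3 each appear in only one input, so the inner join drops them and only the rows for keys 0 and 1 remain.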

3. Interfacing with C Extension

Import the C extension module (import mykmeanssp), call its fit() function with the initial centroids and the data points, and retrieve the final centroids.
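
The initial centroids handed to fit() are the part that makes this K-means++ rather than plain K-means. Below is a minimal sketch of the standard K-means++ seeding rule, not the project's own code: the function name kmeans_pp_init and the seed parameter are hypothetical, and the sketch assumes the data contains at least k distinct points.

```python
import math
import random

def kmeans_pp_init(points, k, seed=0):
    """Pick k initial centroids with K-means++ seeding; returns their indices."""
    rng = random.Random(seed)
    # The first centroid is chosen uniformly at random.
    chosen = [rng.randrange(len(points))]
    while len(chosen) < k:
        # Squared distance from each point to its nearest chosen centroid.
        weights = [
            min(math.dist(p, points[c]) ** 2 for c in chosen)
            for p in points
        ]
        # Sample the next centroid with probability proportional to that weight.
        # Already-chosen points have weight 0, so they cannot be picked again.
        chosen.append(rng.choices(range(len(points)), weights=weights, k=1)[0])
    return chosen

# Hypothetical sample data: two well-separated pairs of points.
sample = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
initial = kmeans_pp_init(sample, k=2)
```

The points at the selected indices would then be passed to fit() as the initial centroids. The exact argument list of fit() is not documented here, so check kmeansmodule.c before relying on a particular call shape.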

📝 Appendix: Special Mathematical Matrices

The implementation involves several mathematical concepts and algorithms, including:



Euclidean Distance: Measure of distance between data points.

Cluster Assignment: Assigning data points to the closest cluster based on distance.

Update Centroids: Recalculating centroids based on data points in each cluster.

Convergence Criteria: Checking for convergence based on centroid updates and maximum iteration count.
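
The four concepts above fit together in one loop. The following pure-Python sketch is a reference illustration, not a port of kmeansmodule.c. The function name lloyd, the default values for max_iter and eps, and the choice to stop once every centroid has moved less than eps are all assumptions.

```python
import math

def lloyd(points, centroids, max_iter=300, eps=1e-3):
    """Run k-means iterations from the given initial centroids."""
    centroids = [list(c) for c in centroids]
    for _ in range(max_iter):
        # Cluster assignment: each point goes to its nearest centroid
        # (Euclidean distance via math.dist).
        clusters = [[] for _ in centroids]
        for p in points:
            nearest = min(range(len(centroids)),
                          key=lambda j: math.dist(p, centroids[j]))
            clusters[nearest].append(p)
        # Update centroids: mean of each cluster (an empty cluster keeps its centroid).
        updated = [
            [sum(col) / len(members) for col in zip(*members)] if members else c
            for members, c in zip(clusters, centroids)
        ]
        # Convergence: stop once every centroid moved less than eps.
        shift = max(math.dist(c, u) for c, u in zip(centroids, updated))
        centroids = updated
        if shift < eps:
            break
    return centroids

# Hypothetical sample data and initial centroids.
data = [(0, 0), (0, 2), (10, 10), (10, 12)]
final = lloyd(data, [(0, 0), (10, 10)])
```

On this sample the first pass moves the centroids to the cluster means, and the second pass detects zero movement and stops well before the iteration cap.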
