Skip to content

Vectors - Nearest neighbor search and Clustering using LSH, Hypercube (and Lloyd's only at the clustering) algorithms with L2 metric.

Notifications You must be signed in to change notification settings

Sitaras/Software-Development-for-Algorithmic-Problems_Project-1

Folders and files

NameName
Last commit message
Last commit date

Latest commit

c9d61ba · Apr 4, 2022
Nov 10, 2021
Nov 10, 2021
Nov 10, 2021
Nov 10, 2021
Nov 10, 2021
Nov 10, 2021
Nov 13, 2021
Apr 4, 2022
Nov 13, 2021
Nov 13, 2021
Nov 13, 2021
Nov 10, 2021
Nov 10, 2021

Repository files navigation

Project 1

Development Team

Part 1

Given a input dataset with vectors and a query dataset, for every query vector find:

  • The true nearest neighbor of the input dataset.
  • The aproximate nearest neighbor of the input dataset.
  • The aproximate N nearest neighbors of the input dataset.
  • All vectors inside a given range R. (approximate search)

In order to find the aproximate nearest neighbor(s) we use:

Part 2

Vector Clustering.

The initialization of the clusters is done using kMeans++.

The assignment to each cluster can be performed by each of the following:

  • Lloyds assignment.
  • LSH reverse assignment using Range search.
  • Hypercube reverse assignment using Range search.