DBSCAN-Cluster

DBSCAN implementation in C. Uses a quadtree data structure to handle very large, sparse, binary feature spaces. Implements Jaccard distance as the default distance metric (neighbours.c).
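For readers unfamiliar with the metric, here is a minimal pure-Python sketch of Jaccard distance between two documents, each treated as a set of labels. It is only an illustration and not the code in neighbours.c; the function name and the convention for two empty sets are assumptions.

```python
def jaccard_distance(a, b):
    """Jaccard distance between two label sets: 1 - |a & b| / |a | b|."""
    a, b = set(a), set(b)
    union = a | b
    if not union:
        # Assumed convention: two empty sets are treated as identical.
        return 0.0
    return 1.0 - len(a & b) / len(union)
```

Two documents with the same labels are at distance 0, and two documents with no labels in common are at distance 1.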

Building and running the tests

Use cmake . to generate the build files.
Run make to build the library, applications and Python SWIG wrapper (if applicable).
Run make test to run the tests.

Using the Python wrapper

See test.py for an example.

Create the quadtree

  import pydbscan
  tree = pydbscan.create_quadtree(8, 8)

It's recommended to have height and width as the same power of two.
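One way to pick that size is to round the larger of the document count and the label count up to the next power of two. The helper below is a hypothetical sketch, not part of pydbscan, and it assumes valid coordinates run from 0 to side - 1, which this README does not state.

```python
def quadtree_side(n_documents, n_labels):
    """Smallest power of two that is >= both counts (hypothetical helper)."""
    needed = max(n_documents, n_labels, 1)
    side = 1
    while side < needed:
        side *= 2
    return side
```

With 6 documents and 5 distinct labels this gives 8, which matches the create_quadtree(8, 8) call above.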

Insert points into the quadtree

  pydbscan.quadtree_insert(tree, 0, 1) # Sets a 1 for x = 0, y = 1

This function returns 0 if the insert did not succeed (e.g. the point was out of range or was already set).
For most typical applications, the document is the x coordinate (the first argument after the tree) and the label is the y coordinate (the second).
It's recommended to count documents from zero.
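As a sketch of how a whole collection might be loaded, the helper below numbers documents from zero and records every pair whose insert returned 0. The helper name and its parameters are hypothetical; the insert function is passed in so the loop can be tried without the compiled wrapper.

```python
def insert_documents(insert, tree, documents):
    """Insert every (document, label) pair and return the pairs that failed.

    insert    -- a function with the shape of pydbscan.quadtree_insert
    tree      -- the quadtree handle passed through to insert
    documents -- a list of label lists; the list index is the document number
    """
    failed = []
    for doc_id, labels in enumerate(documents):
        for label in labels:
            # A return value of 0 means out of range or already set.
            if insert(tree, doc_id, label) == 0:
                failed.append((doc_id, label))
    return failed
```

With the real wrapper this would be called as insert_documents(pydbscan.quadtree_insert, tree, documents).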

Cluster

  pydbscan.pyDBSCAN(tree, 6, 0.67, 2)

The first argument is the quadtree.
The second is the number of documents input into the quadtree.
The third is the epsilon value (points are considered part of another's neighbourhood if their distance is less than 1 - epsilon).
The fourth is the minimum number of points needed to make a cluster.
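To make the epsilon and minimum-points parameters concrete, the pure-Python sketch below finds the documents that have enough neighbours to seed a cluster. It is an illustration under assumptions, not the library's code: documents are plain Python sets of labels, and a document is counted as its own neighbour.

```python
def jaccard_distance(a, b):
    union = a | b
    return 1.0 - len(a & b) / len(union) if union else 0.0

def core_documents(documents, epsilon, min_points):
    """Indices of documents with at least min_points neighbours.

    A neighbour is any document whose Jaccard distance is below 1 - epsilon.
    Assumption: each document counts itself as a neighbour.
    """
    threshold = 1.0 - epsilon
    cores = []
    for i, doc in enumerate(documents):
        count = sum(1 for other in documents
                    if jaccard_distance(doc, other) < threshold)
        if count >= min_points:
            cores.append(i)
    return cores
```

With epsilon = 0.67 the distance threshold is 0.33, so two documents are neighbours only when they share more than about two thirds of their combined labels.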

Using the C API

quadtree_init allocates and initialises the quadtree (arguments must be one less than a power of two).
quadtree_insert adds a document-label pair to the quadtree, and returns a non-zero value if the insert succeeded.
DBSCAN takes a reference to a quadtree, an array of unsigned integers with one entry per document, the number of documents, the epsilon value, the minimum number of points, and a pointer to a neighbourhood distance function.
neighbours_search implements neighbourhood filtering via the Jaccard index.
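The shape of that call (an output array with one slot per document, plus a pluggable neighbourhood function) can be sketched in pure Python as below. This is a generic DBSCAN outline written for illustration, not a transcription of dbscan.c. The callback signature and the use of 0 for unclustered documents are assumptions.

```python
def dbscan(n_documents, neighbours, min_points):
    """Return one cluster id per document; 0 means noise (assumed convention).

    neighbours(i) is a hypothetical callback returning the indices of all
    documents within range of document i, including i itself.
    """
    labels = [0] * n_documents
    visited = [False] * n_documents
    cluster = 0
    for start in range(n_documents):
        if visited[start]:
            continue
        visited[start] = True
        seeds = neighbours(start)
        if len(seeds) < min_points:
            continue  # not a core point; may still be claimed by a cluster later
        cluster += 1
        labels[start] = cluster
        stack = list(seeds)
        while stack:
            j = stack.pop()
            if labels[j] == 0:
                labels[j] = cluster  # claim unlabelled points, including old noise
            if visited[j]:
                continue
            visited[j] = True
            reach = neighbours(j)
            if len(reach) >= min_points:
                stack.extend(reach)  # j is a core point, so keep expanding
    return labels
```

Passing the neighbourhood test as a function mirrors the function-pointer argument described above and lets a different metric be swapped in without touching the clustering loop.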

Sentimentron/DBSCAN-Cluster
