Skip to content

stg7/cppsimhash

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

cppsimhash

c++ simhash implementation for documents and an additional (prototyp) simhash index for text documents

Required

  • python3
  • scons
  • g++ (c++14)
  • cpu with hardware aes, cat /proc/cpuinfo | grep "aes" | wc -l should be > 0

Build Steps

Just run scons

Simidx -- usage

add a text document using simidx.py:

# add one document
./simidx.py add textfile

# add a folder
./simidx.py add textfolder

# after you created an index you can query it with
./simidx.py query <document.txt>

Idea

For the approach and core idea have a look at papers in doc.

About

c++ simhash implementation for documents

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published