Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
python
machine-learning
information-retrieval
clustering
tika
cosine-similarity
jaccard-similarity
cosine-distance
similarity-score
tika-similarity
metadata-features
tika-python
-
Updated
Mar 26, 2024 - Python