Original code
Pre-release
madelonhulsebos released this 09 Feb 11:57
This release reflects the code that was used for the experiments in the paper "Sherlock: a deep learning approach to semantic data type detection" (link to the paper on arXiv). This release provides code for:
- Downloading the original training and test data used for the experimental results reported in the paper.
- Extracting features to numerically represent new columns.
- Evaluating a trained Sherlock model on unseen table columns.
- Retraining the original Sherlock model.
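
As a rough illustration of what "numerically representing a column" can mean, the sketch below maps a list of cell values to a small, fixed set of numeric statistics. It is not the Sherlock feature extractor and does not use this release's API: the function name `extract_column_features` and the five features it computes are assumptions chosen purely for illustration.

```python
import statistics


def extract_column_features(values):
    """Hypothetical helper (not part of Sherlock): summarize a column as numbers.

    `values` is a list of cell values; None entries are treated as missing.
    """
    cells = [str(v) for v in values if v is not None]
    n = len(cells)
    if n == 0:
        # An empty column still yields a vector with the same keys.
        return {
            "n_values": 0,
            "frac_unique": 0.0,
            "mean_length": 0.0,
            "std_length": 0.0,
            "frac_digit_chars": 0.0,
        }

    lengths = [len(c) for c in cells]
    total_chars = sum(lengths)
    digit_chars = sum(ch.isdigit() for c in cells for ch in c)

    return {
        "n_values": n,
        "frac_unique": len(set(cells)) / n,           # share of distinct values
        "mean_length": statistics.fmean(lengths),     # average cell length
        "std_length": statistics.pstdev(lengths),     # spread of cell lengths
        "frac_digit_chars": digit_chars / total_chars if total_chars else 0.0,
    }


if __name__ == "__main__":
    print(extract_column_features(["10", "20", "20", None]))
```

Because every column produces the same keys regardless of its length, the resulting values can be stacked into a fixed-width matrix, which is the shape a classifier typically expects as input.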
This release contains inefficiencies and bugs, so the latest release of this project is recommended for production settings and new research projects. More about this project can be found on this website.