This repository includes the code associated with my Bachelor Thesis called "Analysis of Contract Lifecycles Based on Multiclass Classification Using Natural Language Processing and Machine Learning". In this NLP project, real-world contract documents are classified (into, e.g., attachment, amendment, agreement) employing several feature extraction & selection, dimensionality reduction and machine / deep learning techniques. Finally, the results of the different classification learning algorithms are used in a semi-supervised fashion (self-training) to label the unlabeled documents. Based on the classification, the contract lifecycles consisting of different document types are visualized using a Sankey diagram.
-
Notifications
You must be signed in to change notification settings - Fork 1
timbuendert/BachelorThesis
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Title of thesis: "Analysis of Contract Lifecycles Based on Multiclass Classification Using Natural Language Processing and Machine Learning"
Resources
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published