Antiplagiarism algorithm that finds similarities between documents.

ivaste/pattern_matching


Documents Pattern Matching

An awesome script for matching words of one document to other documents in order to detect plagiarism!
Live Demo · Report Bug · Request Feature

Table of Contents
  1. About The Project
  2. Getting Started
  3. Contributing
  4. Contact
  5. Todo

About The Project

Finds passages in a .txt file that also appear in a set of reference .txt files, to help detect plagiarism.
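
To make the idea concrete, here is a minimal Python sketch of window-based matching between a checked text and reference texts. It is an illustration only, not the repository's code: the function names (`words`, `shared_windows`), the tokenization rule, and the default `window_size` of 5 are all assumptions chosen for the example.

```python
import re


def words(text: str) -> list[str]:
    """Lowercase the text and split it into alphanumeric words (assumed rule)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def shared_windows(checked: str, references: dict[str, str], window_size: int = 5):
    """Return (window, reference_name) pairs for every run of `window_size`
    consecutive words in `checked` that also occurs in a reference text."""
    # Index every window of every reference document by its word tuple.
    index: dict[tuple[str, ...], set[str]] = {}
    for name, text in references.items():
        w = words(text)
        for i in range(len(w) - window_size + 1):
            index.setdefault(tuple(w[i:i + window_size]), set()).add(name)

    # Slide the same window over the checked document and look each one up.
    w = words(checked)
    hits = []
    for i in range(len(w) - window_size + 1):
        window = tuple(w[i:i + window_size])
        for name in sorted(index.get(window, ())):
            hits.append((" ".join(window), name))
    return hits
```

A larger `window_size` reports fewer, longer matches; a smaller one catches short borrowed phrases but also flags more common expressions.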

Getting Started

Installation and Usage

  1. Clone the repo

    git clone https://github.com/ivaste/pattern_matching.git
  2. Convert your .pdf files to .txt with https://pdftotext.com/ (this step will be automated in a future release)

  3. Open the index.html file in your browser

  4. Drag and drop your reference .txt files into the Reference File box

  5. Drag and drop the .txt file to check into the File to Check box

  6. Click the Find Similarities button

Contributing

Contributions are what make the open source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.

  1. Fork the Project
  2. Create your Feature Branch (git checkout -b feature/AmazingFeature)
  3. Commit your Changes (git commit -m 'Add some AmazingFeature')
  4. Push to the Branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

Contact

Stefano Ivancich - stefano-ivancich

Project Link: https://github.com/ivaste/pattern_matching

Todo

  • Multiple pattern matching algorithm with Trie
  • Check if the user uploaded the right file type (.txt)
  • Convert automatically from PDF (or other) to txt. link1
  • Disable the button once it has been clicked
  • Progress bar when calculating. link1 link2-promise
  • Remove non-ASCII characters
  • Better pattern matching algorithm (Aho-Corasick)
  • Let user choose window_size
  • Multiple matching with different windows sizes and then combine the results
  • Redo everything with React
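
For anyone picking up the Aho-Corasick item, here is a minimal Python sketch of that algorithm: it builds a trie of all patterns, adds failure links, and then finds every pattern occurrence in a single pass over the text. It is not code from this repository; the names `build_automaton` and `find_all` are hypothetical, and matching is done on characters for simplicity.

```python
from collections import deque


def build_automaton(patterns):
    """Build trie transitions, failure links, and per-node output lists."""
    goto, fail, out = [{}], [0], [[]]
    for p in patterns:
        if not p:
            continue  # empty patterns are skipped in this sketch
        node = 0
        for ch in p:
            if ch not in goto[node]:
                goto.append({})
                fail.append(0)
                out.append([])
                goto[node][ch] = len(goto) - 1
            node = goto[node][ch]
        out[node].append(p)

    # Breadth-first pass: depth-1 nodes keep failure link 0 (the root).
    queue = deque(goto[0].values())
    while queue:
        node = queue.popleft()
        for ch, nxt in goto[node].items():
            queue.append(nxt)
            f = fail[node]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            # Inherit patterns that end at the failure target.
            out[nxt] = out[nxt] + out[fail[nxt]]
    return goto, fail, out


def find_all(text, patterns):
    """Return (start_index, pattern) for every occurrence of any pattern."""
    goto, fail, out = build_automaton(patterns)
    node, matches = 0, []
    for i, ch in enumerate(text):
        # Follow failure links until a transition on `ch` exists or we hit the root.
        while node and ch not in goto[node]:
            node = fail[node]
        node = goto[node].get(ch, 0)
        for p in out[node]:
            matches.append((i - len(p) + 1, p))
    return matches
```

For example, `find_all("ushers", ["he", "she", "his", "hers"])` returns `[(1, "she"), (2, "he"), (2, "hers")]`. The same structure would work on word tokens instead of characters if matches should respect word boundaries.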
