GitHub - KushagraChauhan/Auto-Tagger

Auto-Tagger

This repo extracts tags (keywords) from a file. Install the required modules with:

	pip3 install -r requirements.txt

How to run the file-

	python3 main_app.py

After that open:

	http://localhost:4500/file-upload

and upload the files.

The 3 main files and their functions, in order-

app.py: Download the file using URL and give the response
main.py: Work on the server as a system API and store the tags in a DB
main_app.py: Work in-sync with KM, and give real-time response

Keyword extraction technique used-

RAKE-

Rapid Automatic Keyword Extraction (RAKE) is a well-known keyword extraction method which uses a list of stopwords and phrase delimiters to detect the most relevant words or phrases in a piece of text.
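To make the description above concrete, here is a minimal sketch of the RAKE idea in plain Python. It is not the code from this repo: the function names and the tiny `STOPWORDS` set are assumptions for illustration (the repo ships its own stop-list.txt). Stopwords and punctuation split the text into candidate phrases, each word is scored as degree divided by frequency, and a phrase scores the sum of its word scores.

```python
import re
from collections import defaultdict

# Tiny illustrative stopword set (an assumption, not the repo's stop-list.txt).
STOPWORDS = {"a", "an", "and", "are", "as", "at", "by", "for", "from",
             "in", "is", "of", "on", "or", "the", "to", "which", "with"}


def candidate_phrases(text, stopwords):
    """Split text into candidate phrases at punctuation and stopwords."""
    phrases = []
    # Punctuation acts as a phrase delimiter.
    for fragment in re.split(r"[.,;:!?()\[\]\n]+", text.lower()):
        current = []
        for word in re.findall(r"[a-z0-9][a-z0-9'-]*", fragment):
            if word in stopwords:
                # A stopword closes the phrase collected so far.
                if current:
                    phrases.append(tuple(current))
                    current = []
            else:
                current.append(word)
        if current:
            phrases.append(tuple(current))
    return phrases


def word_scores(phrases):
    """Score each word as degree / frequency."""
    freq = defaultdict(int)
    degree = defaultdict(int)
    for phrase in phrases:
        for word in phrase:
            freq[word] += 1
            # Degree counts co-occurring words in the phrase, including the word itself.
            degree[word] += len(phrase)
    return {word: degree[word] / freq[word] for word in freq}


def rake(text, stopwords=STOPWORDS, top_n=5):
    """Return the top_n (phrase, score) pairs, highest score first."""
    phrases = candidate_phrases(text, stopwords)
    scores = word_scores(phrases)
    ranked = {" ".join(p): sum(scores[w] for w in p) for p in phrases}
    # Sort by descending score, then alphabetically for stable output.
    return sorted(ranked.items(), key=lambda kv: (-kv[1], kv[0]))[:top_n]


if __name__ == "__main__":
    sample = ("Keyword extraction is the task of finding keywords. "
              "Rapid automatic keyword extraction uses stopwords and phrase delimiters.")
    for phrase, score in rake(sample):
        print(f"{score:5.1f}  {phrase}")
```

Because scores are summed per word, longer phrases tend to rank higher, so the quality of the stopword list largely decides where phrases are cut.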

Parser used to parse the text from a pdf or a docx-

TIKA-

Tika works on .pdf, the most recent OOXML Microsoft Office file types and older binary file formats such as .doc, .ppt and .xls.
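As a rough sketch of the parsing step, assuming the `tika` Python package (tika-python) is the binding in use: its `parser.from_file` returns a dict whose `"content"` key holds the extracted text. The helper names `extract_text` and `normalize_whitespace` are hypothetical and not taken from this repo. Note that tika-python starts a local Tika server and needs a Java runtime.

```python
import re


def normalize_whitespace(text):
    """Collapse the runs of whitespace that text extraction tends to leave behind."""
    return re.sub(r"\s+", " ", text or "").strip()


def extract_text(path):
    """Return the plain text of a .pdf, .docx, .doc, etc. via Apache Tika."""
    # Imported lazily so this module loads even when tika is not installed.
    from tika import parser  # tika-python; requires Java on the machine

    parsed = parser.from_file(path)
    # "content" can be None when Tika finds no text (e.g. a scanned PDF).
    return normalize_whitespace(parsed.get("content"))
```

The cleaned text can then be handed to the keyword extraction step.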

Repository files-

README.md
app.py
main.py
main_app.py
requirements.txt
stop-list.txt
