GitHub - v-nafiseh/textProcessing: Text processing program which crawls imdb and extracts keywords with TextRank algorithm and crawls Digikala special offers and extracts some feature and shows them on web using Django framework

teamwork with

extracting keywords from storylines
maintaining a weighted graph between movies in which the movies' names are nodes & links are common keywords
saving graph details as csv file

scraping with BeautifulSoup library
using regex for extracting exact details
saving files into json and csv format
using django fixtures for populating database with the data derived from previous steps

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
django_pr/env		django_pr/env
.gitignore		.gitignore
Figure_1.png		Figure_1.png
README.md		README.md
digi.py		digi.py
imdb_final.py		imdb_final.py
product_detail.csv		product_detail.csv

Provide feedback