-
Notifications
You must be signed in to change notification settings - Fork 0
Summarization of Text from PDF, URL as a source text; Text and Image Extraction form Web Link/URLs & PDF
License
deepak-mandal/DueDash-Germany
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
(a). From Any Web Link - could be generated summary x percentage (eg. 50%) of the Original web source Text data. & finally created a Word cloud.
(b). From Any PDF file - Generated summary of the .pdf file, and their word cloud
Colab: https://colab.research.google.com/drive/1uWLS3FeO1U9jUCQCjtlQxJMstc28GupJ?usp=sharing
(a). Generated the formated HTML file from source code
(b). Extraced all the Images from the web Link, and Downloaded into a folder automatically
(c). Extracted Various text data such as paragraph tags, anchor tags, header tags, Further saved all data in a file
Further Extracted Text and Image Data from the PDF file format.
Colab: https://colab.research.google.com/drive/1lyBAsNTcgpi0-7yh7Mycbx2qvCEhQKAD?usp=sharing
About
Summarization of Text from PDF, URL as a source text; Text and Image Extraction form Web Link/URLs & PDF
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published