[Service] Non article like documents are not properly extracted. #2

melphi · 2017-06-29T12:12:00Z

For example, this URL is not scraped because it is a listing page and does not look like an article:
http://stopfake.org/en/news

Evaluate the Python library newspaper for parsing the remaining articles: https://github.com/codelucas/newspaper
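A rough sketch of what such an evaluation could look like, assuming the newspaper3k package is installed. The function names `extract_listing` and `is_usable`, the `limit` parameter, and the 200-character threshold are made up for illustration and are not part of this service. The idea is to treat the listing page as a source, let newspaper discover the linked articles, and then check how many of them yield usable text.

```python
def is_usable(record, min_chars=200):
    """Heuristic (assumption): a parsed article is usable when it has a
    non-empty title and at least `min_chars` characters of body text."""
    title = (record.get("title") or "").strip()
    text = (record.get("text") or "").strip()
    return bool(title) and len(text) >= min_chars


def extract_listing(url, limit=5):
    """Treat `url` as a listing page and parse the articles it links to.

    Requires the third-party newspaper3k package; it is imported lazily so
    that the helper above can be used without it.
    """
    import newspaper
    from newspaper.article import ArticleException

    # memoize_articles=False makes repeated runs return the same articles
    # instead of only the ones not seen before.
    source = newspaper.build(url, memoize_articles=False)

    records = []
    for article in source.articles[:limit]:
        try:
            article.download()
            article.parse()
        except ArticleException:
            # Skip articles that could not be downloaded or parsed.
            continue
        records.append(
            {"url": article.url, "title": article.title, "text": article.text}
        )
    return records


# Example (performs network requests, so it is left commented out):
# records = extract_listing("http://stopfake.org/en/news")
# usable = [r for r in records if is_usable(r)]
# print(len(usable), "of", len(records), "articles look usable")
```

Comparing the usable count against what the current extractor returns for the same page would give a first indication of whether newspaper is worth adopting.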

melphi changed the title ~~[Service] Non article like documents are not extracted.~~ [Service] Non article like documents are not properly extracted. Jun 29, 2017
