Skip to content

Releases: gambolputty/newscorpus

2.0.3

28 Dec 18:31
Compare
Choose a tag to compare

2.0.1

27 Dec 15:02
Compare
Choose a tag to compare

Changed

  • Provide example sources.example.json file

Added

  • Add database method iter_articles to iterate over all articles in the database

Removed

  • Remove default sources.json file

v2.0.0

26 Dec 23:02
Compare
Choose a tag to compare

Changed

  • Remove Docker setup and use Poetry for dependencies
  • Replace MongoDB with SQLite

Added

  • Optional CLI arguments

v1.2.0

12 Feb 22:40
Compare
Choose a tag to compare

Added

  • Shell script that executes the crawl command

Changed

  • Directory structure
  • Simplified crawl command: docker-compose run --rm crawler ./crawl.sh

v1.1.0

11 Feb 00:04
8669bc0
Compare
Choose a tag to compare

Added

  • Three new config variables (MONGO_OUTSIDE_PORT, MAX_WORKERS, KEEP_DAYS). See readme for details.
  • Network name for this project (network name)
  • Changelog file

Changed

  • Config variables are now uppercase
  • Readme

v1.0

09 Feb 10:04
Compare
Choose a tag to compare

The initial release.