Releases: neuml/paperetl

v2.3.0

28 Dec 19:31

This release adds the following enhancements and bug fixes:

  • Use figure index rather than the xml:id attribute, which is not always present (#46, #51)
  • Add note on GROBID concurrency configuration to README (#50, #52)
  • Fix issue where not all data could be inserted into the SQLite database (#56)
  • Close processes at end of Execute.run method (#57)
  • Require Python >= 3.9 (#58)

v2.2.1

18 Sep 21:16

This release adds the following enhancements and bug fixes:

  • Update setup.py to only show standard image on PyPI (#48)

v2.2.0

18 Sep 20:30

This release adds the following enhancements and bug fixes:

  • Add example notebook (#43)
  • Update CORD-19 scripts (#44)
  • Update minimum Python version to 3.8 (#47)

v2.1.0

20 Jan 14:24

This release adds the following enhancements and bug fixes:

  • Fix issue processing into Elasticsearch (#41)
  • Improve PMB filtering logic (#42)

v2.0.0

12 Mar 01:05

This release adds the following enhancements and bug fixes:

  • Add PubMed as source (#16)
  • Add arXiv as source (#17)
  • Detect month changes in CORD-19 entry date process (#33)
  • Remove study attribute and design models and all related dependencies (#34)
  • Add pre-commit checks (#35)
  • Remove legacy merge logic (#36)
  • Add database flag to determine if database should be replaced (#37)
  • Add multiprocessing support to files process (#38)
  • Support reading compressed files (#39)
  • Require Python 3.7+ (#40)

v1.6.0

23 Apr 12:12

This release adds the following enhancements and bug fixes:

  • Improve sample size extraction (#29)
  • Add generic CSV source (#30)
  • Add common method for accessing Grammar object (#31)
  • Update CORD-19 entry dates source (#32)
  • Limit Docker and setup.py to spaCy 2.x until the attribute/design models are rebuilt

v1.5.0

21 Jan 18:43

This release adds the following enhancements and bug fixes:

  • Add Dockerfile for building paperetl environment (#9)
  • Add component to build entry-dates.csv (#18)
  • Add pre-trained study design models to GitHub (#19)
  • Update README to correct and improve documentation (#20)
  • Ensure length of sections is less than max nlp length (#27)

v1.4.0

13 Jan 14:52

This release adds the following enhancements and bug fixes:

  • Handle PDF parsing exceptions (#22)
  • Increase test coverage (#23)
  • Modify merge method to handle no update merges (#24)
  • Fix bug with JSON export (#25)
  • Fix bug with study model training (#26)

v1.3.0

06 Nov 14:51

This release adds the following enhancements and bug fixes:

  • Add file name as source for file process (#12)
  • Use XML id for file figure processing (#13)
  • Filter duplicate ids (#14)
  • Build test suite (#15)

v1.2.0

11 Oct 18:23

This release adds the following enhancements:

  • Support recursive directory processing (#7)
  • Improve publication date parsing (#8)
  • Add incremental database updates (#10)
  • Remove citations (#11)