Skip to content

GMMDMDIDEMS/abbreviation-extractor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

abbreviation-extractor

The abbreviation-extractor is a tool designed to identify and extract abbreviations from PDF documents. This Rust implementation is inspired by the Schwartz-Hearst1 algorithm and is intended to be useful for researchers, scholars and people dealing with academic PDF content.

References

Footnotes

  1. Schwartz, Ariel & Hearst, Marti. (2003). A Simple Algorithm For Identifying Abbreviation Definitions in Biomedical Text. Pacific Symposium on Biocomputing. Pacific Symposium on Biocomputing. 4. 451-62. 10.1142/9789812776303_0042.

Releases

No releases published

Packages

No packages published