Releases: VikParuchuri/marker
Releases · VikParuchuri/marker
Faster OCR
- OCR is now ~2.5x faster, due to improvements in surya
Speed up inference
- (from surya) faster ocr, line detection, layout inference
- Unpin transformers version after testing
Should be significantly faster now, but haven't fully benchmarked, since I'm running low on time this week!
Fix memory leak
- Fix a memory leak (fixed in surya, bumped the version). This caused high CPU memory usage on long docs.
- Improve load_all_models to take device and dtype
Marker v2
Basically a full rewrite!
Main features:
- Extracts and saves images
- Improved table formatting
- Better markdown wrapping
- Better reading order on complex docs
- Improved OCR engine with more language options
- Simple pip package install (no more required system dependencies), so can be used easily on Windows
- Can be used commercially (pymupdf and layoutlmv3 dependencies removed)
It takes ~2x as long to run now, but seems like a decent tradeoff.
See the README for details.