Please refer to the feishu for better reading experience.
Using requests
to using API.
pdfdeal is a wrapper package for the Doc2X API
- Convert a PDF file to docx
- Convert all PDF files in a folder to docx
- Convert all PDF files in a folder to docx and markdown
- Convert HTML tables in text to Markdown format, note that since Markdown tables do not support merged cells, data misalignment may occur when there are merged cells (especially vertically merged cells).
- Convert HTML tables in text to Markdown format (using beautifulsoup4), note that since Markdown tables do not support merged cells, data misalignment may occur when there are merged cells (especially vertically merged cells).
- Convert HTML tables in text to Markdown format, note that since Markdown tables do not support merged cells, data misalignment may occur when there are merged cells (especially vertically merged cells).