Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support <strong> tag in HTML #1138

Open
remod opened this issue Mar 9, 2025 · 2 comments
Open

Support <strong> tag in HTML #1138

remod opened this issue Mar 9, 2025 · 2 comments
Assignees
Labels
enhancement New feature or request

Comments

@remod
Copy link

remod commented Mar 9, 2025

Requested feature

Right now, if an HTML page contains a pseudo-header which is wrapped with a <strong> tag, docling skips it. An example page which contains such a tag can be found here.

I think that ideally it would be including it as bold text.

Alternatives

...

@remod remod added the enhancement New feature or request label Mar 9, 2025
@ceberam
Copy link
Contributor

ceberam commented Mar 10, 2025

Thanks @remod to submit this issue.
Formatted text in HTML is indeed skipped unless it is part of a paragraph or another supported tag.
This will be addressed soon together with other formatting styles, once the data schema in docling-core supports it. There is a draft in progress (docling-project/docling-core#182)

@ceberam ceberam self-assigned this Mar 10, 2025
@remod
Copy link
Author

remod commented Mar 10, 2025

@ceberam Thanks for circling back, that's great to hear! And thank you for the great tool!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants