You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, Docling recognizes those as the main text usually. Please see attached.
Such manuscript templates are quite popular within the IEEE organization and contain important meta-information about the authors.
Would it be possible to retrain/fine-tune the model so that Docling would recognize those as footnotes? Thank you!
Alternatives
I tried to use a simple rule-based approach to check the font size, but unfortunately docling doesn't extract the font size. And it probably wouldn't be that reliable anyway.
The text was updated successfully, but these errors were encountered:
@q0oz Yes, we plan to retrain the layout segmentation. Thank you for providing this sample, it was not on our radar and we definitely need to fix this!
Requested feature
...
Hello,
It looks like Docling doesn't recognize footnotes properly for some IEEE manuscripts. For example,
https://arxiv.org/pdf/2503.08661
https://arxiv.org/pdf/2503.08027
https://arxiv.org/pdf/2503.08609
Currently, Docling recognizes those as the main text usually. Please see attached.
Such manuscript templates are quite popular within the IEEE organization and contain important meta-information about the authors.
Would it be possible to retrain/fine-tune the model so that Docling would recognize those as footnotes? Thank you!
Alternatives
I tried to use a simple rule-based approach to check the font size, but unfortunately docling doesn't extract the font size. And it probably wouldn't be that reliable anyway.
The text was updated successfully, but these errors were encountered: