Skip to content

h1 h2 h3 tags are removed #48

@anthony-foulfoin

Description

@anthony-foulfoin

I don't know if it is on purpose or not, but the h* tags are removed from the parsed articles. For instance: http://www.liberation.fr/france/2017/11/24/chomage-toujours-fluctuant-a-nouveau-a-la-hausse_1612338 All the h2 and h3 tags are removed.

To fix it, I used a custom div2p regexp: this.regexps.div2p(/<(h1|h2|h3|h4|h5|h6)/); but I was wondering if it should not be part of the defaults ?

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions