
Word splitting in non-Latin text, and over ligatures #3

@PhilterPaper

The PDF::Builder package can typeset text using HarfBuzz::Shaper, which substitutes a font's ligatures for sequences of lowercase letters. PDF::Builder does not yet call Text::KnuthPlass natively, but I plan to add this in the near future. Potential problems arise when HarfBuzz::Shaper decides to substitute ligatures: Text::KnuthPlass will then have to accept not just plain text, but also the HarfBuzz arrays of processed glyphs, which may include ligatures. How this will interact with word splitting (whose patterns and exceptions assume no ligatures) remains to be seen. We also need to think about word splitting in connected cursive scripts such as Arabic, in heavily processed complex scripts such as Devanagari or Khmer, in right-to-left (RTL) scripts and bidirectional text, and in mixtures of these types.
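
To make the ligature problem concrete, here is a rough sketch in Python (not Perl, and not tied to either module's API). It assumes each shaped glyph carries a cluster index, meaning the offset of the first source character it covers, which is how HarfBuzz itself relates glyphs back to text. The `Glyph` class, the glyph names, the `classify_breaks` helper, and the sample data are all made up for illustration; none of it is actual HarfBuzz::Shaper output or Text::KnuthPlass input.

```python
from dataclasses import dataclass


@dataclass
class Glyph:
    name: str     # illustrative glyph name, e.g. "f_f_i" for an ffi ligature
    cluster: int  # index of the first source character this glyph covers


def classify_breaks(text_len, glyphs, break_points):
    """Split candidate hyphenation points into those that fall on a glyph
    boundary and those that land inside a multi-character glyph.

    Assumes left-to-right text with one cluster per glyph; RTL, reordered,
    or merged clusters would need more care.
    """
    # A break at character index p is usable only if some glyph starts at p
    # (or p is the end of the word).
    boundaries = {g.cluster for g in glyphs} | {text_len}
    usable = [p for p in break_points if p in boundaries]
    blocked = [p for p in break_points if p not in boundaries]
    return usable, blocked


# "office", assumed break "of-fice" (character index 2).
# Hypothetical shaping with an ffi ligature: o, ffi, c, e.
word = "office"
glyphs = [Glyph("o", 0), Glyph("f_f_i", 1), Glyph("c", 4), Glyph("e", 5)]
usable, blocked = classify_breaks(len(word), glyphs, [2])
print("usable:", usable, "blocked:", blocked)
```

Here the only candidate break falls in the middle of the ligature glyph, so it comes back as blocked. To actually split there, the two halves would have to be shaped again separately so that the ligature is broken up, and the widths seen by the line breaker would change accordingly.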


Labels: enhancement
