Add Unicode and escape sequence support by FrancoisLaferriere · Pull Request #624 · potassco/clingo

FrancoisLaferriere · 2026-04-10T10:29:25Z

Refs #123

Changes

Added support for \uXXXX, \t, and \r escape sequences in strings.

- Lexer: Added \t, \r, and \uXXXX patterns to STRING and FLIT (f-strings)
- unquote(): Added handling for \t, \r, and \uXXXX escapes
- quote(): Removed (unused)
- PrintQuoted: Added \r escape handling
- Tests: Added tests for all new escape sequences
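
For readers who want to see the decoding rule concretely, here is a minimal sketch in Python. It is not the C++ code from this PR: the name `unquote_sketch`, the set of simple escapes, and the error handling are all assumptions made for illustration. It shows only the behavior described above, namely that \t, \r, and \uXXXX are decoded and the result is stored as UTF-8.

```python
import string


def unquote_sketch(raw: str) -> bytes:
    """Decode escapes in the body of a string literal (quotes already removed).

    Hypothetical helper, not clingo's unquote(). Returns UTF-8 bytes to mirror
    the "internal UTF-8" representation described in the PR.
    """
    # Assumed set of single-character escapes; the PR only names \t and \r as new.
    simple = {"n": "\n", "t": "\t", "r": "\r", "\\": "\\", '"': '"'}
    out = []
    i = 0
    while i < len(raw):
        ch = raw[i]
        if ch != "\\":
            out.append(ch)
            i += 1
            continue
        if i + 1 >= len(raw):
            raise ValueError("dangling backslash at end of string")
        esc = raw[i + 1]
        if esc in simple:
            out.append(simple[esc])
            i += 2
        elif esc == "u":
            # \uXXXX: exactly four hexadecimal digits naming one code point.
            digits = raw[i + 2 : i + 6]
            if len(digits) != 4 or any(d not in string.hexdigits for d in digits):
                raise ValueError(f"malformed \\u escape at offset {i}")
            out.append(chr(int(digits, 16)))
            i += 6
        else:
            raise ValueError(f"unknown escape \\{esc} at offset {i}")
    # Surrogate code points (D800-DFFF) would make encode() raise; how the PR
    # treats them is not stated, so this sketch simply lets the error propagate.
    return "".join(out).encode("utf-8")


print(unquote_sketch(r"caf\u00E9"))  # b'caf\xc3\xa9'
```

The two-byte sequence C3 A9 is the UTF-8 encoding of U+00E9, which is why "caf\u00E9" prints as "café".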

Questions

Should \uXXXX be converted to UTF-8 bytes in the internal representation (current behavior), or should it be preserved and printed back as \uXXXX? Currently:
- "caf\u00E9" → internal UTF-8 → output "café"

Escape sequences in f-string literals are currently NOT processed; they pass through literally, so f"\n" outputs "\\n". Should we process them via unquote(), or keep the current pass-through behavior?
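
To make the first question concrete, the following Python sketch contrasts the two output options. It is an illustration only: `print_quoted_sketch` and its `ascii_only` flag are hypothetical, the exact set of characters that PrintQuoted escapes is assumed (the PR only states that \r was added), and the real implementation is C++.

```python
def print_quoted_sketch(value: str, ascii_only: bool = False) -> str:
    """Render a decoded string as a quoted literal.

    Hypothetical helper, not clingo's PrintQuoted.
    ascii_only=False emits non-ASCII characters as-is (the current behavior
    described in the PR); ascii_only=True re-escapes them as \\uXXXX (the
    alternative raised in the question).
    """
    # Assumed escape table; only the \r entry is confirmed by the PR text.
    escapes = {"\\": "\\\\", '"': '\\"', "\n": "\\n", "\t": "\\t", "\r": "\\r"}
    parts = []
    for ch in value:
        if ch in escapes:
            parts.append(escapes[ch])
        elif ascii_only and ord(ch) > 0x7F:
            if ord(ch) > 0xFFFF:
                # A four-digit escape cannot name code points above U+FFFF.
                raise ValueError("code point does not fit in \\uXXXX")
            parts.append(f"\\u{ord(ch):04X}")
        else:
            parts.append(ch)
    return '"' + "".join(parts) + '"'


print(print_quoted_sketch("café"))                   # "café"
print(print_quoted_sketch("café", ascii_only=True))  # "caf\u00E9"
```

Under the current behavior the original spelling of the literal is lost: a program that wrote "caf\u00E9" and one that wrote "café" directly produce the same output.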

- Updated lexer to recognize \uXXXX, \t, and \r escape sequences in STRING and FLIT (f-strings)
- Updated unquote() to parse \uXXXX and output UTF-8
- Updated PrintQuoted to output \r escape sequence
- Removed unused quote() function
- Added tests for escape sequences in strings

FrancoisLaferriere wants to merge 1 commit into potassco:wip-20 from FrancoisLaferriere:string-escapes