feat: Wrappers for Consistent Response Formatting in GEM #121
tl;dr
Add an `EncapsulateWrapper` to detect and enforce common response signposts used by LLMs (like `\boxed{}` and `<answer></answer>`).

Motivation
Currently, GEM as a library appears somewhat undecided about how to extract unambiguous answers from LLMs: `QAEnv` uses `<answer>` HTML tags, while the TextArena environments appear to use `\boxed{}` TeX tags. Because these conventions are baked into the environments, the environments themselves become harder to use whenever a different tagging convention is desired (whether due to language-model constraints/quirks or for experimental consistency).
Given this potential diversity of tagging conventions, this PR proposes a direction for decoupling action extraction from action handling within GEM environments, so that environments are useful to a wider audience and simpler to implement (environments no longer need to assume a particular tagging format).
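For concreteness, here is a minimal sketch of what such a wrapper could look like. The class name `EncapsulateWrapper` comes from this PR, but the constructor arguments, the gym-style `step()` delegation, and the empty-action fallback are all illustrative assumptions, not the PR's actual implementation:

```python
import re


class EncapsulateWrapper:
    """Sketch of a response-signpost wrapper (API details are assumptions).

    Extracts the content between a configurable (open, close) delimiter
    pair from a raw LLM response before the wrapped environment sees it,
    so the same environment can run with <answer></answer> or \\boxed{}
    prompts without changing its own parsing code.
    """

    def __init__(self, env, open_tag="<answer>", close_tag="</answer>"):
        self.env = env
        # Non-greedy match; findall() returns every tagged span so we can
        # take the last one, since models often restate the tags while
        # reasoning before committing to a final answer.
        # Caveat: this does not balance nested braces, so extracting
        # \boxed{\frac{1}{2}} correctly would need real brace counting.
        self._pattern = re.compile(
            re.escape(open_tag) + r"(.*?)" + re.escape(close_tag),
            re.DOTALL,
        )

    def step(self, action: str):
        matches = self._pattern.findall(action)
        if not matches:
            # "Enforce": the response carried no well-formed signpost.
            # Forwarding an empty action is a placeholder policy; a
            # format-penalty reward or truncation are other options.
            return self.env.step("")
        return self.env.step(matches[-1].strip())

    def __getattr__(self, name):
        # Delegate everything else (reset(), render(), ...) to the
        # wrapped environment, in the usual gym-wrapper style.
        return getattr(self.env, name)
```

Under this design, switching an environment from `<answer>` tags to `\boxed{}` becomes a wrapper argument (e.g. `EncapsulateWrapper(env, open_tag=r"\boxed{", close_tag="}")`) rather than an edit to the environment's parsing code.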
Word of Caution
Although this PR points out the inconsistencies in GEM's response tagging and extraction, no changes have yet been made to existing or in-development environments, their prompts, or their action-parsing procedures.
Those would be breaking changes that may affect experimental results (due to differences in prompting), and I am currently looking for ways to resolve this with minimal impact.