Add MVP of one-shot implementation #378

ppinchuk · 2026-02-10T20:23:05Z

The core fundamentals for using one-shot extraction are in place.

Still TODO (future PR):

LLM-generated website keywords
LLM-generated keyword heuristic
Text extractor based on extraction schema

Copilot

Copilot encountered an error and was unable to review this pull request. You can try again by re-requesting a review.

Copilot

Pull request overview

Copilot reviewed 39 out of 41 changed files in this pull request and generated 6 comments.

Comments suppressed due to low confidence (1)

pyproject.toml:50

compass.utilities.io now imports yaml and toml at runtime, but neither PyYAML nor a toml package is listed in the project dependencies. This will raise ImportError in environments where they aren't installed transitively. Add the needed dependencies (or switch TOML loading to the stdlib tomllib for Python 3.12+), and consider gating YAML/TOML support behind optional extras if you don't want them in the core install set.

Copilot · 2026-02-10T20:51:45Z

compass/plugin/one_shot/base.py

+    logger.debug("Loading query templates from cache at %s", cache_fp)
+    cache = json.loads(cache_fp.read_text(encoding="utf-8"))
+    if identifier.casefold() not in qt:
+        logger.debug(
+            "Adding query templates for %r to cache at %s",
+            identifier,
+            cache_fp,
+        )
+        cache[identifier.casefold()] = {
+            "templates": qt,
+            "sha256": hashlib.sha256(str(schema).encode()).hexdigest(),
+        }
+        cache_fp.write_text(json.dumps(cache, indent=4), encoding="utf-8")
+        return
+
+    potential_qt = qt[identifier.casefold()]
+    m = hashlib.sha256()


_qt_to_cache is using the qt list (query templates) where it should be using the cache dict loaded from disk. As written, if identifier.casefold() not in qt: will almost always be true, and potential_qt = qt[identifier.casefold()] will then crash because qt is a list, not a dict. Use cache for membership/indexing and only use qt as the templates payload being stored.

Copilot · 2026-02-10T20:51:45Z

compass/_cli/process.py

+    multiple=True,
+    help="One-shot plugin configuration to add to COMPASS before processing",
+)
+def process(config, verbose, no_progress, plugin):
    """Download and extract ordinances for a list of jurisdictions"""
    config = load_config(config)

+    for one_shot_plugin_config in plugin:
+        create_schema_based_one_shot_extraction_plugin(
+            config=one_shot_plugin_config, tech=config["tech"]


The CLI option --plugin/-p is declared as multiple=True, but each provided plugin config is registered with the same identifier tech=config["tech"]. If more than one --plugin is provided, register_plugin will raise because identifiers must be unique. Either make the option non-multiple, or allow each plugin config to supply its own unique identifier and pass that through here.

Suggested change

multiple=True,

help="One-shot plugin configuration to add to COMPASS before processing",

)

def process(config, verbose, no_progress, plugin):

"""Download and extract ordinances for a list of jurisdictions"""

config = load_config(config)

for one_shot_plugin_config in plugin:

create_schema_based_one_shot_extraction_plugin(

config=one_shot_plugin_config, tech=config["tech"]

help="One-shot plugin configuration to add to COMPASS before processing",

)

)

def process(config, verbose, no_progress, plugin):

"""Download and extract ordinances for a list of jurisdictions"""

config = load_config(config)

if plugin is not None:

create_schema_based_one_shot_extraction_plugin(

config=plugin, tech=config["tech"]

Copilot · 2026-02-10T20:51:45Z

compass/plugin/one_shot/components.py

+requirements. Keep the response concise and consistent.\
+"""
+_TEXT_COLLECTION_MAIN_PROMPT = """\
+Determine wether this text excerpt contains any information relevant to \


Spelling: _TEXT_COLLECTION_MAIN_PROMPT says "Determine wether..."; this should be "whether" to avoid prompting errors and to keep docs/prompt text professional.

Suggested change

Determine wether this text excerpt contains any information relevant to \

Determine whether this text excerpt contains any information relevant to \

Copilot · 2026-02-10T20:51:46Z

compass/utilities/io.py

+    config_type = ConfigType(config_filepath.name.split(".")[-1])
+    config = config_type.load(config_filepath)
+    if resolve_paths:
+        return resolve_all_paths(config, config_filepath.parent)
+
+    return config


load_config calls ConfigType(...) directly, so an unknown extension (e.g. .txt) will raise a ValueError from Enum construction rather than the documented COMPASSValueError (and the unit test expects COMPASSValueError). Consider catching the Enum error and raising COMPASSValueError with a clear message listing supported extensions.

compass/utilities/io.py

ppinchuk added 28 commits February 9, 2026 11:01

Rename class

2a2cf74

Add new LLM task and usage category

a6cd95d

Update base plugin abstract methods

0d7ddb0

Update plugin interface to implement new abstract methods

15de4f1

Base text collector has new base class

6005b3e

Add missing import

5f30992

Break out validation methods for easier subclassing

cdfdec9

Collected text now registered as a cleaned output file

026dbe3

Implement new base interface

e02900b

Update plugin implementations

1b237aa

Move file

6128720

Add SchemaOutputLLMCaller class

d8baf99

Use new base extractor functions

b43ba2b

Add NoOp implementations for extraction pieces

e1b7d9b

Add load config from GAPs to utilities IO module

0320751

Add CaseInsensitiveEnum

265d5b9

Remove unused function

9c99fb1

Use new loading function

e4aaf4e

question -> query

c0aafc4

Use new load function

54bdd0d

Fix imports

636e493

Add one-shot plugin components

d5a2c5e

Add generate_query_templates

18e4184

Add MVP of one-shot plugin implementation

975e863

Populate namespace

c1588e9

Add output schemas

015f191

Add dependency

f53c39a

Allow users to add plugin configs when running the CLI

bde6ea2

ppinchuk added this to the Infrastructure and accuracy improvements milestone Feb 10, 2026

ppinchuk self-assigned this Feb 10, 2026

ppinchuk requested a review from castelao as a code owner February 10, 2026 20:23

Copilot AI review requested due to automatic review settings February 10, 2026 20:23

ppinchuk added enhancement Update to logic or general code improvements new computation Update that adds a new computation method p-critical Priority: critical topic-python-llm Issues/pull requests related to LLMs topic-python-general Issues/pull requests related to python labels Feb 10, 2026

Copilot AI reviewed Feb 10, 2026

View reviewed changes

ppinchuk requested a review from Copilot February 10, 2026 20:39

Copilot started reviewing on behalf of ppinchuk February 10, 2026 20:43 View session

Copilot started reviewing on behalf of ppinchuk February 10, 2026 20:49 View session

Copilot AI reviewed Feb 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MVP of one-shot implementation #378

Add MVP of one-shot implementation #378

ppinchuk commented Feb 10, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Feb 10, 2026

Uh oh!

Copilot AI Feb 10, 2026

Uh oh!

Copilot AI Feb 10, 2026

Uh oh!

Copilot AI Feb 10, 2026

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	Determine wether this text excerpt contains any information relevant to \
	Determine whether this text excerpt contains any information relevant to \

Add MVP of one-shot implementation #378

Are you sure you want to change the base?

Add MVP of one-shot implementation #378

Conversation

ppinchuk commented Feb 10, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 10, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant