[WIP] Implement support for Batch APIs to gather evidence #687

Draft
maykcaldas wants to merge 3 commits into main

Conversation

@maykcaldas (Collaborator) commented Nov 14, 2024

Due to the parallel nature of gathering evidence and summarizing all candidate papers, we plan to use the batch API when possible.

Task list

  • Create a class to make batch calls to OpenAI (see the sketch after this list)
  • Create a class to make batch calls to Anthropic
  • Integrate the OpenAI class into the get_evidence method
  • Integrate the Anthropic class into the get_evidence method
  • Update get_summary_llm to decide which provider to use given the llm in the config
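For reference, a minimal sketch of what the OpenAI half of the first task could look like (the function name, model, and the assumed shape of `data` are placeholders, not the PR's actual implementation; the JSONL request format follows OpenAI's Batch API docs):

```python
import json
import tempfile

from openai import OpenAI


def submit_openai_batch(data: list[dict[str, str]], model: str = "gpt-4o-mini"):
    """Write one chat-completion request per entry and submit them as a single batch."""
    client = OpenAI()

    # One JSONL line per request; assumes each entry in `data` is a single chat message.
    with tempfile.NamedTemporaryFile("w", suffix=".jsonl", delete=False) as f:
        for i, message in enumerate(data):
            f.write(json.dumps({
                "custom_id": f"request-{i}",
                "method": "POST",
                "url": "/v1/chat/completions",
                "body": {"model": model, "messages": [message]},
            }) + "\n")
        input_path = f.name

    # Upload the JSONL file, then create the batch job against the chat completions endpoint.
    batch_file = client.files.create(file=open(input_path, "rb"), purpose="batch")
    return client.batches.create(
        input_file_id=batch_file.id,
        endpoint="/v1/chat/completions",
        completion_window="24h",
    )
```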

Mayk Caldas added 2 commits November 14, 2024 14:26
This class is used to submit batch calls to the OpenAI batch API
@maykcaldas self-assigned this Nov 14, 2024
data: list[dict[str, str]],
callbacks: list[Callable] | None = None,
name: str | None = None,
skip_system: bool = False,

Collaborator:

I refactored out skip_system in #680, can you propagate that change to here?

@@ -609,6 +618,10 @@ class Settings(BaseSettings):
" router_kwargs key with router kwargs as values."
),
)
use_batch_in_summary: bool = Field(
default=False,
description="Whether to use batch API for LLMs in summarization",

Collaborator:

Can you add a few words on how the batches are actually formed?

Collaborator:

Perhaps you can say something like:

Whether to use batch API for LLMs in summarization, which means multiple messages are sent in one API request.

@maykcaldas (author):

It was updated to:

"Whether to use batch API for LLMs in summarization, "
"which means multiple messages are sent in one API request "
"to the LLM provider's batch API."
"This option is only available for Claude(https://docs.anthropic.com/en/api/creating-message-batches)"
"and OpenAI (https://platform.openai.com/docs/guides/batch) chat models."

}
)

while batch.status != "completed":

Collaborator:

We probably want "completed" and "failed" to be OpenAI enums here rather than free strings.
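The openai SDK types `Batch.status` as plain string literals rather than an enum, so one option is a small local enum whose values mirror the API's strings (a sketch, not part of this PR):

```python
from enum import StrEnum  # Python 3.11+


class OpenAIBatchStatus(StrEnum):
    """Statuses we poll for; values match the strings returned by the Batch API."""

    COMPLETED = "completed"
    FAILED = "failed"
    EXPIRED = "expired"
    CANCELLED = "cancelled"


# The loop condition then becomes, e.g.:
#   while batch.status != OpenAIBatchStatus.COMPLETED: ...
```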

batch = client.batches.retrieve(batch.id)
if batch.status == "failed":
    raise Exception(
        "Batch failed. \n\nReason: \n"
        + "\n".join([k.message for k in batch.errors.data])
    )
await asyncio.sleep(5)

Collaborator:

Let's parameterize this waiting, and maybe make the default longer, like 30-second or 1-minute polling?

We should probably add some debug/info logs here to track progress along with maybe a max-timeout which users can set.
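Something along these lines could cover all three points (the function name, defaults, and use of an async client are assumptions for the sketch, not the PR's implementation):

```python
import asyncio
import logging

from openai import AsyncOpenAI

logger = logging.getLogger(__name__)


async def wait_for_batch(
    client: AsyncOpenAI,
    batch_id: str,
    poll_interval: float = 30.0,  # seconds between status checks
    timeout: float = 24 * 3600,  # give up after this many seconds
):
    """Poll a batch until it completes, logging progress and enforcing a max wait."""
    elapsed = 0.0
    while elapsed < timeout:
        batch = await client.batches.retrieve(batch_id)
        logger.info("Batch %s status: %s", batch_id, batch.status)
        if batch.status == "completed":
            return batch
        if batch.status == "failed":
            errors = batch.errors.data or [] if batch.errors else []
            raise RuntimeError(
                "Batch failed.\n\nReason:\n"
                + "\n".join(e.message or "" for e in errors)
            )
        await asyncio.sleep(poll_interval)
        elapsed += poll_interval
    raise TimeoutError(f"Batch {batch_id} did not complete within {timeout} s")
```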
