fix: Optimize Stale Agent with GraphQL and Search API to resolve 429 Quota errors #3700
Conversation
Summary of Changes

Hello @ryanaiagent, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed! This pull request addresses the critical 429 quota errors hit by the stale agent workflow.
Response from ADK Triaging Agent: Hello @ryanaiagent, thank you for creating this PR! This PR is a bug fix. Could you please provide logs or a screenshot after the fix is applied? This information will help reviewers review your PR more efficiently. Thanks!

Response from ADK Triaging Agent: Hello @ryanaiagent, thank you for creating this PR! This PR is a bug fix. Could you please associate a GitHub issue with this PR? If there is no existing issue, could you please create one? In addition, could you please provide logs or a screenshot after the fix is applied to help reviewers better understand the fix? This information will help reviewers review your PR more efficiently. Thanks!
Code Review
This pull request is an excellent and comprehensive refactoring of the stale agent. The switch from inefficient, paginated REST API calls to a single, targeted GraphQL query per issue is a major performance win. Similarly, using the GitHub Search API for server-side filtering of old issues drastically reduces unnecessary processing. The introduction of asyncio for concurrent processing and urllib3 retries for API resilience makes the agent much more robust and efficient. The new logic to detect 'ghost edits' is a clever addition that improves the accuracy of the staleness detection. My review includes a few suggestions for improving maintainability and making some of the new logic more configurable, but overall this is a very strong set of changes.
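The retry behavior praised here can be sketched with `urllib3`'s `Retry` mounted on a `requests` session via `HTTPAdapter`. This is a minimal sketch; the exact retry counts and status list in the repo's `utils.py` are assumptions:

```python
import requests
from requests.adapters import HTTPAdapter
from urllib3.util.retry import Retry

def make_github_session() -> requests.Session:
    """Build a requests session that retries HTTP 429/5xx with exponential backoff."""
    retry = Retry(
        total=5,                            # up to 5 retries per request
        backoff_factor=2,                   # sleeps roughly 2s, 4s, 8s, ... between retries
        status_forcelist=[429, 500, 502, 503],
        allowed_methods=["GET", "POST"],    # POST must be allowed for GraphQL calls
    )
    session = requests.Session()
    session.mount("https://", HTTPAdapter(max_retries=retry))
    return session
```

Every request issued through such a session transparently backs off on a 429 instead of crashing the workflow run.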
```python
logger.debug(f"#{issue_number}: Initializing runner and session.")

try:
  runner = InMemoryRunner(agent=root_agent, app_name=APP_NAME)
```
The InMemoryRunner is being instantiated inside process_single_issue, which means a new runner object is created for every single issue being processed. While InMemoryRunner is lightweight, it's more efficient to create it once outside the processing loop and reuse it for all issues.
For example:

```python
# In main()
runner = InMemoryRunner(agent=root_agent, app_name=APP_NAME)
tasks = [process_single_issue(runner, issue_num) for issue_num in chunk]

# In process_single_issue()
async def process_single_issue(runner: InMemoryRunner, issue_number: int) -> ...:
    # ...
    # runner = InMemoryRunner(...)  # REMOVE THIS
    session = await runner.session_service.create_session(...)
    # ...
```
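Independent of the runner-reuse suggestion, the chunked concurrent processing the PR describes can be sketched as a semaphore-bounded `asyncio.gather`. The worker body and the `CONCURRENCY_LIMIT` value mirror the PR description but are assumptions, not the repo's exact code:

```python
import asyncio

CONCURRENCY_LIMIT = 3  # mirrors the setting named in the PR description

async def process_single_issue(issue_number: int) -> str:
    # Stand-in for the real GraphQL fetch + LLM call.
    await asyncio.sleep(0)
    return f"#{issue_number} processed"

async def run_all(issue_numbers: list[int]) -> list[str]:
    sem = asyncio.Semaphore(CONCURRENCY_LIMIT)

    async def bounded(n: int) -> str:
        async with sem:  # at most CONCURRENCY_LIMIT issues in flight
            return await process_single_issue(n)

    # gather() preserves input order, so results line up with issue_numbers.
    return await asyncio.gather(*(bounded(n) for n in issue_numbers))

results = asyncio.run(run_all([101, 102, 103, 104]))
```

The semaphore keeps the number of in-flight LLM/API calls bounded, which is what prevents a burst of requests from tripping the 429 quota in the first place.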
Hi @xuanyang15, can you please review this? TL;DR: Fixed the 429 quota errors.
…Quota errors Merge #3700

### Description
This PR refactors the `adk_stale_agent` to address `429 RESOURCE_EXHAUSTED` errors encountered during workflow execution. The previous implementation was inefficient in fetching issue history (using pagination over the REST API) and lacked server-side filtering, causing excessive API calls and huge token consumption that breached Gemini API quotas.

The new implementation switches to a **GraphQL-first approach**, implements server-side filtering via the Search API, adds robust concurrency controls, and significantly improves code maintainability through modular refactoring.

### Root Cause of Failure
The previous workflow failed with the following error due to passing too much context to the LLM and processing too many irrelevant issues:

```text
google.genai.errors.ClientError: 429 RESOURCE_EXHAUSTED. Quota exceeded for metric: generativelanguage.googleapis.com/generate_content_paid_tier_input_token_count
```

### Key Changes

#### 1. Optimization: REST → GraphQL (`agent.py`)
* **Old:** Fetched issue comments and timeline events using multiple paginated REST API calls (`/timeline`).
* **New:** Implemented `get_issue_state` using a single **GraphQL** query. This fetches comments, `userContentEdits`, and specific timeline events (Labels, Renames) in one network request.
* **Refactoring:** The complex analysis logic has been decomposed into focused helper functions (`_fetch_graphql_data`, `_build_history_timeline`, `_replay_history_to_find_state`) for better readability and testing.
* **Configurable:** Added `GRAPHQL_COMMENT_LIMIT` and `GRAPHQL_TIMELINE_LIMIT` settings to tune context depth.
* **Impact:** Drastically reduces the data payload size and eliminates multiple API round-trips, significantly lowering the token count sent to the LLM.

#### 2. Optimization: Server-Side Filtering (`utils.py`)
* **Old:** Fetched *all* open issues via REST and filtered them in Python memory.
* **New:** Uses the GitHub Search API (`get_old_open_issue_numbers`) with `created:<DATE` syntax.
* **Impact:** Only fetches issue numbers that actually meet the age threshold, preventing the agent from wasting cycles and tokens on brand-new issues.

#### 3. Concurrency & Rate Limiting (`main.py` & `settings.py`)
* **Old:** Sequential execution loop.
* **New:** Implemented `asyncio.gather` with a configurable `CONCURRENCY_LIMIT` (set to 3).
* **New:** Added `urllib3` retry strategies (exponential backoff) in `utils.py` to handle GitHub API rate limits (HTTP 429) gracefully.

#### 4. Logic Improvements ("Ghost Edits")
* **New Feature:** The agent now detects "Ghost Edits" (where an author updates the issue description without posting a new comment).
* **Action:** If a silent edit is detected on a stale candidate, the agent now alerts maintainers instead of marking it stale, preventing false positives.

### File Comparison Summary

| File | Change |
| :--- | :--- |
| `main.py` | Switched from `InMemoryRunner` loop to `asyncio` chunked processing. Added execution timing and API usage logging. |
| `agent.py` | Replaced REST logic with GraphQL query. Added logic to handle silent body edits. Decomposed giant `get_issue_state` into helper functions with docstrings. Added `_format_days` helper. |
| `utils.py` | Added `HTTPAdapter` with retries. Added `get_old_open_issue_numbers` using Search API. |
| `settings.py` | Removed `ISSUES_PER_RUN`; added configuration for `CONCURRENCY_LIMIT`, `SLEEP_BETWEEN_CHUNKS`, and GraphQL limits. |
| `PROMPT_INSTRUCTIONS.txt` | Simplified decision tree; removed date calculation responsibility from LLM. |

### Verification
The new logic minimizes token usage by offloading date calculations to Python and strictly limiting the context passed to the LLM to semantic intent analysis (e.g., "Is this a question?").
* **Metric Check:** The workflow now tracks API calls per issue to ensure we stay within limits.
* **Safety:** Silent edits by users now correctly reset the "Stale" timer.
* **Maintainability:** All complex logic is now isolated in typed helper functions with comprehensive docstrings.

Co-authored-by: Xuan Yang <xygoogle@google.com>
COPYBARA_INTEGRATE_REVIEW=#3700 from ryanaiagent:feat/improve-stale-agent 888064e
PiperOrigin-RevId: 838885530
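The "one GraphQL request per issue" idea can be sketched as follows. The connection names (`comments`, `userContentEdits`, `timelineItems`) are real fields of GitHub's GraphQL `Issue` object, but the exact query and limits in `agent.py` are not shown in this PR page, so treat this as an assumption-labeled sketch:

```python
# Sketch of a single GraphQL query covering comments, body edits, and
# label/rename timeline events in one round-trip. The limit constants
# mirror the settings named in the PR; values are assumptions.
GRAPHQL_COMMENT_LIMIT = 50
GRAPHQL_TIMELINE_LIMIT = 50

ISSUE_STATE_QUERY = f"""
query($owner: String!, $name: String!, $number: Int!) {{
  repository(owner: $owner, name: $name) {{
    issue(number: $number) {{
      userContentEdits(last: 5) {{ nodes {{ editedAt }} }}
      comments(last: {GRAPHQL_COMMENT_LIMIT}) {{
        nodes {{ author {{ login }} createdAt body }}
      }}
      timelineItems(
        last: {GRAPHQL_TIMELINE_LIMIT},
        itemTypes: [LABELED_EVENT, UNLABELED_EVENT, RENAMED_TITLE_EVENT]
      ) {{ nodes {{ __typename }} }}
    }}
  }}
}}
"""

def build_issue_state_payload(owner: str, name: str, number: int) -> dict:
    """Payload for a single POST to https://api.github.com/graphql."""
    return {
        "query": ISSUE_STATE_QUERY,
        "variables": {"owner": owner, "name": name, "number": number},
    }
```

One POST of this payload replaces the paginated `/timeline` REST calls, which is where the payload-size and round-trip savings come from.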
Thank you @ryanaiagent for your contribution! 🎉 Your changes have been successfully imported and merged via Copybara in commit cb19d07. Closing this PR as the changes are now in the main branch.
Testing Plan
I have verified these changes on my personal fork by manually triggering the workflow to ensure it handles API rate limits correctly and processes issues without crashing.
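The server-side filtering described in the change summary (open issues older than a threshold, via `created:<DATE`) can be sketched against GitHub's issue search endpoint. The function name follows the PR description; the pagination and header details here are assumptions:

```python
from datetime import datetime, timedelta, timezone

import requests

def build_search_query(repo: str, cutoff_date: str) -> str:
    """Search qualifier string: open issues created before cutoff_date (YYYY-MM-DD)."""
    return f"repo:{repo} is:issue is:open created:<{cutoff_date}"

def get_old_open_issue_numbers(repo: str, min_age_days: int, token: str) -> list[int]:
    """First page of issue numbers older than min_age_days, via the Search API."""
    cutoff = (datetime.now(timezone.utc) - timedelta(days=min_age_days)).date()
    resp = requests.get(
        "https://api.github.com/search/issues",
        params={"q": build_search_query(repo, cutoff.isoformat()), "per_page": 100},
        headers={"Authorization": f"Bearer {token}"},
        timeout=30,
    )
    resp.raise_for_status()
    return [item["number"] for item in resp.json()["items"]]
```

Because the date comparison runs on GitHub's side, brand-new issues never reach the agent at all, instead of being fetched and discarded in Python.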