Added missing deps for linux grounding actions. #157

wehos · 2025-11-29T12:29:27Z

Agent S3 seems to miss importing deps for some linux platform:

Agent-S/gui_agents/s3/agents/grounding.py

Lines 396 to 399 in f3ed515

    
           if self.platform == "linux": 
        
               return f"import pyautogui; pyautogui.hotkey('win'); time.sleep(0.5); pyautogui.write({repr(app_or_filename)}); time.sleep(1.0); pyautogui.hotkey('enter'); time.sleep(0.5)" 
        
           elif self.platform == "darwin": 
        
               return f"import pyautogui; import time; pyautogui.hotkey('command', 'space', interval=0.5); pyautogui.typewrite({repr(app_or_filename)}); pyautogui.press('enter'); time.sleep(1.0)"

For Darwin it was correctly imported, but for Linux it was not.

Summary by CodeRabbit

Bug Fixes
- Fixed time-based delays in Linux window management scripts so scripted pauses execute reliably.
Improvements
- Improved on-screen action labels and positioning so mouse actions (click, move, drag) display accurate coordinates and align correctly with targets.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

coderabbitai · 2025-11-29T12:29:35Z

Walkthrough

Added an explicit import time to the UBUNTU_APP_SETUP script generation in the grounding module; adjusted text-placement in the behavior narrator to accept explicit x,y coordinates and updated calls to place labels for mouse actions accordingly.

Changes

Cohort / File(s)	Summary
Time module import addition `gui_agents/s3/agents/grounding.py`	Updated `UBUNTU_APP_SETUP` constant to include `import time`; Linux branch of `open()` now generates scripts that explicitly import `time` so `time.sleep()` calls work in the scripted window management.
Coordinate-based text placement `gui_agents/s3/bbon/behavior_narrator.py`	Introduced `x, y` parameters to `place_text(label, color, x, y)` and replaced prior width/height-based placements with explicit x/y; updated all calls (Click, MoveTo, DragTo, etc.) to pass coordinates and adjusted drawing offsets accordingly.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Pay attention to the generated UBUNTU_APP_SETUP script string and the Linux open() path to ensure no quoting/escaping issues when inserting the import time.
Review place_text signature and all updated call sites in behavior_narrator.py for correct coordinate mapping and potential off-by-one or boundary cases in text placement.
Verify there are no unintended visual regressions in rendered annotations (font metrics, offsets).

Poem

A rabbit scribbles, quick and spry, 🐰
Adding time so scripts can lie,
Labels hop to x and y,
Little changes, big-sky sigh —
Code and carrots, wink of eye. 🥕✨

Pre-merge checks and finishing touches

❌ Failed checks (2 warnings)

Check name	Status	Explanation	Resolution
Title check	⚠️ Warning	The PR title claims to add missing dependencies for Linux grounding actions, but the actual changes include both import fixes and unrelated coordinate refactoring in behavior_narrator.py.	Update the title to accurately reflect all significant changes, such as 'Fix Linux imports in grounding and refactor coordinate handling in behavior_narrator' or split into separate PRs.
Docstring Coverage	⚠️ Warning	Docstring coverage is 50.00% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (1 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

wehos · 2025-11-29T12:40:35Z

Another critical issue pointed out by Dr. Rabbit is here:

Agent-S/gui_agents/s3/bbon/behavior_narrator.py

Lines 43 to 68 in f3ed515

    
           for mouse_action in mouse_actions: 
        
               width, height = mouse_action.split("(")[1].strip(")").split(", ")[:2] 
        
               width, height = int(width), int(height) 
        
               # Clamp coordinates within bounds 
        
               width = max(0, min(img.width - 1, width)) 
        
               height = max(0, min(img.height - 1, height)) 
        
               def place_text(label, color): 
        
                   bbox = draw.textbbox((0, 0), label, font=font) 
        
                   text_w, text_h = ( 
        
                       bbox[2] - bbox[0], 
        
                       bbox[3] - bbox[1], 
        
                   )  # Measure text size 
        
                   offset_x, offset_y = -5, 5  # Default offset 
        
                   if width + offset_x + text_w > img.width:  # Out of bounds on right 
        
                       offset_x = -text_w - 5 
        
                   if height + offset_y + text_h > img.height:  # Out of bounds on bottom 
        
                       offset_y = -text_h - 5 
        
                   if width + offset_x < 0:  # Out of bounds on left 
        
                       offset_x = 5 
        
                   if height + offset_y < 0:  # Out of bounds on top 
        
                       offset_y = 5 
        
                   draw.text( 
        
                       (width + offset_x, height + offset_y), label, fill=color, font=font 
        
                   )

The mark_action method has a closure variable capture issue on lines 58, 60, 62, 64, and 67.

The nested function place_text defined in lines 51–68 references the loop variables width and height inside the loop. This is the classic Python closure pitfall and causes all labels to use the coordinates from the final iteration of the loop.

Unfortunately, he is right!

wehos · 2025-11-29T12:46:12Z

@coderabbitai review

coderabbitai · 2025-11-29T12:46:23Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (1)

gui_agents/s3/bbon/behavior_narrator.py (1)

51-68: Closure bug in place_text is correctly fixed; optional minor refactor only

Changing place_text to accept explicit x, y and using those for bounds checks and drawing coordinates cleanly removes the loop-variable capture issue and ensures each label is placed at the correct mouse action position. The updated call sites for Click, MoveTo, and DragTo correctly pass the current (width, height) so the labels are now aligned with each action instead of drifting to a shared position.

If you ever want to micro-optimize, you could hoist place_text outside the loop (e.g., a small nested helper taking draw, font, img, label, color, x, y) to avoid redefining it each iteration, but that’s purely optional and not performance‑critical here.

Also applies to: 70-85

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between dce34e0 and b754846.

📒 Files selected for processing (1)

gui_agents/s3/bbon/behavior_narrator.py (2 hunks)

🧰 Additional context used

🪛 GitHub Actions: lint

gui_agents/s3/bbon/behavior_narrator.py

[error] 1-1: black --check gui_agents failed: 1 file would be reformatted by Black.

🔇 Additional comments (1)

gui_agents/s3/bbon/behavior_narrator.py (1)

1-273: 

The review comment displays stale code that does not match the actual file content.

The code snippet in the review comment is incorrect. Specifically:

Line 51 in the review shows: def place_text(label, color, x, y): with 4 parameters

Line 51 in the actual file: def place_text(label, color): with 2 parameters

Review function calls: place_text("Click", (255, 0, 0), width, height)

Actual function calls: place_text("Click", (255, 0, 0))

The actual implementation refactored place_text to use closure variables from the enclosing scope rather than explicit parameters. This functional change is not reflected in the review comment's code snippet, making the review comment unreliable for guiding the developer to the correct state of the code.

Added missing deps for linux grounding actions.

dce34e0

Fixed closure variable capture issue in bbon.

b754846

coderabbitai bot reviewed Nov 29, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Added missing deps for linux grounding actions. #157

Added missing deps for linux grounding actions. #157

Uh oh!

wehos commented Nov 29, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Nov 29, 2025 •

edited

Loading

Uh oh!

wehos commented Nov 29, 2025

Uh oh!

wehos commented Nov 29, 2025

Uh oh!

coderabbitai bot commented Nov 29, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

	if self.platform == "linux":
	return f"import pyautogui; pyautogui.hotkey('win'); time.sleep(0.5); pyautogui.write({repr(app_or_filename)}); time.sleep(1.0); pyautogui.hotkey('enter'); time.sleep(0.5)"
	elif self.platform == "darwin":
	return f"import pyautogui; import time; pyautogui.hotkey('command', 'space', interval=0.5); pyautogui.typewrite({repr(app_or_filename)}); pyautogui.press('enter'); time.sleep(1.0)"

Added missing deps for linux grounding actions. #157

Are you sure you want to change the base?

Added missing deps for linux grounding actions. #157

Uh oh!

Conversation

wehos commented Nov 29, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Nov 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Estimated code review effort

Poem

Pre-merge checks and finishing touches

Uh oh!

wehos commented Nov 29, 2025

Uh oh!

wehos commented Nov 29, 2025

Uh oh!

coderabbitai bot commented Nov 29, 2025

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

wehos commented Nov 29, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Nov 29, 2025 •

edited

Loading