Browser Explorer 🌐

Browser Explorer is an AI-powered web browsing agent that automatically explores websites, tests interactive elements, and generates comprehensive Playwright test suites. Built with a modular architecture combining traditional crawling with AI-enhanced capabilities, it's designed to be the ultimate tool for automated web testing and exploration.

🚀 Features

Core Capabilities

🔍 Intelligent Web Crawling: Breadth-first search algorithm for systematic website exploration
🤖 AI-Powered Element Detection: Hybrid AI + traditional detection for 25+ element types
🎯 Smart Interaction: Automated interaction with forms, buttons, dropdowns, modals, and more
📸 Comprehensive Recording: Complete user path capture with screenshots and timing
🧪 Test Generation: Automatic Playwright/Cypress/Puppeteer test suite generation
📄 Page Object Model: Clean, maintainable test structure with POM pattern

Enterprise Features

🔐 Multi-Strategy Authentication: Support for Basic, OAuth, MFA, and API key auth
🥷 Stealth Mode: Advanced anti-bot detection evasion
🧩 CAPTCHA Handling: Detection and solving capabilities
📊 Distributed Crawling: Redis-based queue for scalable operations
🐳 Docker Support: Full containerization with docker-compose
📈 Monitoring & Metrics: Built-in observability and health checks

🎯 Current Status

✅ Implemented (70%)

Complete infrastructure and crawling engine
Traditional element detection and interaction
Test generation pipeline with multiple frameworks
Authentication system and session management
CLI interface with comprehensive commands
Docker containerization and deployment ready

🔴 Missing AI Components (30%)

Mastra AI agent integration (shell classes only)
Stagehand browser automation tools (not integrated)
Natural language capabilities
Self-healing test features

Note: While the infrastructure is production-ready, the AI capabilities that differentiate this from traditional crawlers are pending integration. See TODO.md for the complete roadmap.

📦 Installation

Prerequisites

Node.js 20+
Docker and Docker Compose (for full deployment)
Redis (for distributed crawling)
PostgreSQL (for data persistence)

Quick Start

# Clone the repository
git clone https://github.com/yourusername/browser-explorer.git
cd browser-explorer

# Install dependencies
npm install

# Set up environment variables
cp .env.example .env

# Build the project
npm run build

# Run with Docker (recommended)
docker-compose up -d

# Or run locally
npm start

🎮 Usage

CLI Commands

# Initialize configuration
browser-explorer init

# Crawl a website and generate tests
browser-explorer crawl https://example.com --output ./tests --depth 3

# Run specific test generation
browser-explorer test https://example.com/login --framework playwright

# Validate generated tests
browser-explorer validate ./tests

# Start monitoring dashboard
browser-explorer serve --port 3000

# Debug mode with verbose logging
browser-explorer debug https://example.com --verbose

Programmatic API

import { BrowserExplorer } from 'browser-explorer';

// Initialize the explorer
const explorer = new BrowserExplorer({
  headless: true,
  depth: 3,
  parallel: 5,
  stealth: true
});

// Start exploration
const results = await explorer.explore('https://example.com', {
  authentication: {
    strategy: 'basic',
    credentials: { username: 'user', password: 'pass' }
  },
  output: {
    directory: './generated-tests',
    framework: 'playwright'
  }
});

// Generate Page Object Model
await explorer.generatePageObjects(results);

🏗️ Architecture

Browser Explorer
├── CLI/API Layer         # User interfaces
├── Core Services         # Orchestration and workflow
├── Components           # Element detection, interaction, recording
├── Infrastructure      # Config, logging, metrics, storage
└── Runtime            # Playwright, Redis, Docker, Node.js

Key Modules

Crawler Module: BFS, resilient, and distributed crawling strategies
Detection Module: AI-ready element detection with 25+ element types
Generation Module: Multi-framework test generation with POM
Authentication Module: Pluggable auth strategies and session management
Monitoring Module: Metrics, traces, and health monitoring

See ARCHITECTURE.md for detailed architecture documentation.

🛠️ Configuration

Basic Configuration (browser-explorer.yml)

crawler:
  depth: 3
  parallel: 5
  timeout: 30000
  respectRobotsTxt: true

detection:
  useAI: false  # Will be true when AI integration is complete
  fallbackToSelectors: true

generation:
  framework: playwright
  outputDir: ./tests
  generatePageObjects: true
  
browser:
  headless: true
  viewport:
    width: 1920
    height: 1080

Environment Variables

# Database
DATABASE_URL=postgresql://user:pass@localhost:5432/browser_explorer

# Redis
REDIS_URL=redis://localhost:6379

# API Keys (when AI integration is complete)
OPENAI_API_KEY=your-key-here
ANTHROPIC_API_KEY=your-key-here

# Monitoring
ENABLE_METRICS=true
METRICS_PORT=9090

🔧 Development

Setup Development Environment

# Install dependencies
npm install

# Run in development mode
npm run dev

# Run tests
npm test

# Run linting
npm run lint

# Type checking
npm run typecheck

Testing

# Unit tests
npm run test:unit

# Integration tests
npm run test:integration

# Performance tests
npm run test:performance

# Self-test the system
npm run self-test

🐳 Docker Deployment

# Build and start all services
docker-compose up -d

# View logs
docker-compose logs -f

# Scale crawler workers
docker-compose up -d --scale crawler=5

# Stop all services
docker-compose down

📊 Monitoring

The system includes comprehensive monitoring capabilities:

Metrics: Application and infrastructure metrics
Tracing: Distributed tracing for request flows
Health Checks: Liveness, readiness, and dependency checks
Dashboards: Built-in monitoring dashboard at /metrics

🤝 Contributing

We welcome contributions! Please see our Contributing Guide for details.

Development Priorities

Mastra AI Integration - Implement proper AI agent configurations
Stagehand Integration - Add natural language browser control
Self-Healing Tests - AI-powered test maintenance
Enhanced Detection - Natural language element queries

📚 Documentation

🚀 Getting Started

Getting Started Guide - Complete setup and first steps
Quick Examples - Working code examples
Configuration Guide - Setup and configuration

📖 Core Documentation

Documentation Index - Complete documentation overview
API Reference - All classes, interfaces, and functions
Architecture Overview - System design and patterns
Implementation Status - Current gaps and roadmap

🏗️ Core Modules

Browser Agent - Core browser automation
Authentication System - Multi-strategy auth
AI Element Detection - Element identification
Web Crawler - Systematic web exploration
Test Generation - Automated test creation

🔧 Development

Contributing Guide - How to contribute
Development Setup - Local development
Testing Guide - Running and writing tests
Docker Guide - Containerization

🎯 Advanced Topics

AI Integration - Mastra and AI workflows
Performance Optimization - Scaling and optimization
Security Guide - Security best practices
Troubleshooting - Common issues and solutions

🐛 Known Issues & Implementation Status

🔴 Critical Implementation Gaps

Server Mode: CLI advertises but not implemented (src/cli/BrowserExplorerCLI.ts:334)
Authentication Setup: CLI auth command exists but non-functional (src/cli/BrowserExplorerCLI.ts:417)
Stagehand AI Integration: Core AI detection completely disabled (src/detectors/AIElementDetector.ts:23-28)
Test Generation: Placeholder implementations for core features (src/generation/TestGenerator.ts:497-507)

🟡 Partially Implemented

OAuth authentication flows (framework exists, needs provider implementations)
MFA support (basic structure, needs TOTP integration)
Browser-Crawler integration (service architecture incomplete)
CLI debug commands (advertised but not functional)

✅ Fully Functional

Basic browser automation and control
Username/password authentication
Selector-based element detection
Basic test generation templates
Configuration management
Session persistence
Docker infrastructure

See Implementation Status Report for detailed analysis of all gaps, priorities, and planned fixes.

📈 Roadmap

Phase 1: Core AI Integration (Current Priority)

Mastra agent implementation
Stagehand tool integration
Basic natural language support

Phase 2: Enhanced AI Features

Self-healing test capabilities
AI-powered visual regression
Intelligent assertion generation

Phase 3: Production Features

Multi-agent collaboration
Advanced learning capabilities
Enterprise integrations

📝 License

This project is licensed under the MIT License - see the LICENSE file for details.

🙏 Acknowledgments

Built with Playwright for browser automation
Designed for Mastra AI framework integration
Prepared for Stagehand natural language control

Made with ❤️ by the Browser Explorer Team

Report Bug • Request Feature • Documentation

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.github		.github
documentation		documentation
examples		examples
scripts		scripts
src		src
.dockerignore		.dockerignore
.env.example		.env.example
.eslintrc.json		.eslintrc.json
.gitignore		.gitignore
.prettierrc.json		.prettierrc.json
ARCHITECTURE.md		ARCHITECTURE.md
BLOCKED_TODOS.md		BLOCKED_TODOS.md
COMPLETED_TODOS.md		COMPLETED_TODOS.md
Dockerfile		Dockerfile
Dockerfile.dev		Dockerfile.dev
FINAL_SUMMARY.md		FINAL_SUMMARY.md
GAP_ANALYSIS_REPORT.md		GAP_ANALYSIS_REPORT.md
NEEDS_CLARIFICATION.md		NEEDS_CLARIFICATION.md
PRD Web Browsing Agent.md		PRD Web Browsing Agent.md
PROJECT_SUMMARY.md		PROJECT_SUMMARY.md
README.md		README.md
REFACTORING_PLAN.md		REFACTORING_PLAN.md
TODO.md		TODO.md
TODO_BACKUP.md		TODO_BACKUP.md
TODO_BACKUP_2025-01-26_00-58-07.md		TODO_BACKUP_2025-01-26_00-58-07.md
TODO_BACKUP_2025-07-26_00-54-31.md		TODO_BACKUP_2025-07-26_00-54-31.md
TODO_BACKUP_20250725_011309.md		TODO_BACKUP_20250725_011309.md
TODO_BACKUP_2025_07_25.md		TODO_BACKUP_2025_07_25.md
TODO_WORK_SUMMARY.md		TODO_WORK_SUMMARY.md
docker-compose.dev.yml		docker-compose.dev.yml
docker-compose.yml		docker-compose.yml
documentation-report.md		documentation-report.md
implementation-log.md		implementation-log.md
jest.config.js		jest.config.js
package.json		package.json
test-detector-direct.js		test-detector-direct.js
test-element-detection-debug.js		test-element-detection-debug.js
test-element-detection.js		test-element-detection.js
test-fixing-log.md		test-fixing-log.md
tsconfig.json		tsconfig.json
type-fixing-log.md		type-fixing-log.md

AdamManuel-dev/explorer-browser-agent

Folders and files

Latest commit

History

Repository files navigation

Browser Explorer 🌐

🚀 Features

Core Capabilities

Enterprise Features

🎯 Current Status

✅ Implemented (70%)

🔴 Missing AI Components (30%)

📦 Installation

Prerequisites

Quick Start

🎮 Usage

CLI Commands

Programmatic API

🏗️ Architecture

Key Modules

🛠️ Configuration

Basic Configuration (browser-explorer.yml)

Environment Variables

🔧 Development

Setup Development Environment

Testing

🐳 Docker Deployment

📊 Monitoring

🤝 Contributing

Development Priorities

📚 Documentation

🚀 Getting Started

📖 Core Documentation

🏗️ Core Modules

🔧 Development

🎯 Advanced Topics

🐛 Known Issues & Implementation Status

🔴 Critical Implementation Gaps

🟡 Partially Implemented

✅ Fully Functional

📈 Roadmap

Phase 1: Core AI Integration (Current Priority)

Phase 2: Enhanced AI Features

Phase 3: Production Features

📝 License

🙏 Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages