=== Data Machine === Contributors: chubes4 Tags: ai, automation, content, workflow, pipeline Requires at least: 6.2 Tested up to: 6.8 Requires PHP: 8.0 Stable tag: 0.1.0 License: GPLv2 or later License URI: https://www.gnu.org/licenses/gpl-2.0.html
AI-first WordPress plugin for content processing workflows with visual pipeline builder and multi-provider AI integration.
Features: Tool-First AI, Visual Pipeline Builder, Multi-Provider AI (OpenAI, Anthropic, Google, Grok, OpenRouter), 6-Tier AI Directive Priority System, AIStepConversationManager with Turn Tracking, AIStepToolParameters, Pipeline Workflow Context, Site Context Integration, Three-Layer Tool Management, Social Publishing, OAuth System, Headless Mode, Universal Handler Settings Template
Requirements: WordPress 6.2+, PHP 8.0+, Composer
Pipeline+Flow: Pipelines are reusable templates, Flows are configured instances
Example: Automated Tech News Twitter Feed
- Pipeline Template: Fetch → AI → Twitter with system prompt "You are a tech news curator. Extract key insights and create engaging tweets that highlight innovation and industry impact. Maintain a professional but accessible tone."
- Flow A: TechCrunch RSS → AI agent → user message "Focus on AI/ML breakthroughs and venture funding" → Twitter (every 2 hours)
- Flow B: Reddit r/technology → AI agent → user message "Focus on open-source projects and developer tools" → Twitter (every 4 hours)
- Flow C: VentureBeat RSS → AI agent → user message "Focus on startup launches and product innovations" → Twitter (every 6 hours)
Development:
- Clone to
/wp-content/plugins/data-machine/
- Run
composer install
- Activate plugin
- Configure AI provider at Settings → Data Machine
Production:
- Run
./build.sh
for production zip - Install via WordPress admin
- Configure AI provider and tools
Google Search (optional):
- Create Custom Search Engine + get API key
- Add credentials at Settings → Data Machine → Tool Configuration
- Free tier: 100 queries/day
OAuth Providers:
- Twitter: OAuth 1.0a
- Reddit/Facebook/Threads/Google Sheets: OAuth2
- Bluesky: App Password
Auth via /dm-oauth/{provider}/
popup flow.
- Create Pipeline: "Tech News Bot"
- Add Steps: RSS → AI → Twitter
- Configure: TechCrunch feed + Claude 3.5 Sonnet + Twitter auth
- Schedule: Every 2 hours
- Monitor: Data Machine → Logs
Single Platform: RSS → AI → Twitter (recommended)
Multi-Platform: RSS → AI → Twitter → AI → Facebook
Content Updates: WordPress Local → AI → WordPress Update
Document Analysis: Files → AI → WordPress
Note: Multi-platform uses AI→Publish→AI→Publish pattern. Update steps require source_url from fetch metadata.
For detailed examples and technical specifications, see CLAUDE.md
// Pipeline creation and execution
$pipeline_id = apply_filters('dm_create_pipeline', null, ['pipeline_name' => 'My Pipeline']);
$step_id = apply_filters('dm_create_step', null, ['step_type' => 'fetch', 'pipeline_id' => $pipeline_id]);
$flow_id = apply_filters('dm_create_flow', null, ['pipeline_id' => $pipeline_id]);
do_action('dm_run_flow_now', $flow_id, 'manual');
// AI integration
$response = apply_filters('ai_request', [
'messages' => [['role' => 'user', 'content' => $prompt]],
'model' => 'gpt-5-mini'
], 'openai');
For complete API documentation, see CLAUDE.md
Complete extension framework supporting Fetch, Publish, Update handlers, AI tools, and Database services with filter-based auto-discovery.
See CLAUDE.md
for development guides and technical specifications
Fetch Sources: Local/remote files, RSS feeds, Reddit posts, WordPress Local, WordPress Media, WordPress API, Google Sheets
Publish Destinations: Twitter, Bluesky, Threads, Facebook, WordPress, Google Sheets
Update Handlers: WordPress Update (existing post/page modification via source_url)
AI Providers: OpenAI, Anthropic, Google, Grok, OpenRouter (200+ models)
General Tools: Google Search, Local Search, WebFetch (50K character limit), WordPress Post Reader
All handlers are fully functional with OAuth authentication where required and comprehensive error handling
For detailed specifications, see CLAUDE.md
- Content marketing automation
- News monitoring and alerts
- Document processing and extraction
- Social media management
- Content repurposing
- Research automation
- WordPress workflow integration
Pages: Pipelines, Flows, Jobs, Logs
Settings (WordPress Settings → Data Machine):
- Engine Mode (headless), page controls, tool toggles
- Site Context toggle (WordPress info injection)
- Job data cleanup on failure toggle (debugging)
- File retention settings (1-90 days)
- 6-Tier AI Directive Priority System: Plugin Core Directive → Global system prompt → Pipeline prompts → Tool definitions → Data packet structure → Site context
- AIStepConversationManager for multi-turn conversation state management with turn tracking and chronological message ordering
- AIStepToolParameters flat parameter architecture with content extraction
- Tool configuration (API keys, OAuth)
- WordPress defaults (post types, taxonomies, author, status)
- Three-layer tool management (global → modal → validation)
Features: Drag & drop, auto-save, status indicators, real-time monitoring
composer install # Development setup
./build.sh # Production build
Architecture: PSR-4 autoloading, filter-based service discovery, flat parameter architecture via dm_engine_parameters
filter, centralized cache system via Actions/Cache.php, 6-tier AI directive system with auto-registration (PluginCoreDirective, GlobalSystemPromptDirective, PipelineSystemPromptDirective, ToolDefinitionsDirective, DataPacketStructureDirective, SiteContextDirective), AIStepConversationManager for conversation state management with turn tracking, AIStepToolParameters class for unified tool execution, universal handler settings template system eliminating modal code duplication, Composer-managed ai-http-client dependency. See CLAUDE.md
for complete technical specifications.
GPL v2+ - License
Developer: Chris Huber
Documentation: CLAUDE.md