pgflow-dev
diff --git a/‎PLAN.md‎
Lines changed: 25 additions & 2 deletions b/‎PLAN.md‎
Lines changed: 25 additions & 2 deletions
diff --git a/‎PLAN_output_aggregation.md‎
Lines changed: 183 additions & 0 deletions b/‎PLAN_output_aggregation.md‎
Lines changed: 183 additions & 0 deletions
@@ -2,13 +2,21 @@
 
 **NOTE: This PLAN.md file should be removed in the final PR once all map infrastructure is complete.**
 
-### Current State
+### Features
 
 - ✅ **WORKING**: Empty array maps (taskless) cascade and complete correctly
 - ✅ **WORKING**: Task spawning creates N tasks with correct indices
 - ✅ **WORKING**: Dependency count propagation for map steps
 - ✅ **WORKING**: Array element extraction - tasks get full array instead of individual items
-- ❌ **MISSING**: Output aggregation - no way to combine map task outputs for dependents
+- ✅ **DONE**: Output aggregation - inline implementation aggregates map task outputs for dependents
+- ⏳ **WAITING**: DSL support for `.map()` for defining map steps
+
+### Chores
+
+- ⏳ **WAITING**: Integration tests for map steps
+- ⏳ **WAITING**: Consolidated migration for map steps
+- ⏳ **WAITING**: Documentation for map steps
+- ⏳ **WAITING**: Graphite stack merge for map steps
 
 ## Implementation Status
 
@@ -93,6 +101,21 @@
     - Type safety for input/output types
     - Compile-time enforcement of single dependency rule
 
+- [ ] **Performance Optimization: step_states.output Column**
+
+  - Migrate from inline aggregation to storing outputs in step_states
+  - See detailed plan: [PLAN_step_output.md](./PLAN_step_output.md)
+  - Benefits:
+    - Eliminate redundant aggregation queries
+    - 30-70% performance improvement for map chains
+    - Cleaner architecture with single source of truth
+  - Implementation:
+    - Add output column to step_states table
+    - Update complete_task to populate output on completion
+    - Simplify consumers (start_tasks, maybe_complete_run, broadcasts)
+    - Update all aggregation tests (~17 files)
+  - **Note**: This is an optimization that should be done after core functionality is stable
+
 - [ ] **Integration Tests**
 
   - End-to-end workflows with real array data
 
@@ -0,0 +1,183 @@
+# Output Aggregation Implementation Plan
+
+## Overview
+Implement output aggregation for map steps with performance-focused, test-first approach.
+
+## Stage 1: Baseline Performance Measurement
+
+### Tasks
+- Run existing performance tests multiple times (3-5 runs)
+- Calculate average values for each metric
+- Document results in `PERFORMANCE.md`
+
+### Commands
+```bash
+# Run performance tests (repeat 3-5 times)
+pnpm nx test:pgtap core -- pkgs/core/tests/performance/*.sql
+
+# Document results in PERFORMANCE.md with format:
+# - Test name
+# - Average execution time
+# - Min/Max values
+# - Standard deviation if significant
+```
+
+## Stage 2: Test-First Development (Naive Implementation)
+
+### Approach
+Write failing tests one at a time, implement inline solution to make them pass.
+
+### Test Scenarios (in order of complexity)
+1. **Basic map output aggregation**
+   - Single map step with 3 tasks
+   - Verify outputs aggregated in task_index order
+
+2. **Empty map output**
+   - Map step with 0 tasks
+   - Should return `[]` as output
+
+3. **Map feeding into single step**
+   - Map step output aggregated as array
+   - Single step receives full array as dependency input
+
+4. **Map feeding into another map**
+   - First map outputs array
+   - Second map processes each element
+
+5. **Edge case: NULL outputs**
+   - Some tasks return NULL
+   - Aggregation should include NULLs in array
+
+6. **Run completion with map leaf step**
+   - Map step as leaf (no dependents)
+   - Run output should contain aggregated array
+
+### Development Workflow
+```bash
+# 1. Write test
+vim pkgs/core/tests/map_output_aggregation_test.sql
+
+# 2. Run test (should fail)
+pkgs/core/scripts/run-test-with-colors pkgs/core/tests/map_output_aggregation_test.sql
+
+# 3. Update functions in database
+psql $DATABASE_URL -f updated_function.sql
+
+# 4. Re-run test (iterate until passing)
+pkgs/core/scripts/run-test-with-colors pkgs/core/tests/map_output_aggregation_test.sql
+
+# 5. Repeat for next test scenario
+```
+
+### Implementation Notes
+**Naive approach**: Inline aggregation directly in the affected functions
+- **`start_tasks`**: Aggregate map outputs inline in deps CTE
+- **`maybe_complete_run`**: Aggregate map outputs for leaf steps
+- **`complete_task`**: Aggregate for broadcast events
+
+## Stage 3: Performance Measurement (Naive)
+
+### Tasks
+- Run performance tests with naive implementation
+- Compare with baseline
+- Document in `PERFORMANCE.md`
+
+### Expected Impact
+- `start_tasks`: Moderate overhead (aggregation per dependency)
+- `maybe_complete_run`: Minimal (only at run completion)
+- `complete_task`: Minimal (only for broadcasts)
+
+## Stage 4: Map-to-Map Optimization
+
+### Concept
+Optimize the map->map case where we aggregate outputs only to immediately decompose them:
+- Map A task[i] → output[i]
+- Currently: Aggregate to array → decompose in Map B
+- Optimized: Map A task[i] → Map B task[i] directly
+
+### Implementation Strategy
+```sql
+-- In start_tasks deps CTE, add special case:
+CASE
+  WHEN step.step_type = 'map' AND dep_step.step_type = 'map' THEN
+    -- Direct task-to-task transfer
+    (SELECT output FROM pgflow.step_tasks
+     WHERE run_id = st.run_id
+       AND step_slug = dep.dep_slug
+       AND task_index = st.task_index
+       AND status = 'completed')
+  ELSE
+    -- Standard aggregation for non-map dependents
+    ...
+END
+```
+
+### Tests
+1. **Map-to-map direct transfer**
+   - Verify task[i] gets output[i] without aggregation
+
+2. **Map-to-map with different sizes**
+   - Source map: 5 tasks
+   - Target map: 5 tasks (should work)
+   - Error handling if sizes mismatch
+
+## Stage 5: Final Performance Measurement
+
+### Tasks
+- Run all performance tests
+- Compare baseline vs naive vs optimized
+- Document final results and recommendations
+
+### Metrics to Track
+- Execution time per function
+- Memory usage (if measurable)
+- Query complexity (EXPLAIN ANALYZE)
+
+## Stage 6: Function Extraction Decision
+
+### Evaluation Criteria
+After measuring performance of inline implementation:
+1. **Performance overhead**: Is function call cost acceptable?
+2. **Code duplication**: How much repetition exists?
+3. **Maintainability**: Would function improve code clarity?
+
+### If extracting to function:
+```sql
+-- Create pgflow.get_step_output() helper
+-- Update all three locations to use helper
+-- Re-run performance tests
+-- Document final decision and rationale
+```
+
+## Notes for Implementation
+
+### Key Files to Modify
+1. `pkgs/core/schemas/0120_function_start_tasks.sql` (lines 46-53)
+2. `pkgs/core/schemas/0100_function_maybe_complete_run.sql` (lines 16-27)
+3. `pkgs/core/schemas/0100_function_complete_task.sql` (line 156)
+
+### Testing Database Access
+```bash
+# Get database URL
+source .env.local
+echo $DATABASE_URL
+
+# Direct psql access for function updates
+psql $DATABASE_URL
+
+# View current function
+\sf pgflow.start_tasks
+```
+
+### Performance Testing Tips
+- Run tests when system is idle
+- Use consistent hardware/environment
+- Warm up database before measurements
+- Consider connection pooling effects
+
+## Success Criteria
+- [ ] All map output aggregation tests passing
+- [ ] Performance impact < 10% for typical workflows
+- [ ] Map-to-map optimization shows measurable improvement
+- [ ] Documentation complete with performance analysis
+- [ ] Decision made on function extraction based on data