Skip to content

Kafka Executor

James Vaughan edited this page Nov 14, 2018 · 11 revisions

Purpose

Components

Object: KafkaExecutor

Responsible for implementing the "ProcessExecutor" interface and is capable of managing the shutdown behaviour of any local components. Although at minimum requires a "ResultsListener" to return result futures.

To "execute" a process it generates a PiInstance with a new ID locally, sends a ReduceRequest with that PiInstance then uses a promise listener to wait and respond with the result or exception.

Topic: ReduceRequest

Contains PiInstance which need reducing, along with any returned AtomicProcess results which need to be posted to that instance.

Object: Reducer

Consumes the ReduceRequest topic, posts the results into the PiInstances and reduces them. Produces Assignment messages for AtomicProcesses when opening new threads, otherwise posts to "Results" topic with results. It also posts the latest PiInstances to the "PiiHistory" topic.

Topic: Assignment

A stream of AtomicProcess calls which need computing. Contains the process to call and the arguments to use. It also contains the information necessary to uniquely identify id: the PiInstance ID and the call ID (an incrementing id for each AtomicProcess call made by that PiInstance).

Object: AtomicProcessExecutor

Consumes the Assignment topic, executes AtomicProcesses, and posts SequenceRequests.

Topic: PiiHistory

Stores:

  1. PiiUpdate: The latest PiInstance state for a given PiInstance.
  2. SequenceRequest: All the results of AtomicProcess calls which need posting to its PiInstance.
  3. SequenceFailures: The outstanding tasks which need mopping up after an exception.

Object: Sequencer

Consumes the PiiHistory topic, matches up any pending SequenceRequests with the latest PiInstance and posts ReduceRequests for the Reducer.

NOTE: U = PiiUpdate, SR = SequenceRequest, Red = For "Red" PiInstance, Blue = For "Blue" PiInstance

The challenge with the Sequencer is that if multiple PiInstances are assigned to a single PiiHistory partition the resultant ReduceRequests cannot always be sent immediately (see above). If a SequenceRequest arrives without a matching PiiUpdate, we must wait to commit until one is present or it could be forgotten in event of a shutdown. PiiUpdates may be sent as they will return after a redundant pass through the reducer.

Topic: Results

Should really be called "PiiLogs" or "PiEvents", stores the history of all PiEvents .

Object: ResultsListener

"ResultsListener"s does no work per se, each one has it's own ConsumerGroup so each ResultsListener receives all PiEvents (a broadcast/subscribe pattern). Implements the "PiObservable" trait for the cluster and the KafkaExecutor.

Clone this wiki locally