
Indexify


Create and Deploy durable, Data-Intensive Agentic Workflows

Indexify simplifies building and serving durable, multi-stage workflows as Python functions interconnected as graphs, and automagically deploys them as APIs.


Key Features

  • Conditional Branching and Data Flow: Router functions can conditionally choose one or more edges in a Graph, making it easy to invoke expert models based on inputs (see the sketch after this list).
  • Local Inference: Run LLMs in workflow functions using LLamaCPP, vLLM, or Hugging Face Transformers.
  • Distributed Map and Reduce: Automatically parallelizes functions over sequences across multiple machines. Reducer functions are durable and invoked as map functions finish.
  • Version Graphs and Backfill: Backfill API to update previously processed data when functions or models are updated.
  • Placement Constraints: Allows graphs to span GPU instances and cost-effective CPU machines, with functions assigned to specific instance types.
  • Request Queuing and Batching: Automatically queues and batches parallel workflow invocations to maximize GPU utilization.
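
For instance, a router can pick the downstream function based on its input. The following is a minimal sketch: the expert functions, their bodies, and the routing threshold are illustrative, and attaching the router with add_edge is an assumption about the API rather than documented usage.

from typing import List, Union
from indexify import indexify_function, indexify_router, Graph

@indexify_function()
def summarize_with_small_model(text: str) -> str:
    # Illustrative "expert": a cheap model for short inputs.
    return text[:80]

@indexify_function()
def summarize_with_large_model(text: str) -> str:
    # Illustrative "expert": an expensive model for long inputs.
    return text[:200]

@indexify_router()
def pick_model(text: str) -> List[Union[summarize_with_small_model, summarize_with_large_model]]:
    # The router returns the edge(s) the input should flow along next.
    if len(text) < 1000:
        return [summarize_with_small_model]
    return [summarize_with_large_model]

router_graph = Graph(name="routed_summarizer", start_node=pick_model, description="Route to an expert model by input size")
router_graph.add_edge(pick_model, summarize_with_small_model)
router_graph.add_edge(pick_model, summarize_with_large_model)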

Installation

pip install indexify

Basic Usage

Workflows are written as Python functions and are connected as Graphs. Each function is a logical compute unit that can be retried upon failure or assigned to specific hardware.

1: Create a Compute Graph

from pydantic import BaseModel
from indexify import indexify_function, Graph
from typing import List

class Total(BaseModel):
    val: int = 0

@indexify_function()
def generate_numbers(a: int) -> List[int]:
    return [i for i in range(a)]

@indexify_function()
def square(i: int) -> int:
    return i ** 2

@indexify_function(accumulate=Total)
def add(total: Total, new: int) -> Total:
    total.val += new
    return total

g = Graph(name="sequence_summer", start_node=generate_numbers, description="Simple Sequence Summer")
g.add_edge(generate_numbers, square)
g.add_edge(square, add)

You can separate heavy tasks, such as local LLM inference, from database write operations to avoid reprocessing data if a write fails. Indexify caches each function's output, so when a downstream step is retried, earlier steps aren't repeated.
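
A minimal sketch of this pattern, using the same decorators as above (the function bodies are illustrative stand-ins, not part of Indexify):

from indexify import indexify_function, Graph

@indexify_function()
def summarize(doc: str) -> str:
    # Stand-in for a heavy step such as local LLM inference.
    # Indexify caches this output once the function succeeds.
    return doc[:100]

@indexify_function()
def write_summary(summary: str) -> int:
    # Stand-in for a database write. If this step fails and is retried,
    # the cached output of `summarize` is reused instead of re-running
    # the heavy step.
    print(f"writing: {summary}")
    return len(summary)

pipeline = Graph(name="summarize_and_store", start_node=summarize, description="Separate heavy compute from writes")
pipeline.add_edge(summarize, write_summary)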

2: Test the Graph In-Process

invocation_id = g.run(a=10)
result = g.output(invocation_id, "add")
print(result)
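
Since outputs are fetched by function name, the same call should also be able to retrieve intermediate results, such as the map stage (an assumption about non-terminal nodes):

# Assumes intermediate outputs are retrievable by node name as well.
squares = g.output(invocation_id, "square")
print(squares)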

Running Graphs in-process makes writing and testing them easy. For production environments, you will want an API to invoke them whenever there is data to process.

3: Deploy your Graph as an API

The Indexify server generates API endpoints for Compute Graphs, allowing external systems to invoke your workflows. A single server can host multiple workflows and execute functions across Graphs in parallel.

indexify-cli server-dev-mode

This starts the following processes on your terminal -

  • Server: Manages the state of Graphs, orchestrates functions, and stores function outputs. API URL - http://localhost:8900
  • Executor: Runs Python functions and coordinates their execution state with the server.

Change the code above to deploy the graph as an API -

from indexify import RemoteGraph

graph = RemoteGraph.deploy(g)
# for graphs which are already deployed
# graph = RemoteGraph.by_name("sequence_summer") 
invocation_id = graph.run(block_until_done=True, a=10)
result = graph.output(invocation_id, "add")
print(result)

This serializes your Graph code, uploads it to the server, and instantiates a new endpoint. Everything else remains the same: the application code that invokes the Graph to process data and retrieve outputs is unchanged!


Roadmap

Scheduler
  • Enable batching in functions.
  • Data-local function execution - Prioritize scheduling on machines where intermediate outputs live, for faster execution.
  • Reducer optimizations - Batch the serial execution of reduce function calls.
  • Machine-parallel scheduling for even lower latency.
  • Support cycles in Graphs for more flexible agentic behaviors.
  • Ephemeral Graphs - Multi-stage inference and retrieval with no persistence of intermediate outputs.
  • Data Loader Functions - Produce a stream of values into Graphs over time, using the yield keyword.

SDK
  • Build a TypeScript SDK for writing workflows in TypeScript.