The `mock` processor connects to the Gecholog LLM Gateway to mock and replicate your LLM API responses. It works like this:

- Make a regular LLM API request to any gecholog router, for example `/service/standard/`
- The `mock` custom processor will record the response payload and response headers
- Send as many requests as you want to `/mock/service/standard/` to get the same response over and over again

The `mock` processor will randomize the response time to resemble a real LLM API. You can change this behavior via the `LAMBDA` environment variable.
Before you proceed, build and start the gecholog container. From this directory, run:

```sh
export NATS_TOKEN=changeme
export GUI_SECRET=changeme
export AISERVICE_API_BASE=https://your.openai.azure.com/
docker compose up -d
```

Then set the credentials for your LLM API:

```sh
export AISERVICE_API_KEY=your_api_key
export DEPLOYMENT=your_deployment
```
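You can check that the containers came up before sending any traffic; the service names you see depend on your docker-compose.yml:

```sh
# List the compose services and their current state
docker compose ps
```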
Send the request to the `/service/standard/` router:
curl -sS -H "api-key: $AISERVICE_API_KEY" -H "Content-Type: application/json" -X POST -d '{
"messages": [
{
"role": "system",
"content": "Assistant is a large language model trained by OpenAI."
},
{
"role": "user",
"content": "Who were the founders of Microsoft?"
}
],
"max_tokens": 15
}' "http://localhost:5380/service/standard/openai/deployments/$DEPLOYMENT/chat/completions?api-version=2023-05-15"
Now try to make your requests to the mock router `/mock/service/standard/`:
curl -sS -H "Content-Type: application/json" -X POST -d '{
"messages": [
{
"role": "system",
"content": "Assistant is a large language model trained by OpenAI."
},
{
"role": "user",
"content": "Who were the founders of Microsoft?"
}
],
"max_tokens": 15
}' "http://localhost:5380/mock/service/standard/openai/deployments/any/chat/completions?api-version=2023-05-15"
And you should receive the same response back, without Gecholog forwarding your second request.
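A quick way to convince yourself is to capture one live reply and its mocked replay and compare them. This is a sketch; `BODY`, `live.json`, and `mock.json` are arbitrary names introduced here:

```sh
# The request payload from above, condensed into a shell variable for reuse
BODY='{"messages":[{"role":"user","content":"Who were the founders of Microsoft?"}],"max_tokens":15}'

# Live request (recorded by the mock processor) ...
curl -sS -H "api-key: $AISERVICE_API_KEY" -H "Content-Type: application/json" -d "$BODY" \
  -o live.json "http://localhost:5380/service/standard/openai/deployments/$DEPLOYMENT/chat/completions?api-version=2023-05-15"

# ... and the mocked replay of it
curl -sS -H "Content-Type: application/json" -d "$BODY" \
  -o mock.json "http://localhost:5380/mock/service/standard/openai/deployments/any/chat/completions?api-version=2023-05-15"

# The payloads should be identical
diff live.json mock.json && echo "mock replayed the recorded response"
```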
Use the Log Lister to inspect the traffic.
`mock` will store the last response for each router:

- request1 to `/service/standard/` returns answer1
- request2 to `/service/standard/` returns answer2
- request3 to `/service/standard/` returns answer3
- request4 to `/mock/service/standard/` returns answer3
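Repeated calls to the mock router keep returning that last recording. Reusing the `BODY` variable from the sketch above (and again assuming jq is installed):

```sh
# Three replays of the same recorded response; all three lines print the same text
for i in 1 2 3; do
  curl -sS -H "Content-Type: application/json" -d "$BODY" \
    "http://localhost:5380/mock/service/standard/openai/deployments/any/chat/completions?api-version=2023-05-15" \
    | jq -r '.choices[0].message.content'
done
```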
`mock` will separate the responses for each router:

- request1 to `/service/standard/` returns answer1
- request2 to `/service/capped/` returns answer2
- request3 to `/mock/service/standard/` returns answer1
- request4 to `/mock/service/capped/` returns answer2
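If your gateway also defines a `/service/capped/` router with the same path layout (an assumption here; adjust the URL to match your router configuration), you can replay each router's own recording side by side, again reusing `BODY`:

```sh
# Each mock router returns the last response recorded on its own live counterpart
curl -sS -H "Content-Type: application/json" -d "$BODY" \
  "http://localhost:5380/mock/service/standard/openai/deployments/any/chat/completions?api-version=2023-05-15" \
  | jq -r '.choices[0].message.content'
curl -sS -H "Content-Type: application/json" -d "$BODY" \
  "http://localhost:5380/mock/service/capped/openai/deployments/any/chat/completions?api-version=2023-05-15" \
  | jq -r '.choices[0].message.content'
```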
`mock` will randomize the response time using the exponential distribution, controlled by the environment variable `LAMBDA`. Set `LAMBDA=0` to disable the latency; this is the default value. The `docker-compose.yml` uses `LAMBDA=0.2`, which gives a mean response time of 500 ms.
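You can observe the randomized latency with curl's built-in timer; each replay should take a different amount of time while returning the same payload (`BODY` as defined earlier):

```sh
# Print the total request time for a few replays of the recorded response
for i in 1 2 3; do
  curl -sS -o /dev/null -w 'total: %{time_total}s\n' \
    -H "Content-Type: application/json" -d "$BODY" \
    "http://localhost:5380/mock/service/standard/openai/deployments/any/chat/completions?api-version=2023-05-15"
done
```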