feat(ai-proxy): support Google Cloud Vertex #2119

HecarimV · 2025-04-24T07:53:08Z

Ⅰ. Describe what this PR did

support Google Cloud Vertex provider

Ⅱ. Does this pull request fix one issue?

Ⅳ. Describe how to verify it

docker-compose.yaml

version: '3.7'
services:
  envoy:
    image: higress-registry.cn-hangzhou.cr.aliyuncs.com/higress/gateway:v1.4.0-rc.1
    entrypoint: /usr/local/bin/envoy
    # 开启了 debug 级别日志方便调试
    command: -c /etc/envoy/envoy.yaml --component-log-level wasm:debug
    networks:
      - higress-net
    ports:
      - "10000:10000"
    volumes:
      - ./envoy.yaml:/etc/envoy/envoy.yaml
      - ./plugin.wasm:/etc/envoy/plugin.wasm
networks:
  higress-net: {}

envoy.yaml

# File generated by hgctl. Modify as required.

admin:
  address:
    socket_address:
      protocol: TCP
      address: 0.0.0.0
      port_value: 9901
static_resources:
  listeners:
    - name: listener_0
      address:
        socket_address:
          protocol: TCP
          address: 0.0.0.0
          port_value: 10000
      filter_chains:
        - filters:
            - name: envoy.filters.network.http_connection_manager
              typed_config:
                "@type": type.googleapis.com/envoy.extensions.filters.network.http_connection_manager.v3.HttpConnectionManager
                scheme_header_transformation:
                  scheme_to_overwrite: https
                stat_prefix: ingress_http
                # Output envoy logs to stdout
                access_log:
                  - name: envoy.access_loggers.stdout
                    typed_config:
                      "@type": type.googleapis.com/envoy.extensions.access_loggers.stream.v3.StdoutAccessLog
                # Modify as required
                route_config:
                  name: local_route
                  virtual_hosts:
                    - name: local_service
                      domains: [ "*" ]
                      routes:
                        - match:
                            prefix: "/"
                          route:
                            cluster: vertex
                            timeout: 300s
                http_filters:
                  - name: wasmtest
                    typed_config:
                      "@type": type.googleapis.com/udpa.type.v1.TypedStruct
                      type_url: type.googleapis.com/envoy.extensions.filters.http.wasm.v3.Wasm
                      value:
                        config:
                          name: wasmtest
                          vm_config:
                            runtime: envoy.wasm.runtime.v8
                            code:
                              local:
                                filename: /etc/envoy/plugin.wasm
                          configuration:
                            "@type": "type.googleapis.com/google.protobuf.StringValue"
                            value: |
                              {
                                "provider": {
                                  "type": "vertex",                                
                                  "apiTokens": [
                                    "your-api-token"
                                  ],
                                  "geminiSafetySetting": {
                                    "HARM_CATEGORY_DANGEROUS_CONTENT": "OFF",
                                    "HARM_CATEGORY_HARASSMENT": "OFF",
                                    "HARM_CATEGORY_HATE_SPEECH": "OFF",
                                    "HARM_CATEGORY_SEXUALLY_EXPLICIT": "OFF"
                                  },       
                                  "vertexProjectId": "eastern-concord-457601-e9",
                                  "vertexRegion": "us-central1"
                                }
                              }
                  - name: envoy.filters.http.router
  clusters:
    - name: vertex
      connect_timeout: 30s
      type: LOGICAL_DNS
      dns_lookup_family: V4_ONLY
      lb_policy: ROUND_ROBIN
      load_assignment:
        cluster_name: vertex
        endpoints:
          - lb_endpoints:
              - endpoint:
                  address:
                    socket_address:
                      address: us-central1-aiplatform.googleapis.com
                      port_value: 443
      transport_socket:
        name: envoy.transport_sockets.tls
        typed_config:
          "@type": type.googleapis.com/envoy.extensions.transport_sockets.tls.v3.UpstreamTlsContext
          "sni": "us-central1-aiplatform.googleapis.com"

测试非流式请求：

curl -X POST 'http: //localhost:10000/v1/chat/completions' \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "gemini-2.0-flash-001",
    "messages": [
        {
            "role": "user",
            "content": "你好，你是谁？"
        }
    ],
    "max_tokens": 100,
    "temperature": 0.3,
    "stream": false
}'

测试流式请求：

curl -X POST 'http: //localhost:10000/v1/chat/completions' \
  -H 'Content-Type: application/json' \
  -d '{
    "model": "gemini-2.0-flash-001",
    "messages": [
        {
            "role": "user",
            "content": "你好，你是谁？"
        }
    ],
    "stream": true
}'

Ⅴ. Special notes for reviews

vertex api 文档：https://cloud.google.com/vertex-ai/generative-ai/docs/model-reference/inference

codecov-commenter · 2025-04-24T07:56:41Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 46.06%. Comparing base (ef31e09) to head (d9226f4).
Report is 581 commits behind head on main.

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #2119       +/-   ##
===========================================
+ Coverage   35.91%   46.06%   +10.15%     
===========================================
  Files          69       81       +12     
  Lines       11576    13010     +1434     
===========================================
+ Hits         4157     5993     +1836     
+ Misses       7104     6671      -433     
- Partials      315      346       +31

see 78 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.
📦 JS Bundle Analysis: Save yourself from yourself by tracking and limiting bundle sizes in JS merges.

plugins/wasm-go/extensions/ai-proxy/provider/vertex.go

CH3CHO

我改了一点格式问题，麻烦pull一下。

plugins/wasm-go/extensions/ai-proxy/provider/vertex.go

CH3CHO

还有两个地方希望能够完善一下：

更新 README.md，添加 Google Vertex 的配置说明
等到 TTL 完全到了再刷新 token 可能会因计时偏差导致部分请求使用到过期 token。建议加一个提前量，可以允许用户配置，默认可以 1分钟

plugins/wasm-go/extensions/ai-proxy/provider/vertex.go

CH3CHO · 2025-06-09T09:19:37Z

plugins/wasm-go/extensions/ai-proxy/README.md

+| `vertexRegion` | string   | 必填 | -      | Google Cloud 区域（如 us-central1, europe-west4 等），用于构建 Vertex API 地址             |
+| `vertexProjectId` | string   | 必填 | -      | Google Cloud 项目 ID，用于标识目标 GCP 项目                                              |
+| `vertexAuthServiceName` | string   | 必填 | -      | 用于 OAuth2 认证的服务名称，该服务为了访问oauth2.googleapis.com                                |
+| `vertexGeminiSafetySetting` | map of string   | 非必填 | -      | Gemini 模型的内容安全过滤设置。                                                           |


没看到新加的那个 ahead 。。。

CH3CHO

LGTM

Co-authored-by: Kent Dong <ch3cho@qq.com>

Colstuwjx · 2025-06-18T13:54:48Z

Hi @HecarimV , 请教下，这个 vertexAuthServiceName 字段应该怎么配置呢，我这边有用 litellm proxy 配置一个 vertex ai 的代理，但是好像没看到有这个字段，我看你这个字段实际也没用作调用方面，而是做了一个 dns 服务发现？有一个类似的例子可以参考下看看吗

Hi @HecarimV, please tell me how to configure the vertexAuthServiceName field? I used litellm proxy to configure a vertex ai proxy, but I didn't seem to see this field. I see that your field is not actually used as a call, but made a dns service discovery? Is there a similar example that can be found?

feat(ai-proxy): support Google Cloud Vertex

ff51654

HecarimV requested review from cr7258, CH3CHO and rinfx as code owners April 24, 2025 07:53

feat(ai-proxy): support Google Cloud Vertex

ab41a50

CH3CHO requested changes May 2, 2025

View reviewed changes

HecarimV added 3 commits May 6, 2025 09:50

Merge branch 'main' into support-vertex-v2

e8afdbc

Merge branch 'main' into support-vertex-v2

2210c53

feat(ai-proxy): support Google Cloud Vertex

5261b3e

CH3CHO requested changes May 6, 2025

View reviewed changes

plugins/wasm-go/extensions/ai-proxy/provider/vertex.go Outdated Show resolved Hide resolved

feat(ai-proxy): support Google Cloud Vertex

7d5f899

HecarimV requested a review from CH3CHO May 26, 2025 07:34

HecarimV and others added 2 commits May 27, 2025 15:22

Merge branch 'main' into support-vertex-v2

70a20db

Update vertex.go

a0336cb

CH3CHO reviewed May 27, 2025

View reviewed changes

plugins/wasm-go/extensions/ai-proxy/provider/vertex.go Show resolved Hide resolved

feat(ai-proxy): cache vertex access token

0bbd6e0

CH3CHO requested changes Jun 9, 2025

View reviewed changes

plugins/wasm-go/extensions/ai-proxy/provider/vertex.go Outdated Show resolved Hide resolved

HecarimV added 3 commits June 9, 2025 14:27

feat(ai-proxy): support Google Cloud Vertex

bff20c1

Merge branch 'main' into support-vertex-v2

b822746

feat(ai-proxy): support Google Cloud Vertex

7583384

CH3CHO reviewed Jun 9, 2025

View reviewed changes

feat(ai-proxy): support Google Cloud Vertex

d9226f4

CH3CHO approved these changes Jun 9, 2025

View reviewed changes

CH3CHO merged commit d4e114b into alibaba:main Jun 9, 2025
12 checks passed

daixijun pushed a commit to daixijun/higress that referenced this pull request Jun 10, 2025

feat(ai-proxy): support Google Cloud Vertex (alibaba#2119)

a7a774e

Co-authored-by: Kent Dong <ch3cho@qq.com>

lingma-agents bot mentioned this pull request Jun 17, 2025

add release-notes of 2.1.4 #2433

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(ai-proxy): support Google Cloud Vertex #2119

feat(ai-proxy): support Google Cloud Vertex #2119

Uh oh!

HecarimV commented Apr 24, 2025 •

edited

Loading

Uh oh!

codecov-commenter commented Apr 24, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CH3CHO left a comment

Uh oh!

Uh oh!

CH3CHO left a comment

Uh oh!

Uh oh!

CH3CHO Jun 9, 2025

Uh oh!

CH3CHO left a comment

Uh oh!

Uh oh!

Colstuwjx commented Jun 18, 2025 •

edited by github-actions bot

Loading

Uh oh!

Uh oh!

feat(ai-proxy): support Google Cloud Vertex #2119

feat(ai-proxy): support Google Cloud Vertex #2119

Uh oh!

Conversation

HecarimV commented Apr 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Ⅰ. Describe what this PR did

Ⅱ. Does this pull request fix one issue?

Ⅳ. Describe how to verify it

Ⅴ. Special notes for reviews

Uh oh!

codecov-commenter commented Apr 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

CH3CHO left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

CH3CHO left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

CH3CHO Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

CH3CHO left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Colstuwjx commented Jun 18, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

HecarimV commented Apr 24, 2025 •

edited

Loading

codecov-commenter commented Apr 24, 2025 •

edited

Loading

Colstuwjx commented Jun 18, 2025 •

edited by github-actions bot

Loading