Resolve Security Vulnerability & Implement Cost Optimizations by Pravalika-Batchu · Pull Request #86 · gemma-facet/cloud-services

Pravalika-Batchu · 2025-12-25T14:11:22Z

Security Issue Fixed
Problem: Users had to submit API keys (Gemini & Hugging Face) in requests, creating security vulnerability.

Solution:

Made gemini key and hf token optional in all requests
Services now use GOOGLE_API_KEY and HF_TOKEN environment variables
No more API key exposure in request payloads

Cost Optimization
Problem: Text similarity rewards made 2N API calls for N comparisons.

Solution: Switched to batch embedding API, reducing calls to 1 per batch = ~50% cost savings.

Local Development
Added support for running services locally without GCP credentials.

Key Changes
Preprocessing: Made Gemini API key optional, uses env var
Inference/Training: Made HF token optional, uses env var
Rewards: Implemented batch_embed_contents for cost savings
All Services: Added local testing mode

✅ Testing Verified
✅ Requests work without API keys
✅ Services use environment variables
✅ Batch API reduces costs
✅ Local development works
✅ Backward compatibility maintained
Environment Setup:

Production

GOOGLE_API_KEY=your-gemini-key
HF_TOKEN=your-huggingface-token

Local dev

STORAGE_TYPE=local

Impact
Security: Eliminated API key exposure
Cost: ~50% reduction in embedding costs
Dev Experience: Easy local testing

Closes: #84 - API key security vulnerability

Ready for review! 🚀

Test: STORAGE_TYPE=local python app.py then curl http://localhost:8080/health

- Remove user-submitted API keys security vulnerability - Use server-side GOOGLE_API_KEY environment variable - Implement batch embedding API for 50% cost reduction - Add local testing support without GCP credentials - Update preprocessing service to handle optional API keys - Modify text similarity rewards to use batch_embed_contents

…ation Security and cost optimization improvements

- Remove user-submitted HF tokens from all request schemas - Use HF_TOKEN environment variable server-side for authentication - Make hf_token optional in InferenceRequest, BatchInferenceRequest, EvaluationRequest, TrainRequest - Update login_hf function to use HF_TOKEN env var as fallback - Update validation logic to allow server-side token usage - Maintain backward compatibility while improving security This eliminates another API key security vulnerability where users could submit Hugging Face tokens in requests.

…ation Extend security fixes to Hugging Face authentication

supreme-gg-gg · 2026-01-14T02:31:11Z

jobs/training-job/main.py


-    if train_request.hf_token:
-        login(token=train_request.hf_token)
+    # Login to Hugging Face using provided token or HF_TOKEN env var


Hi, thanks for your interest in contributing! However, the proper way to do this would be using OAuth which is described in #84. Using env var means users cannot connect their own accounts, which is needed to push and pull from HF hub!

supreme-gg-gg · 2026-01-14T02:31:57Z

jobs/training-job/rewards.py

+                completion_texts = [_get_completion_text(comp) for comp in completions]
+                all_texts = completion_texts + reference_values
+                # Batch embed all texts
+                batch_response = _genai_client.models.batch_embed_contents(


This does seem valid, we should batch all the calls to LLM as judge as well! They will also improve the efficiency since batched calls are cheaper (by almost 50% IIRC).

Pravalika-Batchu added 4 commits December 25, 2025 13:38

Merge pull request #1 from Pravalika-Batchu/security-and-cost-optimiz…

e459df6

…ation Security and cost optimization improvements

Merge pull request #2 from Pravalika-Batchu/security-and-cost-optimiz…

ac329a2

…ation Extend security fixes to Hugging Face authentication

supreme-gg-gg reviewed Jan 14, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

Resolve Security Vulnerability & Implement Cost Optimizations#86

Resolve Security Vulnerability & Implement Cost Optimizations#86
Pravalika-Batchu wants to merge 4 commits intogemma-facet:mainfrom
Pravalika-Batchu:main

Pravalika-Batchu commented Dec 25, 2025

Uh oh!

supreme-gg-gg Jan 14, 2026

Uh oh!

supreme-gg-gg Jan 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Comments

Conversation

Pravalika-Batchu commented Dec 25, 2025

Production

Local dev

Uh oh!

supreme-gg-gg Jan 14, 2026

Choose a reason for hiding this comment

Uh oh!

supreme-gg-gg Jan 14, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants