Google Vertex AI Integration

Google Vertex AI is Google Cloud’s fully managed AI platform that provides access to Gemini and other foundation models. This guide shows how to set up Google Vertex AI and integrate it with Passy.

Prerequisites

A Google Cloud account with billing enabled
A Passy account (Pro plan required for provider keys) or a self-hosted instance (free)

Overview

Google Vertex AI provides access to Google’s latest AI models, including Gemini Pro, Gemini Flash, and other foundation models, with enterprise-grade security and scalability. Passy connects to Vertex AI through a service account key that you add as a provider key.

Create Google Cloud Project & Enable Vertex AI

Create or Select a Google Cloud Project

Go to the Google Cloud Console
Click on the project dropdown at the top of the page
Click New Project or select an existing project
If creating new:
- Enter a Project name
- Select a Billing account
- Click Create

Enable Vertex AI API

In the Google Cloud Console, navigate to APIs & Services > Library
Search for Vertex AI API
Click on Vertex AI API
Click Enable
Wait for the API to be enabled (usually takes a few seconds)

Create Service Account

Navigate to IAM & Admin > Service Accounts
Click Create Service Account
Configure the service account:
- Service account name: Enter a name (e.g., passy-vertex-ai)
- Service account ID: Auto-generated
- Description: Optional description
Click Create and Continue
Grant the following role:
- Vertex AI User (roles/aiplatform.user)
Click Continue, then Done
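
The console steps so far (enabling the API, creating the service account, and granting the role) can also be scripted with the gcloud CLI. The sketch below only builds the commands as argument lists so they can be reviewed before anything runs. It assumes gcloud is installed and authenticated; the helper name gcloud_setup_commands and the project ID are placeholders, not part of Passy or Google Cloud.

```python
import shlex


def gcloud_setup_commands(project_id: str,
                          account_name: str = "passy-vertex-ai") -> list[list[str]]:
    """Build gcloud commands mirroring the console steps (hypothetical helper)."""
    # Service account emails follow NAME@PROJECT_ID.iam.gserviceaccount.com.
    email = f"{account_name}@{project_id}.iam.gserviceaccount.com"
    return [
        # Enable the Vertex AI API for the project.
        ["gcloud", "services", "enable", "aiplatform.googleapis.com",
         f"--project={project_id}"],
        # Create the service account.
        ["gcloud", "iam", "service-accounts", "create", account_name,
         f"--project={project_id}"],
        # Grant the Vertex AI User role to the service account.
        ["gcloud", "projects", "add-iam-policy-binding", project_id,
         f"--member=serviceAccount:{email}", "--role=roles/aiplatform.user"],
    ]


if __name__ == "__main__":
    # Print the commands for review; nothing is executed here.
    for cmd in gcloud_setup_commands("your-project-id"):
        print(shlex.join(cmd))
```

To execute a command instead of printing it, each list can be passed to subprocess.run(cmd, check=True).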

Generate API Key (Service Account Key)

In the Service Accounts list, find your newly created service account
Click on the service account email
Go to the Keys tab
Click Add Key > Create new key
Select JSON as the key type
Click Create
Important: The JSON key file is downloaded automatically. Keep it secure, because anyone who holds it can authenticate as the service account.

The JSON file contains your credentials and looks like this:

{
  "type": "service_account",
  "project_id": "your-project-id",
  "private_key_id": "...",
  "private_key": "...",
  "client_email": "...",
  ...
}
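
Before pasting the key into Passy, it can help to confirm that the file is intact. The following is a minimal sketch that checks only the fields shown in the sample above; real key files contain more fields, and the function name check_service_account_key is hypothetical.

```python
import json

# Fields shown in the sample above; real key files contain additional fields.
REQUIRED_FIELDS = ("type", "project_id", "private_key_id", "private_key", "client_email")


def check_service_account_key(raw: str) -> list[str]:
    """Return the problems found in the key JSON (an empty list means none found)."""
    try:
        key = json.loads(raw)
    except json.JSONDecodeError as exc:
        return [f"not valid JSON: {exc.msg}"]
    if not isinstance(key, dict):
        return ["top-level value is not a JSON object"]
    # Report every required field that is absent or empty.
    problems = [f"missing field: {name}" for name in REQUIRED_FIELDS if not key.get(name)]
    # A service account key identifies itself with type "service_account".
    if key.get("type") and key["type"] != "service_account":
        problems.append(f"unexpected type: {key['type']!r}")
    return problems
```

This check is purely structural. It does not contact Google Cloud, so a key that has been deleted or disabled will still pass.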

Note Your Project ID

You’ll need your Google Cloud Project ID for the integration. You can find it:

In the downloaded JSON key file (project_id field)
In the Google Cloud Console header
On the project dashboard
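
Reading the project ID straight from the key file avoids copy errors. A small sketch, assuming the key was saved locally; the function name project_id_from_key and the file path in the usage line are placeholders.

```python
import json
from pathlib import Path


def project_id_from_key(path: str) -> str:
    """Read the project_id field from a service account key file."""
    key = json.loads(Path(path).read_text(encoding="utf-8"))
    return key["project_id"]
```

For example, project_id_from_key("passy-vertex-ai-key.json") returns the value to enter as the Project ID in Passy.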

Add to Passy

Navigate to Provider Keys

Log into Passy Dashboard
Select your organization and project
Go to Provider Keys in the sidebar

Add Google Vertex AI Provider Key

Click Add for Google Vertex AI
Enter your Project ID from Google Cloud
Upload or paste the contents of your Service Account JSON key file
Select your preferred Region (e.g., us-central1, europe-west1, asia-southeast1)
Click Add Key

Passy validates your credentials and confirms the connection.

Test the Integration

Test your integration with a simple API call:

curl -X POST https://api.passy.ai/v1/chat/completions \
  -H "Authorization: Bearer PASSY_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "vertex/gemini-3.0-flash",
    "messages": [
      {
        "role": "user",
        "content": "Hello from Google Vertex AI!"
      }
    ]
  }'

Replace PASSY_API_KEY with your Passy API key.
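
The same request can be made from Python using only the standard library. This sketch mirrors the curl call above (same URL, headers, and body). The function names are hypothetical, and the response is returned as parsed JSON without assuming anything about its shape.

```python
import json
import urllib.request

# Endpoint taken from the curl example above.
PASSY_URL = "https://api.passy.ai/v1/chat/completions"


def build_chat_request(api_key: str, prompt: str,
                       model: str = "vertex/gemini-3.0-flash") -> urllib.request.Request:
    """Build the same POST request as the curl example, without sending it."""
    body = {"model": model, "messages": [{"role": "user", "content": prompt}]}
    return urllib.request.Request(
        PASSY_URL,
        data=json.dumps(body).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat_request(request: urllib.request.Request, timeout: float = 30.0):
    """Send the request and return the parsed JSON response."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.load(response)
```

A typical call would be send_chat_request(build_chat_request(api_key, "Hello from Google Vertex AI!")), where api_key holds your Passy API key. Keeping request construction separate from sending makes it possible to inspect the request before any network traffic occurs.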

Available Models

Once configured, you can access Google Vertex AI models through Passy:

Gemini 3.0 Flash: vertex/gemini-3.0-flash
Gemini 3.0: vertex/gemini-3.0
Gemini 2.0 Flash: vertex/gemini-2.0-flash-exp

Note: Model availability may vary by region. Check the Google Cloud documentation for the latest model availability, and browse all available models at passy.ai/models.

Troubleshooting

“Permission denied” error

Verify your service account has the Vertex AI User role
Check that the Vertex AI API is enabled in your project
Ensure your service account key is valid and has not been deleted, disabled, or expired

“Project not found” error

Verify the project ID is correct
Ensure billing is enabled for your Google Cloud project
Check that you have access to the project

“Model not available” error

Verify the model is available in your selected region; the Google Cloud documentation lists model availability by region
Some models may require allowlist access

Rate limiting

Google Vertex AI has quota limits per project and region
Monitor usage in the Google Cloud Console under Vertex AI > Quotas
Request quota increases through the Google Cloud Console if needed
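
While waiting for a quota increase, client code can retry rate-limited calls with exponential backoff. The sketch below assumes that quota exhaustion surfaces as an HTTP 429 response raised as urllib.error.HTTPError; that status code is an assumption, not something this guide confirms, and the function name call_with_backoff is hypothetical.

```python
import random
import time
import urllib.error


def call_with_backoff(send, max_attempts: int = 5, base_delay: float = 1.0,
                      sleep=time.sleep):
    """Call send() and retry on HTTP 429 with exponential backoff plus jitter."""
    for attempt in range(max_attempts):
        try:
            return send()
        except urllib.error.HTTPError as err:
            # Assumption: rate limiting is reported as HTTP 429.
            # Any other error, or the final failed attempt, is re-raised.
            if err.code != 429 or attempt == max_attempts - 1:
                raise
            # Delay doubles each attempt; jitter spreads out concurrent clients.
            delay = base_delay * (2 ** attempt) + random.uniform(0, base_delay)
            sleep(delay)
```

The send argument is any zero-argument callable that performs one request. The sleep parameter exists so the delay can be replaced in tests.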

Region availability

Not all models are available in all regions
Check Vertex AI locations for model availability
Consider using multiple regions for better availability
