Direct VPC egress vs Serverless VPC Access for Cloud Run: our default
We default to Direct VPC egress for Cloud Run because it is the cleaner networking shape: fewer moving parts, no connector resource, and costs that scale with the service instead of beside it.
The default
We default to Direct VPC egress for Cloud Run.
At this point, it’s the cleaner Cloud Run networking shape. It keeps the service attached to the VPC without dragging a connector resource into every deployment by default. Fewer moving parts. Fewer things to size, explain, secure, and pay for on the side.
A connector isn’t evil. It just isn’t where we want to start if Cloud Run can already do the simpler thing directly.
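As a sketch, the direct shape is just flags on the deploy itself. Service, image, network, and subnet names below are placeholders; the flags are the real ones for Direct VPC egress on gcloud run deploy.

```shell
# Direct VPC egress: no connector resource, just the service
# attached to a VPC network and subnet at deploy time.
gcloud run deploy internal-api \
  --image=REGION-docker.pkg.dev/PROJECT/repo/internal-api \
  --region=europe-west1 \
  --network=platform-vpc \
  --subnet=run-egress-subnet \
  --vpc-egress=private-ranges-only
```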
Why the simpler shape wins
The main advantage isn’t philosophical. It’s operational.
With Direct VPC egress, the networking story stays closer to the service. There is no separate connector sitting beside it with its own lifecycle and its own cost shape. The service talks to the VPC, and the infrastructure diagram stays closer to what’s actually running.
That also keeps the cost model cleaner. With connectors, you aren’t just paying for the service. You’re also carrying connector capacity as its own thing. Direct VPC egress removes that extra layer, which is a better default for small teams and small platforms.
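For contrast, the connector path means creating and sizing a separate resource before the service can even reference it. Names, ranges, and instance counts here are illustrative:

```shell
# The connector is its own resource, with its own /28 range and its
# own instance capacity to size and pay for.
gcloud compute networks vpc-access connectors create run-connector \
  --region=europe-west1 \
  --network=platform-vpc \
  --range=10.8.0.0/28 \
  --min-instances=2 \
  --max-instances=10

# The service then routes through it instead of talking to the VPC directly.
gcloud run deploy internal-api \
  --image=REGION-docker.pkg.dev/PROJECT/repo/internal-api \
  --region=europe-west1 \
  --vpc-connector=run-connector
```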
Security and ownership get cleaner too
The security story gets cleaner for the same reason.
With Direct VPC egress, network tags can be attached to the Cloud Run workload itself instead of being pushed through connector-level infrastructure. That makes firewall intent easier to follow because the policy sits closer to the thing that actually owns the traffic.
It doesn’t magically simplify network design. You still need to decide what should reach what. But it’s a cleaner place to hang that decision than a shared connector people stop thinking about once it’s in the graph, especially once internal-only ingress, private reachability, egress mode, and VPC routing all start depending on the same boundary being set cleanly.
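A sketch of what "policy closer to the workload" looks like in practice, with hypothetical tag, network, and range names:

```shell
# Tag the Cloud Run workload itself...
gcloud run deploy internal-api \
  --image=REGION-docker.pkg.dev/PROJECT/repo/internal-api \
  --region=europe-west1 \
  --network=platform-vpc \
  --subnet=run-egress-subnet \
  --network-tags=internal-api

# ...then write firewall intent against that tag, not a shared connector.
gcloud compute firewall-rules create allow-internal-api-to-postgres \
  --network=platform-vpc \
  --direction=EGRESS \
  --action=ALLOW \
  --rules=tcp:5432 \
  --target-tags=internal-api \
  --destination-ranges=10.20.0.0/24
```

The firewall rule now reads as "this service may reach the database range," which is the intent, instead of "whatever sits behind the connector may reach the database range."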
The caveats are real
This isn’t a “Direct VPC perfect, connectors bad” argument. Direct VPC egress has real caveats, and they’re worth taking seriously. Startup can be awkward: connectivity to the egress destination can take a while to come up on a fresh instance. Throughput is capped per instance, and there are quotas on how many instances can use Direct VPC egress at once. Networking maintenance can still break open connections, which means client behavior needs to tolerate resets instead of acting surprised every time infrastructure behaves like infrastructure.
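What "tolerate resets" can look like on the client side, as a minimal sketch for idempotent calls. The retry helper is ours, not a Cloud Run feature, and it assumes a reset surfaces as a nonzero exit code:

```shell
# Retry an idempotent command a few times with linear backoff,
# treating any nonzero exit (e.g. a reset connection) as retryable.
retry() {
  local attempts=3 n=1
  until "$@"; do
    if [ "$n" -ge "$attempts" ]; then
      return 1              # still failing after the last attempt
    fi
    sleep "$n"              # back off before trying again
    n=$((n + 1))
  done
}

# Usage (only for requests that are safe to repeat):
#   retry curl --fail --silent --max-time 5 http://10.20.0.5:8080/health
```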
So the win here isn’t that you get to stop thinking. The win is that the default shape is simpler while you’re thinking.
Subnets stop being background detail
Cloud Run allocates IP addresses from the subnet you attach. That means subnet size isn’t decorative anymore. If the service scales up, rolls to a new revision, or uses jobs aggressively, IP consumption becomes part of whether the service can start cleanly. At that point, “network config” isn’t separate from runtime behavior. It’s runtime behavior.
That’s one of the more useful side effects of Direct VPC egress. It forces the network boundary to be honest. If the subnet is too small or the IP plan is sloppy, the platform tells you directly instead of hiding the problem behind another resource.
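A rough sketch of taking subnet size seriously up front. The range and names are illustrative; GCP reserves four addresses in every IPv4 subnet, and a revision rollout can briefly hold addresses for the old and new revision at once, so the budget should not be "peak instances, exactly."

```shell
# /24 → 256 addresses, minus 4 reserved by GCP → ~252 usable.
# Budget for peak instances with headroom, since rollouts and jobs
# can hold more addresses than the steady-state instance count.
gcloud compute networks subnets create run-egress-subnet \
  --network=platform-vpc \
  --region=europe-west1 \
  --range=10.10.0.0/24
```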
The egress mode still matters
Direct VPC egress doesn’t remove the actual routing decision.
You still need to choose whether the service should send only private ranges through the VPC or send all traffic through it. That isn’t console trivia. It changes what the service depends on, what paths are private, and where failure or latency can show up.
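The two modes are an explicit flag on the service, and they're worth choosing deliberately rather than inheriting. Service and region names are placeholders:

```shell
# Only private ranges route through the VPC; other traffic takes the
# default internet path.
gcloud run services update internal-api \
  --region=europe-west1 \
  --vpc-egress=private-ranges-only

# Everything routes through the VPC; internet-bound traffic now
# depends on the VPC having a path out (e.g. Cloud NAT).
gcloud run services update internal-api \
  --region=europe-west1 \
  --vpc-egress=all-traffic
```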
Why this is a good default for small teams
For SME internal platforms, the default should reduce platform drag.
That’s why this fits naturally with Cloud Run as the default. If the service can live comfortably inside the Cloud Run model, the VPC story should feel like part of that same low-ownership runtime. It shouldn’t turn into a side quest in connector management before the workload has earned that complexity.
Direct VPC egress gets us closer to that shape.
When the default stops being enough
Sometimes the surrounding system stops being simple enough for the simplest shape.
Maybe private networking assumptions are spreading across a lot of services. Maybe service-to-service topology is getting denser. Maybe the network design wants more cluster-shaped constructs, more involved east-west traffic, or a broader container estate where Cloud Run is no longer the whole picture. In that kind of setup, GKE Autopilot often becomes the cleaner fit. Not because Direct VPC egress failed, but because the system around it stopped being mostly Cloud Run-shaped.
The point
We default to Direct VPC egress because it’s the cleaner Cloud Run networking shape.
It removes connector infrastructure from the normal path, keeps costs and security controls closer to the service, and lowers platform drag. If the caveats matter more than the simplicity, we can make a different choice. Until then, the default should stay simple.