Hub and Spoke via VPC Peering

This blueprint creates a simple Hub and Spoke setup, where satellite VPC networks (spokes) are connected to a single intermediary network (hub) via VPC Peering.

The blueprint shows some of the limitations that need to be taken into account when using VPC Peering, mostly due to the lack of transitivity between peerings:

  • no mesh networking between the spokes
  • complex support for managed services hosted in tenant VPCs connected via peering (Cloud SQL, GKE, etc.)

One possible solution to the managed service limitation above is presented here, using a static VPN to establish connectivity to the GKE masters in the tenant project (courtesy of @drebes). Other solutions typically involve the use of proxies, as described in this GKE article.

Another topic that needs to be considered when using peering is the limit of 25 peerings in each peering group, which constrains the scalability of designs like the one presented here.

The blueprint has been purposefully kept simple to show how to use and wire the VPC modules together, and so that it can be used as a basis for more complex scenarios. This is the high level diagram:

High-level diagram
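
As a rough sketch of how the modules are wired together (module path and variable names are illustrative and abridged; check main.tf for the authoritative configuration), each spoke is peered with the hub via the net-vpc-peering module, with custom route export enabled on the hub side:

  # illustrative sketch only: peer spoke 1 with the hub and export the hub's
  # custom routes (e.g. the VPN route towards the GKE master range)
  module "hub-to-spoke-1-peering" {
    source                     = "../../../modules/net-vpc-peering"
    local_network              = module.vpc-hub.self_link
    peer_network               = module.vpc-spoke-1.self_link
    export_local_custom_routes = true
    export_peer_custom_routes  = false
  }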

Managed resources and services

This sample creates several distinct groups of resources:

  • one VPC for the hub and one for each spoke
  • one set of firewall rules for each VPC
  • one Cloud NAT configuration for each spoke
  • one test instance for each spoke
  • one GKE cluster with a single nodepool in spoke 2
  • one service account for the GCE instances
  • one service account for the GKE nodes
  • one static VPN gateway in the hub and one in spoke 2, with a single tunnel each

Testing GKE access from spoke 1

As mentioned above, a VPN tunnel is used as a workaround for the peering transitivity issue that would prevent any VPC other than spoke 2 from connecting to the GKE master. This diagram illustrates the solution:

Network-level diagram
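
If you want to see the exact endpoint the VPN makes reachable, you can look up the cluster's private master address (cluster name and zone as used elsewhere in this blueprint):

gcloud container clusters describe cluster-1 --zone europe-west1-b \
  --format='value(privateClusterConfig.privateEndpoint)'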

To test cluster access, first log on to the spoke 2 instance and confirm cluster and IAM roles are set up correctly:

gcloud container clusters get-credentials cluster-1 --zone europe-west1-b
kubectl get all

The blueprint configures the peering with the GKE master VPC to export routes for you, so that VPN routes are passed through the peering. You can disable this by hand in the console or by editing the peering_config variable in the gke-cluster module, to test non-working configurations or to switch to using the GKE proxy.

Change the GKE cluster module and add a new variable after private_cluster_config:

  peering_config = {
    export_routes = true
    import_routes = false
  }

If you added the variable after applying, simply apply Terraform again.

Export routes via gcloud (alternative)

If you prefer to use gcloud to export routes on the peering, first identify the peering (it has a name like gke-xxxxxxxxxxxxxxxxxxxx-xxxx-xxxx-peer) in the Cloud Console from the VPC network peering page, or using gcloud, then configure it to export routes:

gcloud compute networks peerings list
# find the gke-xxxxxxxxxxxxxxxxxxxx-xxxx-xxxx-peer in the spoke-2 network
gcloud compute networks peerings update [peering name from above] \
  --network spoke-2 --export-custom-routes
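
To check which routes are actually exchanged over the peering (the VPN route towards the GKE master range should show up among the exported ones), you can list them with gcloud; the region shown assumes the blueprint's default:

gcloud compute networks peerings list-routes [peering name from above] \
  --network spoke-2 --region europe-west1 --direction OUTGOING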

Test routes

Then connect via SSH to the spoke 1 instance and run the same commands you ran on the spoke 2 instance above; you should now be able to run kubectl commands against the cluster. To test the default situation with no supporting VPN, comment out the two VPN modules in main.tf and run terraform apply to bring down the VPN gateways and tunnels; GKE should then only be accessible from spoke 2.
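
A minimal test session from spoke 1 could look like the following; the instance name is illustrative and depends on the prefix variable, so adjust it to match your deployment:

# SSH into the spoke 1 test instance (name and zone are illustrative)
gcloud compute ssh spoke-1-instance --zone europe-west1-b
# then, from the instance, repeat the spoke 2 checks
gcloud container clusters get-credentials cluster-1 --zone europe-west1-b
kubectl get all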

Operational considerations

A single pre-existing project is used in this blueprint to keep variables and complexity to a minimum; in a real-world scenario each spoke would use a separate project (and Shared VPC).

A few APIs need to be enabled in the project; if apply fails because a service is not enabled, just click on the link in the error message to enable it for the project, then resume the apply.
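
Alternatively, you can enable the most obvious candidates up front; the list below is an assumption based on the resources this blueprint creates (GCE, GKE) rather than an exhaustive one:

gcloud services enable compute.googleapis.com container.googleapis.com \
  --project [project id]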

The VPN used to connect the GKE master's VPC does not account for HA; upgrading to HA VPN is reasonably simple using the relevant module.

Variables

name                     description                                        type          required  default
project_id               Project id used for all resources.                 string        ✓
ip_ranges                IP CIDR ranges.                                     map(string)             {…}
ip_secondary_ranges      Secondary IP CIDR ranges.                           map(string)             {…}
prefix                   Arbitrary string used to prefix resource names.     string                  null
private_service_ranges   Private service IP CIDR ranges.                     map(string)             {…}
project_create           Set to non null if project needs to be created.     object({…})             null
region                   VPC region.                                         string                  "europe-west1"

Outputs

name      description   sensitive
project   Project id.
vms       GCE VMs.