Memo: Learn GCP Global infrastructure

Supanut Laddayam
3 min readApr 22, 2024

--

Global infrastructure = the global presence of data centers, network and cloud resources.

Region

region — is independent geographic areas that are collection of zones.

  • at GCP has 25 regions
  • when want to launch a cloud resources (eg. VM), so it need to choose the region.

Zone

zone — is physical location that made up of 1 or more datacenter

Region & zone relation: 1 Region will container 3+ zones.

Datacenter in zone will be isolated from each other but they will be close enough to provide low-latency.

  • Practice to run workload in at least 3 zones to ensure service remain available (High Availability)
  • Once you choose Region, you will proceed to choose your Zone when launching cloud resource.

Edge Network

edge network — is a practice to have compute and resource as closest as possible to the end user in order to deliver the lowest latency and save bandwidth.

Point of Presence (PoP)

PoP — is a location between GCP region and end user. -> The location could be a 3rd party datacenter or collection of hardware

PoP 3 types:

  1. Edge PoP — is a location where user can quickly enter (ingress) to GCP network for accelerated access to cloud resource.
  2. CDN PoP — is a location where serve (egress) cache resource (website, file or assets) for load very fast to end user.
  3. Cloud Media Edge — is a location specialized for delivery of media (Video content)

Resource Scoping

One zone is a deployment area for Gcloud resources with in a region. So only one zone should be considered a single failure in region. To deploy redundant resource is multiple zones (multi-zones) that help fault tolerance and high availability.

Product / service can scoped based on:

  • Zonal resource — resource resides in a single zone in single region.
  • Regional resource — resource resides in multiple zone in single region.
  • Multiple regional resource — resource resides across multiple specific zone.
  • Global service — resource resides globally and regions and zone.
  • Internal service — resource used by many other service, but you don’t interact with these services directly.

Cloud Interconnect

cloud interconnect — is Gcloud service to provide direct physical connection between on-premise network and GCloud network. Its allow you to transfer large amounts of data between network which more cost-effective than purchasing bandwidth over public internet.

2 types of Cloud interconnect:

  1. Dedicated — a physical direct connection on-premise network and Gcloud network through a co-location facility -> can transfer 10–200 Gbps
  2. Partner — a physical direct connection on-premise network and Gcloud network through a trusted 3rd party -> can transfer 50 Mbps to 10 Gbps
  • Co-location — aka Carrier-hotel is a datacenter are available for rental for retail customer.

Latency

Latency — is time delay between two physical systems.

Lag — is time noticeable delay between the action of input and the reaction of the server send back to the client.

  • Inter-regional latency — latency between region
  • Inter-zonal latency — latency betwwen zone in signle region

*Latency between region is alway more that zone

--

--

No responses yet