Session 9.2: Advanced Security: Zero Trust

Course → Module 9: Advanced Topics & Emerging Architectures

The End of the Castle

Traditional network security follows the castle-and-moat model. There is a perimeter. Everything inside the perimeter is trusted. Everything outside is not. Firewalls guard the gates. VPNs extend the walls. Once you are inside, you move freely.

This model worked when the perimeter was clear: a corporate office, a data center, a known set of machines. It fails in a world of cloud services, remote workers, mobile devices, third-party APIs, and microservices that communicate across networks you do not control. The perimeter dissolved. The moat dried up. But many organizations kept behaving as though the castle still stood.

Zero trust is the security model built for a world without perimeters.

Zero trust does not mean "trust nothing." It means "verify everything, every time."

Three Core Principles

NIST Special Publication 800-207, published in August 2020, defines zero trust architecture around three principles.

1. Verify explicitly. Every access request must be authenticated and authorized based on all available data: identity, location, device health, service or workload, data classification, and anomalies. There is no "already inside, so probably fine."

2. Use least privilege access. Grant the minimum permissions needed for the task at hand. Use just-in-time and just-enough-access policies. If a service needs to read from a database, it should not have write permissions. If it needs write permissions for five minutes during a migration, revoke them after six.

3. Assume breach. Design as though an attacker is already inside your network. Minimize blast radius. Segment access. Encrypt all traffic, including internal traffic. Verify end-to-end. This assumption changes how you build everything.

Perimeter Security vs. Zero Trust

Dimension	Perimeter Security	Zero Trust
Trust model	Trust internal network implicitly	Trust nothing implicitly, verify every request
Network boundary	Clear perimeter (firewall, VPN)	No perimeter; identity is the new perimeter
Lateral movement	Easy once inside	Restricted; microsegmentation enforced
Authentication	At the gate (login once, roam freely)	Continuous; re-verified per request
Encryption	Perimeter (TLS at the edge)	End-to-end (mTLS between all services)
Access policy	Broad roles, long-lived credentials	Fine-grained, just-in-time, short-lived tokens
Breach response	Detect at the perimeter, contain inside	Detect everywhere, blast radius is already small
Remote workers	VPN required	Same policy regardless of location

Service Mesh and mTLS

In a microservices architecture, services call other services constantly. In a perimeter model, these calls happen over plain HTTP inside the "trusted" network. In zero trust, every service-to-service call must be authenticated and encrypted.

A service mesh handles this transparently. Tools like Istio, Linkerd, and Consul Connect deploy a sidecar proxy alongside each service. The proxy handles mutual TLS (mTLS), meaning both sides of every connection present certificates and verify each other. The application code does not change. The mesh handles identity, encryption, and policy enforcement at the infrastructure layer.

mTLS differs from standard TLS. In standard TLS, only the server presents a certificate. The client verifies the server but the server does not verify the client. In mTLS, both sides authenticate. This prevents a compromised service from impersonating another.

graph TB subgraph "Zero Trust Network with Service Mesh" direction TB GW[API Gateway
Identity Verification] --> PA[Policy Agent
Authorization] subgraph "Service A Pod" A[Service A] --- PA1[Sidecar Proxy] end subgraph "Service B Pod" B[Service B] --- PB1[Sidecar Proxy] end subgraph "Service C Pod" C[Service C] --- PC1[Sidecar Proxy] end PA --> PA1 PA1 -->|mTLS| PB1 PA1 -->|mTLS| PC1 PB1 -->|mTLS| PC1 CP[Control Plane
Certificate Authority
Policy Distribution] -.->|certs & policy| PA1 CP -.->|certs & policy| PB1 CP -.->|certs & policy| PC1 SIEM[SIEM
Log Aggregation
Threat Detection] -.->|telemetry| PA1 SIEM -.->|telemetry| PB1 SIEM -.->|telemetry| PC1 end style GW fill:#c8a882,stroke:#111110,color:#111110 style PA fill:#6b8f71,stroke:#111110,color:#111110 style CP fill:#8a8478,stroke:#111110,color:#ede9e3 style SIEM fill:#c47a5a,stroke:#111110,color:#111110 style A fill:#ede9e3,stroke:#111110,color:#111110 style B fill:#ede9e3,stroke:#111110,color:#111110 style C fill:#ede9e3,stroke:#111110,color:#111110

Secrets Management

Zero trust requires that secrets (API keys, database passwords, TLS certificates) are never hardcoded, never stored in environment variables long-term, and never shared between services. Secrets management tools like HashiCorp Vault, AWS Secrets Manager, and Azure Key Vault provide centralized, audited, and rotatable secret storage.

Vault, for example, can issue short-lived database credentials on demand. A service requests credentials, receives a username and password valid for one hour, and the credentials are automatically revoked when they expire. If the service is compromised, the attacker gets credentials that expire shortly. Compare this with a shared database password that has not been rotated in two years.

Dynamic secrets, automatic rotation, and lease-based access are the mechanisms that make least privilege practical instead of theoretical.

SIEM for Threat Detection

If you assume breach, you need the ability to detect anomalies in real time. Security Information and Event Management (SIEM) systems aggregate logs from every component, correlate events, and detect patterns that indicate compromise.

A SIEM might detect that Service A, which normally makes 100 requests per minute to Service B, suddenly made 10,000 requests. Or that a service account authenticated from an IP address it has never used before. Or that a database query returned 50 million rows when the typical result set is 50.

Tools like Splunk, Elastic Security, Microsoft Sentinel, and AWS Security Hub serve this role. The zero trust principle of "assume breach" only works if you have the observability to detect when a breach actually happens.

Implementing Zero Trust Layer by Layer

Zero trust is not a product you buy. It is a design approach applied at multiple layers.

Network layer: Microsegmentation. Services can only reach the specific services they need. Default deny for all traffic. Network policies in Kubernetes. Security groups in AWS. No flat networks.

Service layer: mTLS between all services. Service identity via certificates (SPIFFE/SPIRE). Authorization policies enforced by the mesh. No service trusts another service just because it is on the same network.

Data layer: Encryption at rest and in transit. Field-level encryption for sensitive data. Access logging on every data store. Row-level security where supported. Data classification and handling policies enforced programmatically.

Assignment

You have a microservices system where all internal services currently trust any request that arrives on the internal network. There is no service-to-service authentication. Database credentials are stored as environment variables and have not been rotated in 18 months.

Redesign the system for zero trust. Address each layer specifically:

Network layer: What changes? What is the default policy? How do you segment?
Service layer: How do services authenticate to each other? What tools do you introduce?
Data layer: How do you manage secrets? What is the credential lifecycle? How do you detect misuse?

Draw a diagram showing the before and after states. Identify the three highest-risk gaps in the current system and explain how your redesign addresses each one.

Advanced Security: Zero Trust