Best Practices for Secure Istio Deployment with Gloo Mesh Core

Best practices for secure Istio deployment with Gloo Mesh Core

The recent vulnerabilities discovered by Wiz in a popular cloud service highlight critical misconceptions about container security and the role of service meshes like Istio. In this case, a workflow service allowed arbitrary user code to be run, and Istio was implicated as part of the security bypass with which the researchers were able to gain network access. Upon closer examination, the fundamental problem wasn’t with Istio itself, but a combination of an assumption that Istio sidecars act as as an egress firewall and the fact that the environment wasn’t secured in such a way that Istio couldn’t be impersonated. The researchers were able to:

Run arbitrary Kubernetes pod and modify the pod spec (this is the functionality provided by the SAP AI service)
Configure pods with specific UID, where one of them, 1337, wasn’t blocked by the Kubernetes admission controller
Gain unrestricted network access.

It’s crucial to understand that impersonating a UID and changing process namespace is not an Istio-specific issue.

Istio, like other service meshes, is not designed to be a complete security solution, particularly not as a client-side egress firewall. Istio’s documentation explicitly states that it doesn’t claim to secure pod egress and that network policies should be used instead.

Istio offers security for pod ingress using mTLS and authorization policies, and general egress management with a combination of network policies and an egress gateway.

The importance of defense-in-depth

This example illustrates the critical importance of a defense-in-depth approach in cloud-native environments. This strategy, which involves implementing multiple layers of security controls, is essential for protecting against sophisticated attacks and mitigating the impact of any single point of failure.

Here are a couple of areas where additional layers of security could have prevented or mitigated the attack:

Network Layer: While Istio provides traffic management and mTLS between services, additional network-level isolation between tenants was lacking. Kubernetes Network Policies or similar were needed for this layer of protection.
Pod Security: The ability to run pods with arbitrary UIDs represents a significant security gap. A proper isolation level for user-generated pods could have prevented this.
Authentication and Authorization: Once network restrictions were bypassed, internal services appeared to lack robust authentication mechanisms, highlighting the danger of an overly trusted internal network.
Continuous Monitoring and Alerting: The absence of real-time monitoring for security anomalies allowed the attack to progress undetected.

This multi-layered approach is about more than having multiple security tools. It’s about creating a cohesive security strategy where each layer complements and reinforces the others. When one layer is compromised, the others should still provide protection.

Even with the best security tools in place, the user is responsible for fully understanding the tool’s capabilities and using it correctly. It’s like purchasing the world’s strongest door; it offers no protection if it’s installed backward. A 2023 report by Thales claims that 55% of Kubernetes-related security risks stem from misconfigurations. This statistic underscores the critical importance of proper setup and configuration in maintaining a secure environment.

Gloo Mesh Core insights

An essential component of defense-in-depth is the ability to detect and respond swiftly to potential security issues, including those arising from misconfigurations. This is where Gloo Mesh Core’s security insights and observability features become invaluable. Gloo Mesh Core seamlessly integrates with your existing Istio environments, providing comprehensive insights into service mesh performance, best practices for configuration, and detailed observability and security analysis.

The invalid application UID insight is designed to detect scenarios where pods run with unexpected or potentially dangerous UIDs. This is exactly the type of issue that was exploited in the above case. The insights engine inspects your configuration and live telemetry data from its OTel pipeline to validate that your environment aligns with the best practices Solo.io has built over years of helping customers deliver production-grade platforms at scale.

Gloo Mesh Core leverages OpenTelemetry to collect telemetry data from various sources across all your clusters. With OpenTelemetry, you can establish pipelines for these diverse sources, consolidating all your telemetry data in a single location. The pipelines make it easy to integrate the Gloo Mesh Core insights engine with any OpenTelemetry compatible backend, including OSS tools like Prometheus, or observability vendors. The Gloo UI presents these observability details in a unified, single pane of glass. Utilizing this comprehensive service graph is another key tool for identifying and addressing any security issues.

Conclusion

By leveraging Gloo Mesh Core’s insights and following these best practices, organizations can build more secure, resilient, and observable service mesh deployments. This proactive approach not only helps prevent vulnerabilities like the one discovered by Wiz, but also provides the visibility and control needed to manage complex, cloud-native environments securely.

Security in modern cloud-native architectures is an ongoing process. Regular audits, continuous monitoring, and staying updated with the latest security practices are key to maintaining a robust security posture in your Istio-enabled environments.

Best practices for secure Istio deployment with Gloo Mesh Core

The importance of defense-in-depth

Gloo Mesh Core insights

Conclusion

Featured content

Part Two: MCP Authorization The Hard Way

Part One: MCP Authorization The Hard Way

Agent Identity and Access Management - Can SPIFFE Work?

Deep Dive into llm-d and Distributed Inference

Gloo Mesh 2.8 simplifies service mesh operations with new enhanced user experience across multi-cluster environments.

Gloo Gateway 1.19 accelerates context-rich, real-time AI apps with Gateway API

llm-d: Distributed Inference Serving on Kubernetes

AI Reliability Engineering For More Dependable Humans

Kubernetes Identity the Right Way with SPIRE and Ambient

Optimizing GenAI in Production: High-Value Use Cases for AI Gateways

Solo.io Recognized as a Visionary in the 2024 Gartner® Magic Quadrant™ for API Management for the SECOND year in a row.

Guardians of the Governance: GenAI Gateway Guidance with GitOps and Gloo

Istio Ambient Waypoint Proxy explained

Hands-On with the Kubernetes Gateway API and Envoy Proxy: A Tutorial with GitOps and Gloo Gateway

Istio and the State of DevOps: Enhancing Key Metrics

What is an AI Gateway and its role in AI Applications?

Best practices for secure Istio deployment with Gloo Mesh Core

Gloo Mesh 2.6: Istio's Ambient mode now ready for production

HTTP Observability Without Compromises

Advance your knowledge of service mesh tech with Solo.io Academy certifications

Service Mesh for the developer workflow, a series

Challenges of adopting service mesh in enterprise organizations

Service Mesh in the Real World #2 — Ingress Traffic Control

Service Mesh in the Real World Video Series – Episode # 1: Egress Traffic

Service Mesh the easy way with AWS App Mesh and SuperGloo

Webinar Recap: Intro to Service Mesh Hub and SMI

D-TECK Uses Solo.io Gloo Gateway and Google Cloud to Help Businesses Make Better HR Decisions

Minimize the blast radius of changes with Solo.io Gloo Gateway and Weaveworks Flagger

Announcing Service Mesh Interface (SMI) Support and Collaboration

Service Mesh Interface (SMI) and our Vision for the Community and Ecosystem

The need for a standard, service mesh API

SuperGloo to the Rescue! Making it easier to write extensions for Service Mesh

Introducing The Service Mesh Hub -everything you need for your service mesh

Kubernetes Ingress Past, Present, and Future

Solo.io Streamlines Service Mesh and Serverless Adoption for Enterprises in Google Cloud

ParkMobile

Vonage

Domino’s Pizza

Gloo Mesh Feature Comparison

Service Mesh for Developers, Part 1: Exploring the Power of Observability and OpenTelemetry

Service Mesh at Scale

Compare Capabilities of the Top Service Mesh Platforms

Compare Capabilities of the Top API Gateways

Establishing zero trust security for modern cloud architectures

Unlocking the Power of Your API Gateway

API Gateways: Productivity, Resilience, and Security for Next-Generation Cloud Applications

Driving Business Value with Istio

Service Mesh Vendor Comparison

Istio Then & Now

4 Reasons Why You Need an AI Gateway

Gloo Gateway vs. Kong

Gloo Gateway vs. Apigee

3 Reasons You Need an API Gateway for Microservices Apps

Solo Academy Course: Service Mesh Basics

Solo Academy Course: Istio Basics

Solo Academy Course: Envoy Basics

Solo Academy Course: API Gateway Basics

Solo Academy Course: Get Started with Istio Service Mesh

Solo Academy Course: Introduction to Envoy Proxy

Solo Academy Course: Deploying Istio for Production

Kgateway Lab: Integrating kgateway with Istio at Ingress

Kgateway Lab: Kgateway as a Waypoint

Kgateway AI Lab: Consumption Reporting

Kgateway AI Lab: Deploying kgateway as an AI Gateway

Kagent Lab: How to build an AI agent

Kagent Lab: Integrate tools from MCP servers with kagent

Gloo AI Gateway Hands-On Lab: Semantic Caching

Kgateway AI Lab: Credentials Management

Kgateway AI Lab: Prompt Enrichment

Kgateway AI Lab: Prompt Guards

Ambient Mesh Lab: Migrating from Sidecar to Sidecarless

Ambient Mesh Lab: Multi-cluster scalability with Istio Ambient Mesh

Solo Lab: Gloo Cloud Preview

Ambient Mesh Lab: Waypoints for Traffic management, Security and Observability

Kgateway Lab: Gateway API inference extensions with kgateway