DeepSeek R1 is a new model from Chinese startup DeepSeek, released on January 20th, with impressive reasoning ability that rivals (or beats) the existing leading reasoning models (e.g., o1 from OpenAI). It is available from a few different providers, but from my perspective the most interesting thing is that it's open source and available in distilled form with similar performance (i.e., distilled into Qwen or Llama models, which can be self-hosted).
One question we see (from our users as well as the broader public) is "Is DeepSeek safe?" Sure, DeepSeek is hosted by a Chinese company, so many organizations (and even some governments) may have questions about that, but the real concern is your data: what happens to it, and what should or should not be sent. In that respect, it doesn't matter whether it's DeepSeek or any other public LLM provider. As an enterprise organization, you have to be careful what you do with your sensitive data. You need tight control and audit trails for your data, or you risk significant harm to your company from slip-ups.
This blog covers some ways to enable tighter control and auditing of your interactions with DeepSeek.
Establishing Control
Application developers building apps that leverage LLMs are the first line of defense when establishing control over LLM usage. These developers must take responsibility for the data they send and receive: detecting, scrubbing, or outright blocking sensitive content. They can adopt content-safety or guardrail systems to help with that, as in the sketch below. They must also connect securely to any LLMs they use, which includes keeping client IDs, secrets, and authorization tokens safe. Code reviews and responsible coding practices help here, but this is just one line of defense.
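As a rough illustration of that first line of defense, here is a minimal sketch of application-level prompt scrubbing in Python. The regex patterns and the `scrub_prompt` helper are purely illustrative; a real deployment would lean on a dedicated PII/DLP library or a content-safety service rather than a handful of regular expressions.

```python
import re

# Illustrative patterns only -- real deployments should use a proper
# PII/DLP library or content-safety service, not a few regexes.
PII_PATTERNS = {
    "email": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "ssn": re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
    "credit_card": re.compile(r"\b(?:\d[ -]?){13,16}\b"),
}

def scrub_prompt(prompt: str) -> str:
    """Redact obvious PII before the prompt ever leaves the application."""
    for label, pattern in PII_PATTERNS.items():
        prompt = pattern.sub(f"<redacted:{label}>", prompt)
    return prompt

user_input = "My card is 4111 1111 1111 1111, email me at jane@example.com"
print(scrub_prompt(user_input))
# -> My card is <redacted:credit_card>, email me at <redacted:email>
```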
Whenever you consider security and compliance, you must think in terms of "defense in depth". The organizations we work with at Solo.io need more options for controlling these LLM interactions than application-level checks alone.
Establishing an intermediary that acts as an out-of-application way to observe, audit, and control these interactions is crucial. This can be done with middleware like an API gateway or, better, a dedicated AI gateway. These AI gateways can implement security and WAF (IP-based) controls, apply policy around which LLMs can be used, secure the connections to them, and, most importantly, provide a second line of defense for guardrails. These guardrails can inspect a request and determine whether it should be allowed (or, conversely, inspect a response from an LLM and determine whether it should be passed back).
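To make that concrete, here is a sketch of what an application call can look like when everything flows through such an intermediary. The gateway hostname, route, model name, and the `INTERNAL_AI_TOKEN` environment variable are hypothetical placeholders, not any particular product's API; the point is that the app authenticates with an org-issued credential and never holds the provider API key, which the gateway injects upstream.

```python
import os
import requests

# Placeholder host/route for whatever your AI gateway exposes.
GATEWAY_URL = "https://ai-gateway.internal.example.com/v1/chat/completions"

def ask_llm(prompt: str) -> str:
    resp = requests.post(
        GATEWAY_URL,
        headers={
            # The app presents an org-issued token; the gateway holds the
            # actual provider API key and attaches it to the upstream call.
            "Authorization": f"Bearer {os.environ['INTERNAL_AI_TOKEN']}",
        },
        json={
            # Model name depends on how the gateway route is configured.
            "model": "deepseek-reasoner",
            "messages": [{"role": "user", "content": prompt}],
        },
        timeout=60,
    )
    resp.raise_for_status()
    return resp.json()["choices"][0]["message"]["content"]
```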
This intermediary also allows an organization to implement a "kill-switch"-style architecture should a vulnerability be discovered: it is far easier to apply new guardrails quickly at the intermediary than to go to each application team and ask them to update their code.
In the case of DeepSeek R1, we also have the option to take the open source model (or one of its distilled variants) and run it ourselves, that is, in our own datacenters or VPCs and under our control. We will still want guardrails in place, but running it ourselves means the data sent to and from the model stays under our control. We see some of our larger enterprise customers doing this by building out their own GPU infrastructure to run these types of models.
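Assuming the distilled model is served behind an OpenAI-compatible endpoint (for example, via vLLM or Ollama), calling it from inside your own network can look like the sketch below; the in-cluster service name and model tag are illustrative.

```python
from openai import OpenAI

# Assumed: an OpenAI-compatible serving endpoint inside the cluster.
# The service DNS name and model tag are illustrative placeholders.
client = OpenAI(
    base_url="http://deepseek-r1.ai-models.svc.cluster.local:8000/v1",
    api_key="not-used-for-in-cluster-traffic",
)

response = client.chat.completions.create(
    model="deepseek-r1:7b",
    messages=[{"role": "user", "content": "Summarize our Q3 incident report."}],
)
print(response.choices[0].message.content)
```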
Putting this into Practice
We can use an AI gateway like Gloo Gateway to help implement these intermediary controls for security, prompt guarding, and routing/failover to either the public DeepSeek R1 or our own self-hosted R1 model.
To demonstrate this, we will deploy the AI gateway on GKE along with a locally hosted deepseek-r1:7b model. We will use the NVIDIA GPU Operator to configure the GPUs and containerd so that GPU-enabled workloads can take advantage of the accelerators.
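For reference, here is a hedged sketch of how such a workload might be declared with the Kubernetes Python client. The namespace, labels, and serving image are assumptions for illustration (a real setup also needs model storage and a model-pull step); the important detail is the `nvidia.com/gpu` resource limit, which the GPU Operator makes schedulable.

```python
from kubernetes import client, config

config.load_kube_config()

# Assumed namespace, labels, and serving image -- illustrative only.
deployment = client.V1Deployment(
    metadata=client.V1ObjectMeta(name="deepseek-r1", namespace="ai-models"),
    spec=client.V1DeploymentSpec(
        replicas=1,
        selector=client.V1LabelSelector(match_labels={"app": "deepseek-r1"}),
        template=client.V1PodTemplateSpec(
            metadata=client.V1ObjectMeta(labels={"app": "deepseek-r1"}),
            spec=client.V1PodSpec(
                containers=[
                    client.V1Container(
                        name="ollama",
                        image="ollama/ollama:latest",
                        resources=client.V1ResourceRequirements(
                            # The GPU Operator exposes accelerators as this
                            # schedulable resource on GPU nodes.
                            limits={"nvidia.com/gpu": "1"},
                        ),
                    )
                ]
            ),
        ),
    ),
)

client.AppsV1Api().create_namespaced_deployment(
    namespace="ai-models", body=deployment
)
```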
We will see how to use the powerful security and guardrail capabilities of Gloo AI Gateway to implement control and observability when calling DeepSeek models, whether they are publicly hosted or self-hosted.
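Conceptually, the routing/failover behavior we want from the gateway looks like the sketch below: most traffic goes to the self-hosted model, with the public endpoint as a weighted alternative and a failover target. This is not Gloo Gateway's configuration or implementation, just the idea expressed in Python with placeholder URLs and weights.

```python
import random
import requests

# Placeholder upstreams: a self-hosted model in-cluster and a public endpoint.
UPSTREAMS = [
    {"url": "http://deepseek-r1.ai-models.svc.cluster.local:8000/v1/chat/completions", "weight": 80},
    {"url": "https://api.deepseek.com/v1/chat/completions", "weight": 20},
]

def route(payload: dict) -> requests.Response:
    # Weighted pick: most traffic stays on the self-hosted model.
    primary = random.choices(
        UPSTREAMS, weights=[u["weight"] for u in UPSTREAMS], k=1
    )[0]
    candidates = [primary] + [u for u in UPSTREAMS if u is not primary]
    last_error = None
    for upstream in candidates:  # fail over to the next upstream if a call fails
        try:
            resp = requests.post(upstream["url"], json=payload, timeout=30)
            resp.raise_for_status()
            return resp
        except requests.RequestException as err:
            last_error = err
    raise RuntimeError("all DeepSeek upstreams failed") from last_error
```

The weights here simply express a preference for keeping data on the self-hosted model while retaining the public endpoint as a fallback; in practice this logic lives in the gateway's routing configuration, not in application code.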
Check out this 5-minute demo to see the following (source code available):
- Running a self-hosted deepseek-r1:7b model on GKE with NVIDIA L4 GPUs and the NVIDIA GPU Operator
- Running a mature, powerful OSS AI gateway capable of applying guardrails as an intermediary
- Securing traffic to DeepSeek using your own security mechanisms instead of provider API keys
- Routing/splitting traffic to the local DeepSeek instead of the public one, without clients knowing
Conclusion
DeepSeek is a powerful model, but just like any model hosted by a provider, you should be very wary of what data and information you send to it. DeepSeek may be safe, but who knows? Who knows about any LLM provider? You should implement strict security, observability, and guardrail systems before you use any LLM provider. An AI gateway like Gloo Gateway helps you implement a safe "kill-switch"-style architecture that gives you defense in depth.