For two years, the AI security community has been warning that model files, especially the community-hosted kind you pull from Hugging Face with a single git-lfs command, are an untrusted distribution channel that deserves the same scrutiny as container images or npm packages. The argument has always been theoretical. Sure, a model file is just bytes, someone could stuff something bad into it, but what is the actual exploitation primitive? Nobody had a practical, CVSS-9.8-severity answer.
This week, they do. CVE-2026-5760 is a command injection flaw in SGLang, the Berkeley/LMSys inference server that a lot of production AI workloads run on. The exploit chain is so short that the mitigation is almost less interesting than the attack surface it reveals.
How the Exploit Works
SGLang, like most modern inference frameworks, reads a chat_template field out of the GGUF model metadata. The chat_template is a Jinja2 template that tells the inference server how to format conversational turns into the token sequence the model was trained on. It is a convenient abstraction. It is also a Turing-complete rendering layer that runs on your infrastructure.
The vulnerable code path renders the chat_template through a plain jinja2.Environment(). Jinja2 ships an ImmutableSandboxedEnvironment specifically for cases where the template is attacker-controlled, because Jinja2 templates can, by design, access Python built-ins if you let them. SGLang did not use the sandboxed environment. The chat_template, which came from an untrusted model file that was downloaded from a public model registry, was treated as trusted template code.
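The difference is easy to demonstrate. Here is a minimal sketch (not the SGLang code itself) showing how a plain jinja2.Environment resolves a classic dunder-traversal probe that the sandboxed environment refuses at render time; the probe string is a well-known SSTI signature, not a working exploit:

```python
# Minimal sketch: unsandboxed vs. sandboxed Jinja2 rendering of an
# attacker-style template. The probe below is a standard SSTI test
# string, not a full exploit payload.
from jinja2 import Environment
from jinja2.sandbox import ImmutableSandboxedEnvironment, SecurityError

probe = "{{ ''.__class__.__mro__ }}"

# A plain Environment happily walks dunder attributes into the
# Python object model.
rendered = Environment().from_string(probe).render()
assert "object" in rendered

# The sandboxed environment raises SecurityError on the same template.
try:
    ImmutableSandboxedEnvironment().from_string(probe).render()
    raise AssertionError("sandbox should have blocked this")
except SecurityError:
    pass
```

Swapping the constructor is a one-line fix, which is why the distribution channel, not the patch, is the interesting part of this story.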
The trigger is a request to the /v1/rerank endpoint using a Qwen3 reranker phrase in the prompt. When the endpoint processes the request, it renders the chat_template. The attacker's Jinja2 payload executes arbitrary Python in the SGLang process, which in most deployments is running as a service account with enough privileges to reach the GPU, the model cache, and the surrounding container's network interfaces. From there, the exploit looks like any other container escape or credential harvest that has been in the IR playbook for a decade.
Why the Distribution Channel Is the Real Story
Before this CVE, a skeptic could reasonably ask: even if a model file could in principle carry an exploit, what is the realistic vector? The answer used to be "someone would have to upload a malicious model to Hugging Face and get a victim to download it." That answer was not taken seriously because the vector looks like a watering-hole attack, and watering-hole attacks against random downloaders looked low-yield.
It is not a low-yield vector anymore. Here is what has changed in the past two years.
Community model consumption has industrialized. Most teams running inference in production are pulling at least some of their models from Hugging Face, not just from vendors' official weights. Fine-tuned variants, quantized GGUF conversions, LoRA adapters, reranker heads, embedding models, the long tail is enormous. A lot of these are published by individuals or small teams whose security posture is indistinguishable from a random npm publisher.
Inference server ingestion is automated. A model pull is typically a CI/CD step or a scheduled refresh, not a manual download that a human reviews. If the pull pipeline is not pinning to content hashes, the community publisher can update the model under a static tag, and the inference server will silently reload.
The rendering surface is larger than the weights. GGUF metadata, tokenizer configs, chat_templates, preprocessor configs, and tool definitions all travel with the model file. Most of these are executed or rendered by the runtime. Any one of them is an injection surface.
So the theoretical "malicious model upload" vector is now operationally "a maintainer of a popular fine-tune gets compromised, pushes an updated GGUF with a crafted chat_template, and thousands of inference servers worldwide run attacker-controlled Python within the next pull cycle." The Axios npm worm from last month shows that the underlying compromise mechanics, attacker gets a maintainer token, pushes malicious update, downstream consumers execute on pull, are routine. The AI supply chain just got its equivalent vector.
What to Actually Do This Week
Three things, in order of urgency.
One, patch SGLang to the fixed release. Check the SGLang advisory for the specific patched version. If you cannot patch immediately, take the /v1/rerank endpoint off the public interface and gate it behind internal auth, because the Qwen3 reranker phrase trigger is public.
Two, scan every chat_template in every model you currently have loaded for Jinja2 SSTI patterns. The signatures are well-known: any reference to __class__, __mro__, __subclasses__, __import__, __builtins__, or other dunder traversal inside a template string. A regex scanner will catch the low-effort exploits. For anything more sophisticated, render the template offline in a truly sandboxed environment and compare the output against what the template should produce. This check costs you an afternoon and gives you a clean inventory of which models you can currently trust.
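The regex triage pass can be as simple as the sketch below. The signature list covers the dunder names named above; a clean result means "no low-effort exploit", not "safe", so treat it as a filter before the offline sandboxed render, not a substitute for it:

```python
# Sketch of a regex triage scanner for chat_template strings pulled
# from model metadata. Catches known dunder-traversal SSTI signatures;
# a non-match does NOT prove the template is safe.
import re

SSTI_SIGNATURES = re.compile(
    r"__class__|__mro__|__subclasses__|__import__|__builtins__"
)

def flag_template(template: str) -> bool:
    """Return True if the template contains a known SSTI signature."""
    return bool(SSTI_SIGNATURES.search(template))

# A typical benign chat template just iterates over messages.
benign = "{% for m in messages %}{{ m.role }}: {{ m.content }}\n{% endfor %}"
hostile = "{{ ''.__class__.__mro__[1].__subclasses__() }}"

assert not flag_template(benign)
assert flag_template(hostile)
```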
Three, pin your model supply chain. Every production inference workload should pull models by content hash, not by tag. The Hugging Face CLI supports this. Your CI pipeline should fail loudly if a pinned hash no longer resolves, because that means the maintainer force-pushed the tag, which means you are looking at either a compromise or a version bump that needs human review. Move to an internal model mirror for anything load-bearing. The same discipline you apply to container images applies to models, and the industry is about twelve months late to getting there.
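The fail-loud part of that pipeline is a few lines of standard-library Python. This is a sketch of a CI gate under the assumption that you record a SHA-256 digest for each model artifact at review time; the pinned value below is a placeholder, not a real hash:

```python
# Sketch: fail-loud content-hash check for a model artifact in CI.
# The pinned digest is recorded by a human at review time; the value
# shown here is a placeholder.
import hashlib
import pathlib
import sys

def sha256_of(path: pathlib.Path) -> str:
    """Stream the file through SHA-256 in 1 MiB chunks."""
    h = hashlib.sha256()
    with path.open("rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):
            h.update(chunk)
    return h.hexdigest()

def verify(path: pathlib.Path, pinned: str) -> None:
    """Exit nonzero on mismatch so the CI job fails loudly."""
    actual = sha256_of(path)
    if actual != pinned:
        sys.exit(f"model hash mismatch: expected {pinned}, got {actual}")
```

A mismatch then stops the deploy and forces the human review the article describes, whether the cause is a compromise or a legitimate version bump.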
The Durable Lesson
The 2023-2025 AI security conversation was dominated by prompt injection and model behavior, which are real problems but are the ones you get after the model is running. The 2026 conversation is going to be dominated by the question of whether the model you loaded is the one you think you loaded, and whether the substrate around it (tokenizers, templates, preprocessing, tool definitions, code-execution sandboxes) is safe to evaluate. This CVE is the first public demonstration that the substrate is an attack surface with practical exploitation and a mature distribution channel waiting to carry the exploit.
If your AI platform strategy assumes models are data, you are behind. They are executable data. Treat them accordingly.
Gigia Tsiklauri is a Security Architect and founder of Infosec.ge. Get in touch if you want the uncomfortable conversation about your model supply chain before an incident writes it for you.