ollama - 💡(How to fix) Fix GGUF/GGML parser runs on untrusted model files — native parsing of attacker-controlled binary data

StepCodex · 2026-05-25T17:14:04Z

[ollama] Model files downloaded from registries are parsed by GGUF/GGML native parsers ggml.Decode , gguf.Open . These parsers handle binary file formats that… Model files downloaded from registries are parsed by GGUF/GGML native parsers (`ggml.Decode`, `gguf.Open`). These parsers handle binary file formats that include: - Variable-length arrays - Nested tensor metadata - Custom key-value stores - Raw tensor data offsets If a malicious model file contains crafted binary data (e.g., integer overflows in array sizes, buffer over-reads in string parsing, invalid offsets), the parser could crash or exhibit undefined behavior. While this is a client-side risk (the user chose to pull the model), the default registry (`ollama.com`) and the ability to configure custom registries means users may trust models from unknown sources. This is partially mitigated by SHA256 digest verification, which ensures the file hasn't been tampered with in transit — but does NOT protect against a malicious publisher who created the model with a valid digest. ## CWE-120: GGUF/GGML Parser Runs on Untrusted Model Files — Native Code Attack Surface ### Severity: LOW-MEDIUM (CVSS 4.0) ### Location `llm/server.go`: ```go func LoadModel(model string, maxArraySize int) (*ggml.GGML, error) { f, _ := os.Open(model) ggml, _ := ggml.Decode(f, maxArraySize) return ggml, err } ``` `server/model.go`: ```go func parseFromModel(ctx context.Context, name model.Name, ...) { blob, _ := os.Open(blobpath) f, _ := ggml.Decode(blob, -1) } ``` `server/images.go`: ```go f, _ := gguf.Open(m.ModelPath) ``` ### Description Model files downloaded from registries are parsed by GGUF/GGML native parsers (`ggml.Decode`, `gguf.Open`). These parsers handle binary file formats that include: - Variable-length arrays - Nested tensor metadata - Custom key-value stores - Raw tensor data offsets If a malicious model file contains crafted binary data (e.g., integer overflows in array sizes, buffer over-reads in string parsing, invalid offsets), the parser could crash or exhibit undefined behavior. While this is a client-side risk (the user chose to pull the model), the default registry (`ollama.com`) and the ability to configure custom registries means users may trust models from unknown sources. This is partially mitigated by SHA256 digest verification, which ensures the file hasn't been tampered with in transit — but does NOT protect against a malicious publisher who created the model with a valid digest. ### Impact A carefully crafted GGUF file from an untrusted registry could crash the Ollama server, or in a worst-case scenario (parser memory corruption), potentially achieve code execution in the Ollama process. ### Remediation 1. Run GGUF/GGML parsing in a sandboxed subprocess 2. Add fuzz testing harness for the GGUF/GGML parser (AFL++ or Go fuzzing) 3. Validate critical fields before allocation (array sizes, string lengths, offsets) 4. Document the risk: "Only pull models from trusted publishers"

ollama2026-05-25 17:14:04

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

Model files downloaded from registries are parsed by GGUF/GGML native parsers (ggml.Decode, gguf.Open). These parsers handle binary file formats that include:

Variable-length arrays
Nested tensor metadata
Custom key-value stores
Raw tensor data offsets

If a malicious model file contains crafted binary data (e.g., integer overflows in array sizes, buffer over-reads in string parsing, invalid offsets), the parser could crash or exhibit undefined behavior. While this is a client-side risk (the user chose to pull the model), the default registry (ollama.com) and the ability to configure custom registries means users may trust models from unknown sources.

This is partially mitigated by SHA256 digest verification, which ensures the file hasn't been tampered with in transit — but does NOT protect against a malicious publisher who created the model with a valid digest.

Error Message

func LoadModel(model string, maxArraySize int) (*ggml.GGML, error) {

Root Cause

Model files downloaded from registries are parsed by GGUF/GGML native parsers (ggml.Decode, gguf.Open). These parsers handle binary file formats that include:

Variable-length arrays
Nested tensor metadata
Custom key-value stores
Raw tensor data offsets

Code Example

func LoadModel(model string, maxArraySize int) (*ggml.GGML, error) {
    f, _ := os.Open(model)
    ggml, _ := ggml.Decode(f, maxArraySize)
    return ggml, err
}

---

func parseFromModel(ctx context.Context, name model.Name, ...) {
    blob, _ := os.Open(blobpath)
    f, _ := ggml.Decode(blob, -1)
}

---

f, _ := gguf.Open(m.ModelPath)

RAW_BUFFERClick to expand / collapse

CWE-120: GGUF/GGML Parser Runs on Untrusted Model Files — Native Code Attack Surface

Severity: LOW-MEDIUM (CVSS 4.0)

Location

llm/server.go:

func LoadModel(model string, maxArraySize int) (*ggml.GGML, error) {
    f, _ := os.Open(model)
    ggml, _ := ggml.Decode(f, maxArraySize)
    return ggml, err
}

server/model.go:

func parseFromModel(ctx context.Context, name model.Name, ...) {
    blob, _ := os.Open(blobpath)
    f, _ := ggml.Decode(blob, -1)
}

server/images.go:

f, _ := gguf.Open(m.ModelPath)

Description

Model files downloaded from registries are parsed by GGUF/GGML native parsers (ggml.Decode, gguf.Open). These parsers handle binary file formats that include:

Variable-length arrays
Nested tensor metadata
Custom key-value stores
Raw tensor data offsets

Impact

A carefully crafted GGUF file from an untrusted registry could crash the Ollama server, or in a worst-case scenario (parser memory corruption), potentially achieve code execution in the Ollama process.

Remediation

Run GGUF/GGML parsing in a sandboxed subprocess
Add fuzz testing harness for the GGUF/GGML parser (AFL++ or Go fuzzing)
Validate critical fields before allocation (array sizes, string lengths, offsets)
Document the risk: "Only pull models from trusted publishers"

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

ollama - 💡(How to fix) Fix GGUF/GGML parser runs on untrusted model files — native parsing of attacker-controlled binary data

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Code Example

CWE-120: GGUF/GGML Parser Runs on Untrusted Model Files — Native Code Attack Surface

Severity: LOW-MEDIUM (CVSS 4.0)

Location

Description

Impact

Remediation

Still need to ship something?

TRENDING