ollama - 💡(How to fix) Fix Bug: qwen3.5:397b-cloud hangs on /v1/chat/completions endpoint [1 comments, 2 participants]

ollama2026-03-07 23:51:40

ON THIS PAGE

Recommended Tools

×6

Utilities matched from this issue’s tags and category — try them while you read without losing context.

GitHub issue graph ai analysis

Paste a GitHub issue URL. We fetch that issue, discover linked issues from bodies/comments/timeline, collect linked pull requests, and produce a structured English report.

The report is written in English Markdown for sharing and archival.

GitHub issue URL

Helpful · Quick feedback

GitHub stats

ollama/ollama#14698•Fetched 2026-04-08 00:32:49

View on GitHub

Comments

Participants

Timeline

Reactions

Author

oggixx

Participants

oggixx

rick-github

Timeline (top)

closed ×1commented ×1labeled ×1

The model qwen3.5:397b-cloud hangs on the /v1/chat/completions endpoint while /api/generate works fine.

Error Message

Request hangs/times out with 500 Internal Server Error

/v1/chat/completions hangs/times out (500 Internal Server Error)

Root Cause

The model qwen3.5:397b-cloud hangs on the /v1/chat/completions endpoint while /api/generate works fine.

Code Example

# Works:
curl -X POST http://localhost:11434/api/generate -d '{"model": "qwen3.5:397b-cloud", "prompt": "Hi"}'

# Hangs/Timeouts:
curl -X POST http://localhost:11434/v1/chat/completions -d '{"model": "qwen3.5:397b-cloud", "messages":[{"role":"user","content":"Hi"}]}'

RAW_BUFFERClick to expand / collapse

Model

qwen3.5:397b-cloud

Description

The model qwen3.5:397b-cloud hangs on the /v1/chat/completions endpoint while /api/generate works fine.

Steps to Reproduce

Start Ollama with qwen3.5:397b-cloud model loaded
Send POST request to /v1/chat/completions
Request hangs/times out with 500 Internal Server Error

Expected Behavior

Chat completion should return within reasonable time

Actual Behavior

/api/generate works fine
/v1/chat/completions hangs/times out (500 Internal Server Error)
Other models (qwen3.5:cloud, minimax-m2.5:cloud) work fine on same server with both endpoints

Investigation

# Works:
curl -X POST http://localhost:11434/api/generate -d '{"model": "qwen3.5:397b-cloud", "prompt": "Hi"}'

# Hangs/Timeouts:
curl -X POST http://localhost:11434/v1/chat/completions -d '{"model": "qwen3.5:397b-cloud", "messages":[{"role":"user","content":"Hi"}]}'

Notes

Model size: 397b parameters
Same model works on /api/generate but not on /v1/chat/completions
Other models work on both endpoints without issues

Created via GitHub API

extent analysis

Fix Plan

The fix involves modifying the /v1/chat/completions endpoint to handle large models like `qwen

Vote matrix · Quick signals

Works

Did the solution work? Tap to confirm.

Easy Fix

Was it a quick fix?

Time Saver

Did it save you time?

Blocking

Was it severely blocking?

Common Issue

Are others likely hitting this too?

Flaky / Intermittent

Is it intermittent?

Verified / Reproducible

Can you reproduce it reliably?

#api #ssr #installation #tensor shape #autograd error #GPU compatibility #latency issue #model loading #dependency error

Still need to ship something?

×6

Another batch ranked right after the header list — different links, same matching logic.

Data

Security

Network

Code

UI/UX

Text

System

Multimedia

Protocol

API

Engineering

ollama - 💡(How to fix) Fix Bug: qwen3.5:397b-cloud hangs on /v1/chat/completions endpoint [1 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Code Example

Model

Description

Steps to Reproduce

Expected Behavior

Actual Behavior

Investigation

Notes

extent analysis

Fix Plan

Still need to ship something?

TRENDING

ollama - 💡(How to fix) Fix Bug: qwen3.5:397b-cloud hangs on /v1/chat/completions endpoint [1 comments, 2 participants]

Recommended Tools

GitHub issue graph ai analysis

Error Message

Root Cause

Code Example

Model

Description

Steps to Reproduce

Expected Behavior

Actual Behavior

Investigation

Notes

extent analysis

Fix Plan

Still need to ship something?

RELATED_DISCOVERY

TRENDING