fix: support image input in OpenAI Chat user messages by KaguraTart · Pull Request #26826 · anomalyco/opencode

KaguraTart · 2026-05-11T07:01:53Z

Issue for this PR

Closes #20802

Type of change

Bug fix
New feature
Refactor / code improvement
Documentation

What does this PR do?

Adds image input support in the OpenAI Chat protocol layer by converting MediaPart to OpenAI's image_url content block format in user messages.

Root cause: In packages/llm/src/protocols/openai-chat.ts, the lowerUserMessage function only accepted TextPart content. When a MediaPart was encountered, it returned an unsupportedContent error, preventing image attachments from reaching vision-capable models.

Fix:

- Added OpenAIChatTextContentBlock and OpenAIChatImageUrlContentBlock schemas
- Updated user message schema to accept string | ContentBlock[]
- Added lowerUserPart function that converts:
  - TextPart → { type: "text", text: "..." }
  - MediaPart → { type: "image_url", image_url: { url: "data:<mediaType>;base64,<data>" } }
- Updated lowerUserMessage to use content blocks when media is present
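
The conversion described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: the part types (`TextPart`, `MediaPart` with `mediaType` and base64 `data` fields) are simplified assumptions, the function signatures are hypothetical, and how multiple text parts are joined in the text-only case is a guess.

```typescript
// Hypothetical, simplified part types; the real definitions in packages/llm may differ.
type TextPart = { type: "text"; text: string };
type MediaPart = { type: "media"; mediaType: string; data: string }; // data: base64 payload (assumed)
type UserPart = TextPart | MediaPart;

// Content block shapes named in the PR description.
type OpenAIChatTextContentBlock = { type: "text"; text: string };
type OpenAIChatImageUrlContentBlock = { type: "image_url"; image_url: { url: string } };
type OpenAIChatContentBlock = OpenAIChatTextContentBlock | OpenAIChatImageUrlContentBlock;

type OpenAIChatUserMessage = { role: "user"; content: string | OpenAIChatContentBlock[] };

// Convert one part into one content block.
function lowerUserPart(part: UserPart): OpenAIChatContentBlock {
  if (part.type === "text") return { type: "text", text: part.text };
  // Media is inlined as a data URL: data:<mediaType>;base64,<data>
  return { type: "image_url", image_url: { url: `data:${part.mediaType};base64,${part.data}` } };
}

// Use content blocks only when at least one media part is present.
function lowerUserMessage(parts: UserPart[]): OpenAIChatUserMessage {
  const hasMedia = parts.some((p) => p.type === "media");
  if (!hasMedia) {
    // Text-only messages stay a plain string. Joining with "" is an assumption.
    const text = parts
      .filter((p): p is TextPart => p.type === "text")
      .map((p) => p.text)
      .join("");
    return { role: "user", content: text };
  }
  return { role: "user", content: parts.map(lowerUserPart) };
}
```

Keeping the plain-string form for text-only messages means requests without attachments look the same as before, which is presumably why the description says content blocks are used only "when media is present".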

This is a protocol-layer fix that complements the provider-layer fix in #21627. While #21627 addresses capability detection, this PR ensures the conversion logic at the protocol level correctly transforms media parts into the OpenAI-compatible format.

How did you verify your code works?

- All 16 unit tests pass, apart from 1 unrelated failure caused by a missing API key
- Added 3 new tests covering media handling:
  - prepares user message with media as image_url content block
  - prepares user message with mixed text and media
  - prepares user message with only text (no content blocks)
- Typecheck passes for all 14 packages
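
For orientation, the lowered user messages that the three new tests would plausibly assert on look something like the fixtures below. The type names, the sample text, and the base64 payload `AAAA` are illustrative assumptions, not values taken from the test file, and `contentKinds` is a hypothetical helper added only to make the shapes easy to compare.

```typescript
type ContentBlock =
  | { type: "text"; text: string }
  | { type: "image_url"; image_url: { url: string } };
type UserMessage = { role: "user"; content: string | ContentBlock[] };

// Case 1: media only -> a single image_url block (illustrative payload).
const mediaOnly: UserMessage = {
  role: "user",
  content: [{ type: "image_url", image_url: { url: "data:image/png;base64,AAAA" } }],
};

// Case 2: mixed text and media -> blocks in the original part order.
const mixed: UserMessage = {
  role: "user",
  content: [
    { type: "text", text: "What is in this image?" },
    { type: "image_url", image_url: { url: "data:image/png;base64,AAAA" } },
  ],
};

// Case 3: text only -> plain string, no content blocks.
const textOnly: UserMessage = { role: "user", content: "Hello" };

// Hypothetical helper: summarize the shape of a message's content.
function contentKinds(message: UserMessage): string[] {
  return typeof message.content === "string"
    ? ["string"]
    : message.content.map((block) => block.type);
}
```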

Checklist

I have tested my changes locally
I have not included unrelated changes in this PR

Comparison with #21627

#21627 fixes image support at the provider capability detection layer (1-line change in provider.ts).

This PR fixes image support at the protocol conversion layer in openai-chat.ts, ensuring MediaPart is correctly transformed to image_url content blocks. Both PRs address the same end goal but at different layers of the stack, and they are complementary.

Closes anomalyco#20802. Converts MediaPart to OpenAI image_url content block format in user messages.

github-actions · 2026-05-11T07:02:42Z

The following comment was made by an LLM, it may be inaccurate:

Related PR Found:

PR #21627: "fix(provider): enable image support for custom OpenAI-compatible providers"

Why it's related:
PR #21627 is the complementary provider-layer fix mentioned in the PR description. While this PR (26826) fixes the protocol conversion layer (transforming MediaPart to image_url content blocks in openai-chat.ts), PR #21627 handles capability detection at the provider level. Both PRs work together to enable complete image support for OpenAI models, but they address different layers of the stack. They are not duplicates—they are complementary fixes that should both be merged.

Copilot

Pull request overview

This PR fixes OpenAI Chat protocol request lowering to support multimodal user messages by translating MediaPart inputs into OpenAI Chat image_url content blocks, allowing image attachments to reach vision-capable OpenAI-compatible /chat/completions backends.

Changes:

Extended the OpenAI Chat request schema so user.content can be either a string or an array of {type: "text" | "image_url"} content blocks.
Implemented lowerUserPart / updated lowerUserMessage to convert MediaPart into image_url data URLs (base64), and emit content blocks when any media is present.
Added unit tests to cover media-only, mixed text+media, and text-only user message lowering behavior.
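
As a rough illustration of the widened schema, the check below accepts `user.content` as either a string or an array of `text` / `image_url` blocks. It is written as hand-rolled TypeScript type guards purely for illustration. The PR defines its schemas with the package's own schema tooling, which is not shown on this page, so all names here (`isContentBlock`, `isUserContent`, and so on) are hypothetical.

```typescript
type TextBlock = { type: "text"; text: string };
type ImageUrlBlock = { type: "image_url"; image_url: { url: string } };
type ContentBlock = TextBlock | ImageUrlBlock;
type UserContent = string | ContentBlock[];

function isRecord(value: unknown): value is Record<string, unknown> {
  return typeof value === "object" && value !== null;
}

// Accept only the two block kinds described above.
function isContentBlock(value: unknown): value is ContentBlock {
  if (!isRecord(value)) return false;
  if (value["type"] === "text") return typeof value["text"] === "string";
  if (value["type"] === "image_url") {
    const image = value["image_url"];
    return isRecord(image) && typeof image["url"] === "string";
  }
  return false;
}

// user.content: a plain string, or an array made up entirely of valid blocks.
function isUserContent(value: unknown): value is UserContent {
  return typeof value === "string" || (Array.isArray(value) && value.every(isContentBlock));
}
```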

Reviewed changes

Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.

File	Description
packages/llm/src/protocols/openai-chat.ts	Adds schemas for multimodal content blocks and lowers `MediaPart` into OpenAI Chat `image_url` blocks for user messages.
packages/llm/test/provider/openai-chat.test.ts	Adds/updates tests asserting correct request-body lowering for media-only, mixed, and text-only user messages.

wanick · 2026-05-27T18:30:53Z

This PR fixes a hard blocker for us — Kimi For Coding vision via static API token (employee subscription, no OAuth). The opencode-kimi-full plugin works but requires device-flow OAuth, which enterprise tokens can't use. The generic openai-compatible path is the only auth shape available to us, and it's broken on images until this lands.

Tested the same payload outside opencode (Roo Code in VS Code, same token, same endpoint) — vision works. Confirms it's the opencode adapter, exactly what this PR addresses.

Please prioritize merging 🙏

fix: support image input in OpenAI Chat user messages

c3f8a61

Copilot AI review requested due to automatic review settings May 11, 2026 07:01

KaguraTart mentioned this pull request May 11, 2026

fix(provider): enable image support for custom OpenAI-compatible providers #21627

Open

6 tasks

Copilot started reviewing on behalf of KaguraTart May 11, 2026 07:02

Merge branch 'dev' into fix/openai-chat-vision

a11fd3b

Copilot AI reviewed May 11, 2026

KaguraTart mentioned this pull request May 15, 2026

MiniMax provider cannot read local images - VLM tool call fails #26665

Closed

github-actions Bot mentioned this pull request May 26, 2026

fix: detect attachment mime from file contents #29442

Closed

wanick mentioned this pull request May 27, 2026

Custom OpenAI-compatible providers: image file attachments do not reach vision-capable models correctly #20802

Open

Merge branch 'dev' into fix/openai-chat-vision

170f205

KaguraTart wants to merge 3 commits into anomalyco:dev from KaguraTart:fix/openai-chat-vision

3 participants
