Skip to content

SDK 0.3.0 does not support images in tool results' binaryResultsForLlm #1298

@connor4312

Description

@connor4312
  1. Define a tool that provides an image to the SDK
  2. Have the agent call it in a model that supports it (e.g. Haiku)
  3. 🐛 the agent can't see the image

Specifically for us this happens here: https://github.com/microsoft/vscode/blob/a1803b2a0c88b4a22ff83262b132cca1befc0886/src/vs/platform/agentHost/node/copilot/copilotAgentSession.ts#L592

Image

This is the image being returned: image.zip

Here is our log dump which includes the copilot CLI events.jsonl. You can see the binary results in that log:

ah-logs-Open-apple.com,-take-a-screenshot,-and-t.zip

And just for edification, here is the exact image that was returned, a valid jpeg: image.zip

Image

Upstream issue: microsoft/vscode#315947

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions