- Define a tool that provides an image to the SDK
- Have the agent call it in a model that supports it (e.g. Haiku)
- 🐛 the agent can't see the image
Specifically for us this happens here: https://github.com/microsoft/vscode/blob/a1803b2a0c88b4a22ff83262b132cca1befc0886/src/vs/platform/agentHost/node/copilot/copilotAgentSession.ts#L592
This is the image being returned: image.zip
Here is our log dump which includes the copilot CLI events.jsonl. You can see the binary results in that log:
ah-logs-Open-apple.com,-take-a-screenshot,-and-t.zip
And just for edification, here is the exact image that was returned, a valid jpeg: image.zip
Upstream issue: microsoft/vscode#315947
Specifically for us this happens here: https://github.com/microsoft/vscode/blob/a1803b2a0c88b4a22ff83262b132cca1befc0886/src/vs/platform/agentHost/node/copilot/copilotAgentSession.ts#L592
This is the image being returned: image.zip
Here is our log dump which includes the copilot CLI events.jsonl. You can see the binary results in that log:
ah-logs-Open-apple.com,-take-a-screenshot,-and-t.zip
And just for edification, here is the exact image that was returned, a valid jpeg: image.zip
Upstream issue: microsoft/vscode#315947