Skip to content

Plugin: transcribe inbound audio before Codex turns#103

Open
Yehonal wants to merge 1 commit into
pwrdrvr:mainfrom
Yehonal:pr/inbound-audio-transcription
Open

Plugin: transcribe inbound audio before Codex turns#103
Yehonal wants to merge 1 commit into
pwrdrvr:mainfrom
Yehonal:pr/inbound-audio-transcription

Conversation

@Yehonal
Copy link
Copy Markdown

@Yehonal Yehonal commented Apr 18, 2026

Summary

  • add optional inbound audio transcription config for the plugin
  • transcribe inbound audio/voice attachments before starting Codex turns
  • inject transcript text into turn input, preserving captions when present
  • document config and add tests for the new behavior

Why

This lets Telegram/Discord-style voice or audio attachments reach Codex as useful text input instead of only raw media placeholders, while keeping the feature optional and command-driven.

Notes

  • implemented as plugin-level pre-processing
  • configured via inboundAudioTranscription
  • supports {path}, {mimeType}, and {fileName} placeholders in args

Testing

  • added controller tests covering audio-only inbound messages
  • added controller tests covering audio with caption

@Yehonal Yehonal force-pushed the pr/inbound-audio-transcription branch from 2b3af10 to babd601 Compare April 24, 2026 14:04
@Yehonal
Copy link
Copy Markdown
Author

Yehonal commented Apr 30, 2026

Superseded by integration/yehonal-cas-stack reconciliation; closing this PR to avoid duplicate history.

@Yehonal Yehonal closed this Apr 30, 2026
@Yehonal Yehonal deleted the pr/inbound-audio-transcription branch April 30, 2026 05:49
@Yehonal Yehonal restored the pr/inbound-audio-transcription branch April 30, 2026 15:53
@Yehonal Yehonal reopened this Apr 30, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant