[WIP, do not review] Polling-based asynchronous verbose mode for execution time-tracking by avmanerikar · Pull Request #5102 · uxlfoundation/oneDNN

avmanerikar · 2026-04-30T18:19:23Z

Description

This PR is an alternative approach to the callback-based asynchronous verbose mode implemented in PR #4187 - the method relies on a periodic event polling to record and log profiling info. The motivation to using this approach is to have a unified implementation for the asynchronous mode that does not rely on runtime-specific callback APIs.

The implementation addresses points that the callback-based approach fails to triage:

Event-polling does not rely on runtime-specific callback API (required for supporting L0 runtime).
The callback method does not triage MFDNN-14331 for SYCL graphs.

Failing reproducer for SYCL Graph:

DNNL_VERBOSE=1 .\tests\benchdnn\benchdnn.exe --eltwise --engine=gpu --execution-mode=graph 1

Extension to CPU Threadpool is difficult with callback-based method.

This prototype is implemented for OpenCL GPU runtime.

avmanerikar added 7 commits April 29, 2026 10:05

common: verbose: add flag to force disable async verbose mode

5e23012

common: stream: define async verbose profiler api for streams

6a55d24

common: stream: enable stream profiler when in verbose profiling mode

331098e

gpu: ocl: enable verbose profiler for opencl interops

b7316bf

common: stream: add non-blocking profiler in verbose exec mode

752d517

xpu: stream_profiler: define api for async verbose profiling

9e34556

gpu: ocl: add async profiling api for ocl streams

66933f7

avmanerikar requested review from a team as code owners April 30, 2026 18:19

avmanerikar marked this pull request as draft April 30, 2026 18:19

github-actions Bot added platform:gpu-intel Codeowner: @oneapi-src/onednn-gpu-intel component:common labels Apr 30, 2026

gpu: ocl: enable async verbose profiling for ocl streams

fdc347c

avmanerikar force-pushed the amanerik/main/async-verbose-mode-polling branch from 4974389 to fdc347c Compare April 30, 2026 18:24

avmanerikar mentioned this pull request May 4, 2026

[GPU] a non-blocking, profiler-based verbose mode for execution time tracking #4187

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP, do not review] Polling-based asynchronous verbose mode for execution time-tracking#5102

[WIP, do not review] Polling-based asynchronous verbose mode for execution time-tracking#5102
avmanerikar wants to merge 8 commits intomainfrom
amanerik/main/async-verbose-mode-polling

avmanerikar commented Apr 30, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

avmanerikar commented Apr 30, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Failing reproducer for SYCL Graph:

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

avmanerikar commented Apr 30, 2026 •

edited

Loading