
[Breaking] Remove Tensor backend generic and add high-level Device struct#4717

Draft
laggui wants to merge 3 commits into main from refactor/backend/tensor

Conversation

laggui (Member) commented Apr 2, 2026

Pull Request Template

Checklist

  • Confirmed that cargo run-checks command has been executed.
  • Made sure the book is up to date with changes in this PR.

Related Issues/PRs

Changes

Problem: Every piece of user code that uses tensors must carry a B: Backend type parameter, which propagates through every struct, function, and trait impl in a project:

```rust
// Before: B infects the entire call stack
pub struct Model<B: Backend> { layer: nn::Linear<B>, ... }
impl<B: Backend> Model<B> {
    pub fn forward(&self, x: Tensor<B, 3>) -> Tensor<B, 2> { ... }
}
fn run<B: AutodiffBackend>(device: B::Device) { ... }
```

This creates three concrete problems:

  1. Boilerplate: Every library/app must expose <B: Backend> generics or lock users into one backend.
  2. No runtime dispatch: Backend is compile-time only; can't fall back from GPU→CPU when hardware is unavailable, and switching devices requires going through TensorData manipulations (which is really meant to be a data representation, not a device-transfer mechanism).
  3. Autodiff coupling: Training requires a separate Autodiff<B> wrapper type, making the backend type even more complex.

Solution: Two changes working together:

  • burn-dispatch crate (landed in [Feat] Global backend Dispatch #4508) provides a single concrete Dispatch backend that implements the Backend trait via compile-time enum dispatch over all enabled backends. Backends are still behind feature flags, so users enable only what they need. DispatchDevice and DispatchTensor are enums over per-backend device/tensor types, so the actual backend is selected at runtime from the enum variant while the type system sees only Dispatch.

  • Remove B from Tensor: Since Dispatch is the one backend for user-facing code, Tensor<B, D, K> becomes Tensor<D, K>. Autodiff is now a property of the Device rather than a type parameter — call .autodiff() on any device to opt into gradient tracking.

```rust
// After: no backend generic anywhere in user code
let device = Device::default();            // auto-selects best available backend
let device = Device::default().autodiff(); // enables gradient tracking

let x = Tensor::<2>::zeros([3, 4], &device);
```
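As a rough mental model of autodiff becoming a device property rather than a type parameter, one might sketch something like the following (illustrative names and fields only, not the PR's actual Device implementation):

```rust
// Illustrative stand-in only: models autodiff as a boolean property of the
// device instead of an Autodiff<B> wrapper type. None of these fields are
// the actual burn API.
#[derive(Debug, Clone, PartialEq)]
pub struct Device {
    pub backend: &'static str,
    pub autodiff: bool,
}

impl Device {
    pub fn default_device() -> Self {
        // Stand-in for backend auto-selection.
        Device { backend: "cpu", autodiff: false }
    }

    // Opting into gradient tracking flips a flag on the device;
    // the tensor type no longer changes.
    pub fn autodiff(mut self) -> Self {
        self.autodiff = true;
        self
    }
}

fn main() {
    let device = Device::default_device().autodiff();
    println!("{} autodiff={}", device.backend, device.autodiff);
}
```

The point of the sketch is that training vs. inference is now a runtime choice on the device, so the same model code serves both.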

The DispatchDevice enum dispatches based on which Cargo feature flags are enabled (cuda, ndarray, vulkan, etc.). When only one backend feature is enabled, the compiler optimizes the match away entirely; with multiple backends enabled, the overhead is a cheap enum-tag match rather than vtable dispatch.
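This dispatch pattern can be illustrated with a self-contained sketch (variant names here are simplified stand-ins for the actual burn-dispatch types, and the real variants sit behind Cargo feature flags such as #[cfg(feature = "cuda")]):

```rust
// Simplified model of the DispatchDevice pattern: one enum variant per
// enabled backend. Variant names are illustrative, not the real types.
#[derive(Debug, Clone, PartialEq)]
pub enum DispatchDevice {
    NdArray,
    Cuda(usize), // device index
}

impl DispatchDevice {
    // Dispatch is a plain `match` on the enum tag. With only one backend
    // feature enabled the enum has a single variant and the branch is
    // optimized away; with several, it is a tag compare, not a vtable call.
    pub fn name(&self) -> String {
        match self {
            DispatchDevice::NdArray => "ndarray".to_string(),
            DispatchDevice::Cuda(i) => format!("cuda:{i}"),
        }
    }
}

fn main() {
    println!("{}", DispatchDevice::Cuda(0).name());
}
```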

Key benefits this unlocks:

  • Easy runtime switching between backend devices (e.g. CPU ↔ GPU) without TensorData round-trips.
  • Simpler development cycles — feature-gate the primitive to keep compile times fast while iterating.
  • A path toward making the primitive opaque, further improving compile times.
  • Docs and book will be updated separately to guide existing users through the migration.
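The first benefit, runtime fallback, could look like this in spirit (a hypothetical probe and hypothetical names; the real selection logic lives in burn-dispatch):

```rust
// Hypothetical sketch of GPU-to-CPU fallback at runtime. `gpu_available`
// stands in for a real device probe; neither it nor these variants are
// actual burn APIs.
#[derive(Debug, PartialEq)]
pub enum DeviceKind {
    Cpu,
    Gpu,
}

fn gpu_available() -> bool {
    // A real probe would query the driver; pretend none was found.
    false
}

// Because the backend is chosen from an enum variant at runtime, falling
// back is an ordinary conditional, with no TensorData round-trip.
pub fn select_device() -> DeviceKind {
    if gpu_available() {
        DeviceKind::Gpu
    } else {
        DeviceKind::Cpu
    }
}

fn main() {
    println!("{:?}", select_device());
}
```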

Testing

Backend tests were migrated to use Dispatch in #4666 and validate correctness across all backends.


Development

Successfully merging this pull request may close these issues.

Default backend implementation
