cudaPackages: introduce cudaLib and switch from backendStdenv to cudaStdenv by ConnorBaker · Pull Request #405751 · NixOS/nixpkgs

ConnorBaker · 2025-05-10T01:33:29Z

Important

Introduction of cudaLib has been split into #406531. This PR is closed as discussions about goals and implementation details of cudaStdenv need to happen first.

Signed-off-by: Connor Baker <[email protected]>

SomeoneSerge · 2025-05-12T15:57:14Z

IMO we should reserve the name cudaStdenv for when it's actually sufficient to build a package with CUDA support. The reason backendStdenv was not named that is because it isn't a "cuda stdenv". Let's brainstorm what's missing: having to simultaneously add nvcc and cudart (and the custom stdenv) is one part of the puzzle
I think we should make as many of the helpers we have "private" as we can: this applies to most of the content of cudaFlags (flags? let's delete one of these too) and I think should apply to cudaLib. By "making private" I mean changing the names to begin with __ and documenting them as private (subject to change, including in a backport) in the manual. EDIT: I see now you're not introducing it into the attribute tree yet, so it's not a concern I suppose?

ConnorBaker · 2025-05-12T16:24:24Z

IMO we should reserve the name cudaStdenv for when it's actually sufficient to build a package with CUDA support. The reason backendStdenv was not named that is because it isn't a "cuda stdenv". Let's brainstorm what's missing: having to simultaneously add nvcc and cudart (and the custom stdenv) is one part of the puzzle

Fair point -- I'm thinking:

{
  # NVCC is required to compile CUDA code.
  nativeBuildInputs = [ cuda_nvcc ];
  # The CUDA runtime is required by most CUDA applications and depends on the headers
  # provided by CCCL.
  buildInputs =
    [
      cuda_cudart
      cuda_cccl
    ]
    # CUDA compat is included when non-null and available on the host platform.
    ++ lib.optionals (cuda_compat != null && lib.availableOn stdenv.hostPlatform cuda_compat) [
      cuda_compat
    ];
}

I think we should make as many of the helpers we have "private" as we can: this applies to most of the content of cudaFlags (flags? let's delete one of these too) and I think should apply to cudaLib. By "making private" I mean changing the names to begin with __ and documenting them as private (subject to change, including in a backport) in the manual.

My intention with introducing cudaLib was to create a centralized library of functions and data necessary to create, modify, or extend CUDA package sets, both to float out commonalities in our packaging in-tree and to enable out-of-tree users to make use of our tooling without needing to vendor or duplicate a bunch of stuff we don't otherwise expose. I'm not opposed to making some of them private (or migrating them to lib proper), but I feel that doing that for the majority would be counter to what I want for cudaLib. Does that make sense, and/or do you have thoughts on how to expose a minimal public interface which still allows for re-use?

On the topic of flags, I've had a TODO hanging around for a while to remove cudaFlags -- do you have a preference for one name over the other? I like flags because it's short, but cudaFlags makes it more clear what it is in cases where it's inherited in package expressions outside of the cudaPackages scope. In the same vein, I'm supposing we'd call our standard environment cudaStdenv and not just stdenv. On the other hand, having the cuda prefix within the cudaPackages scope feels repetitive... but at least you know which scope it's coming from!

Signed-off-by: Connor Baker <[email protected]>

SomeoneSerge · 2025-05-12T16:49:44Z

Fair point -- I'm thinking:

Ja-ja, are we brainstorming already? Great. Here come some more considerations.

**Minor**:

- I'd move `propagatedBuildInputs = [ setupCudaHook ]` from NVCC to `*Stdenv` (unless cf. **Major**)
- It must be possible to use `*Stdenv` with NVC++ and Clang instead of NVCC

**Major**

This is in response to

> In the same vein, I'm supposing we'd call our standard environment `cudaStdenv` and not just `stdenv`

- The difference I anticipate for `backendStdenv` and `cudaPackages.stdenv` is that the latter should be fit for using in Nixpkgs' `overrideStdenv` and should treat `cuda{Support,Capabilities}` as part of the `hostPlatform`:
  - Rebuilding something non-CUDA, like `chromium`, with `cudaPackages.stdenv` should not trigger a rebuild (i.e. overlay reduces to a no-op and uses the modern stdenv's tools, libs, and hooks)
  - Building a real CUDA package, like `torch` and it must have a way to signal it "can be built with CUDA support" (NVCC in nativeBuildInputs? Maybe, I don't like this), enforces the compatible stdenv, prevents lib{c,stdc++} leakages, and adds NVCC & CUDA hooks.
  - `cudaPackages.stdenv.cudaSupport` can be used to test whether the platform we're building for "supports CUDA" at all; it can be used instead of `{ config, cudaSupport ? config.cudaSupport }` to test for "what's hypothetically supported" (this somehow needs to be disentangled from "whether the _current_ package is requested to be built with CUDA kernels on")

Onto `cudaLib`.

> a centralized library of functions and data necessary to create, modify, or extend CUDA package sets

:+1: I was thinking along the same lines (would you like some `libcudb.nix`?), I'm surely pro-change. But this time I want to clearly disclaim liabilities

> I like `flags` because it's short, but `cudaFlags` makes it more clear what it is in cases where it's inherited in package expressions outside of the `cudaPackages` scope

My vote is definitely `flags`: if we inherit outside the scope we can choose a longer local name.
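
The "longer local name" idea might look roughly like the sketch below. This is only an illustration, not code from the PR: the package itself, `pname`, `version`, and `src` are placeholders, and the use of `cmakeCudaArchitecturesString` assumes that attribute is exposed by `flags`.

```nix
# Hypothetical package expression defined outside the cudaPackages scope.
{
  lib,
  cmake,
  cudaPackages,
}:
let
  # Inside the scope the attribute is just `flags`; outside of it we are free
  # to bind it to a longer, more explicit local name.
  cudaFlags = cudaPackages.flags;
in
cudaPackages.backendStdenv.mkDerivation {
  pname = "example"; # placeholder
  version = "0.1.0"; # placeholder
  src = ./.; # placeholder source

  nativeBuildInputs = [
    cmake
    cudaPackages.cuda_nvcc
  ];
  buildInputs = [ cudaPackages.cuda_cudart ];

  cmakeFlags = [
    # Assumes `flags` provides `cmakeCudaArchitecturesString`.
    (lib.cmakeFeature "CMAKE_CUDA_ARCHITECTURES" cudaFlags.cmakeCudaArchitecturesString)
  ];
}
```

The scope keeps the short name, and each consumer decides locally how explicit the binding needs to be.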

ConnorBaker · 2025-05-12T17:06:51Z

Excellent points!

In terms of moving this PR forward, is it acceptable to you if:

1. I keep the name of backendStdenv instead of renaming it to cudaStdenv (all the additional attributes would remain as needed for further refactors)
2. I update the names and documentation of the functions in cudaLib.utils to indicate there are no stability guarantees (I'm thinking a single underscore, as double-underscore seems to be mostly used by Nix)
3. I do a tree-wide refactor to remove cudaFlags and replace it with flags
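
As a rough illustration of the single-underscore convention from the second point, a helper set could mark unstable functions like this. The attribute names below are made up for the example and are not taken from cudaLib.

```nix
# Hypothetical layout; `_dropDots` is an illustrative name only.
{
  utils = {
    # The leading underscore signals "no stability guarantees": the name or
    # behaviour may change at any time, including in a backport.
    # Removes every "." from a string, e.g. to turn a version into a suffix.
    _dropDots = builtins.replaceStrings [ "." ] [ "" ];
  };
}
```

With this set bound to a hypothetical name `cudaLibSketch`, `cudaLibSketch.utils._dropDots "8.6"` evaluates to `"86"`. The underscore costs nothing at the call site but makes the lack of guarantees visible wherever the helper is used.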

I want to have a further discussion with you about the implementation of a cudaStdenv with the properties you've outlined above, but I think that will be a longer series of discussions and I don't want to hold up landing fixes for package set leakage and introduction of pkgsCuda.

SomeoneSerge · 2025-05-12T17:17:37Z

> single underscore

Seems like you're right.

> tree-wide cudaFlags -> flags

Separate PR to avoid extra yak-shaving?

> I think that will be a longer series of discussions

Safe to assume so 👍

> expose ... minimal functionality to ... create, modify, or extend CUDA package sets

I didn't comment on this part - exactly my concern right now too! The artifact I was hoping to introduce at the top-level is something like a "cuda (metadata) database", which could be re-instantiated with a different set of manifests, and a function approx. of the form `DB -> VersionConstraints -> List Extension -> CudaPackageSet`
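
To make the shape of that function concrete, here is a minimal sketch in plain Nix. Everything in it is an assumption for illustration: the `manifests` layout, the `cudaVersion` constraint, the `mkCudaPackageSet` name, and the string "packages" are invented and do not reflect the actual database design.

```nix
let
  # Apply overlay-style extensions (final: prev: { ... }) on top of a base set.
  # `final` is the fully extended result, tied back in lazily.
  applyExtensions =
    base: extensions:
    let
      final = builtins.foldl' (prev: ext: prev // ext final prev) base extensions;
    in
    final;

  # mkCudaPackageSet :: DB -> VersionConstraints -> List Extension -> CudaPackageSet
  mkCudaPackageSet =
    db: constraints: extensions:
    applyExtensions db.manifests.${constraints.cudaVersion} extensions;

  # A toy "database" that could be re-instantiated with other manifests.
  db = {
    manifests = {
      "12.6" = {
        cuda_nvcc = "nvcc-12.6";
        cuda_cudart = "cudart-12.6";
      };
      "12.8" = {
        cuda_nvcc = "nvcc-12.8";
        cuda_cudart = "cudart-12.8";
      };
    };
  };
in
mkCudaPackageSet db { cudaVersion = "12.8"; } [
  # An extension may refer to the final set, as with Nixpkgs overlays.
  (final: prev: { toolchain = [ final.cuda_nvcc final.cuda_cudart ]; })
]
```

Swapping `db` for one built from a different set of manifests leaves the constraints and extensions untouched, which is the kind of re-use the type signature is meant to suggest.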

ConnorBaker · 2025-05-12T18:32:08Z

Introduction of cudaLib has been split into #406531. This PR is closed as discussions about goals and implementation details of cudaStdenv need to happen first.

ConnorBaker self-assigned this May 10, 2025

ConnorBaker added the 6.topic: cuda Parallel computing platform and API label May 10, 2025

github-project-automation Bot moved this to New in CUDA Team May 10, 2025

github-project-automation Bot added this to CUDA Team May 10, 2025

github-actions Bot added 6.topic: python Python is a high-level, general-purpose programming language. 8.has: documentation This PR adds or changes documentation labels May 10, 2025

ConnorBaker force-pushed the feat/cuda-packages-uses-cudaStdenv branch 2 times, most recently from d4bcc1f to c8dfb5e Compare May 10, 2025 02:26

github-actions Bot added 10.rebuild-darwin: 11-100 This PR causes between 11 and 100 packages to rebuild on Darwin. 10.rebuild-linux: 11-100 This PR causes between 11 and 100 packages to rebuild on Linux. labels May 10, 2025

ConnorBaker moved this from New to 🏗 In progress in CUDA Team May 10, 2025

ConnorBaker force-pushed the feat/cuda-packages-uses-cudaStdenv branch 2 times, most recently from 646b010 to e11f0c0 Compare May 10, 2025 15:02

ConnorBaker added 7 commits May 12, 2025 08:21

cudaPackages: add cudaNamePrefix

034d99b

Signed-off-by: Connor Baker <[email protected]>

cudaPackages.driver_assistant: mark as unsupported

31bb324

cudaLib: init

5a9c25b

Signed-off-by: Connor Baker <[email protected]>

cudaPackages: rewrite backendStdenv as cudaStdenv

bcfcb87

Signed-off-by: Connor Baker <[email protected]>

tree-wide: cudaPackages.backendStdenv -> cudaPackages.cudaStdenv

6cd4bb0

Signed-off-by: Connor Baker <[email protected]>

cudaPackages: switch to cudaLib

248227a

Signed-off-by: Connor Baker <[email protected]>

cudaPackages: doc fixup

40baa7d

Signed-off-by: Connor Baker <[email protected]>

ConnorBaker force-pushed the feat/cuda-packages-uses-cudaStdenv branch from e11f0c0 to 40baa7d Compare May 12, 2025 15:24

SomeoneSerge reviewed May 12, 2025

Comment thread pkgs/development/cuda-modules/lib/default.nix Outdated

fixup! cudaLib: init

b0f36d5

Signed-off-by: Connor Baker <[email protected]>

ConnorBaker mentioned this pull request May 12, 2025

cudaPackages: introduce and use cudaLib #406531

Merged

13 tasks

ConnorBaker closed this May 12, 2025

github-project-automation Bot moved this from 🏗 In progress to ✅ Done in CUDA Team May 12, 2025

This was referenced May 12, 2025

tree-wide: cudaPackages.cudaFlags -> cudaPackages.flags #406545

Merged

tests.cuda.db: init #406740

Closed

ConnorBaker deleted the feat/cuda-packages-uses-cudaStdenv branch May 14, 2025 07:21

ConnorBaker wants to merge 8 commits into NixOS:master from ConnorBaker:feat/cuda-packages-uses-cudaStdenv
