python3.pkgs.mmcv: init at 1.7.1 #219815

benxiao · 2023-03-06T10:40:22Z

Description of changes

Things done

pkgs/development/python-modules/mmcv/default.nix

kirillrdy · 2023-03-09T09:29:18Z

pkgs/development/python-modules/mmcv/default.nix

+
+buildPythonPackage rec {
+  pname = "mmcv";
+  version = "1.7.0";


1.7.1 was released on Jan 3rd, but I wouldn't want to block this MR because of it

That's 3 months ago. If we introduce a new package and it is a trivial update, we should do that right away unless there is a reason not to

kirillrdy

last commit message doesnt meet guidelines,

since this is an init PR, I would just squash all the commits

pkgs/development/python-modules/mmcv/default.nix

kirillrdy · 2023-03-10T05:37:58Z

pkgs/development/python-modules/mmcv/default.nix

+  LDFLAGS = lib.optionalString cudaSupport "-L${cudatoolkit.lib}/lib";
+
+  meta = with lib; {
+    description = "MMCV is a foundational library for computer vision research";


as per contribution guidelines, description should not start or contain name of the package

pkgs/development/python-modules/mmcv/default.nix

bcdarwin · 2023-03-17T02:59:29Z

pkgs/development/python-modules/mmcv/default.nix

+    hash = "sha256-EVu6D6rTeebTKFCMNIbgQpvBS52TKk3vy2ReReJ9VQE=";
+  };
+
+  torch-deps = if cudaSupport then [ torch-bin torchvision-bin ] else [ torch torchvision ];


I don't think the use of torch-bin and torchvision-bin is desirable here. We have torch{With,Without}Cuda and also torch itself can have cuda enabled or not depending on config.enableCuda or overriding torch.

I might use torch torch vision, and let the user overwrite it in a flake file.

If someone might do that, they can always overwrite the torch input to this package

bcdarwin · 2023-03-17T03:00:46Z

pkgs/development/python-modules/mmcv/default.nix

+, pythonOlder
+, cudaSupport ? false
+, cudatoolkit
+, torchCudaArchList ? "Kepler"


I don't think we should have a single package depending on cuda arch settings like this. We already have e.g. config.cudaCapabilities for configuring what architectures cuda-enabled packages are built for.

I will try to use torch/default.nix as an example

bcdarwin · 2023-03-17T03:01:54Z

pkgs/development/python-modules/mmcv/default.nix

+
+  torch-deps = if cudaSupport then [ torch-bin torchvision-bin ] else [ torch torchvision ];
+  nativeBuildInputs = [ ninja which ]
+    ++ lib.lists.optionals cudaSupport [ cudatoolkit.lib cudatoolkit.out cudatoolkit.cc ] ++ torch-deps;


it shouldn't be necessarily for torch/torchvision to be in nativeBuildInputs and propagatedBuildInputs.

@bcdarwin torch is needed at runtime. not sure there is another way to do it without propagatedBuildInputs

ConnorBaker · 2023-03-18T21:08:56Z

pkgs/development/python-modules/mmcv/default.nix

+, pytest
+, pythonOlder
+, cudaSupport ? false
+, cudatoolkit


Not that I'd consider this a blocker for this PR, but for future work consider using cudaPackages over cudatoolkit. cudaPackages uses redistributables from NVIDIA and allows for much smaller closures because you choose which libraries you need. It also makes it easier to maintain because we then have greater visibility into what the package is using.

For an example, take a look at

nixpkgs/pkgs/development/python-modules/torchvision/default.nix

Lines 17 to 40 in 7f194c7

inherit (torch) cudaCapabilities cudaPackages cudaSupport;

inherit (cudaPackages) cudatoolkit cudaFlags cudaVersion;

# NOTE: torchvision doesn't use cudnn; torch does!

# For this reason it is not included.

cuda-common-redist = with cudaPackages; [

cuda_cccl # <thrust/*>

libcublas # cublas_v2.h

libcusolver # cusolverDn.h

libcusparse # cusparse.h

];

cuda-native-redist = symlinkJoin {

name = "cuda-native-redist-${cudaVersion}";

paths = with cudaPackages; [

cuda_cudart # cuda_runtime.h

cuda_nvcc

] ++ cuda-common-redist;

};

cuda-redist = symlinkJoin {

name = "cuda-redist-${cudaVersion}";

paths = cuda-common-redist;

};

.

I break the dependencies into those needed only at build time, run time, or both (common). Unfortunately, because most packages expect all the CUDA libraries to be in a single directly, we need to use symlinkJoin to group them together.

Also, make sure you're specifying the right C/C++ compilers if you're building with CUDA support, like here: #218265. (Torchvision doesn't yet do that; I've got an open issue here: #221898.)

The compiler provided by stdenv isn't necessarily supported by NVCC, and it's possible to get odd symbol errors during linking if the other derivations were built with a different compiler or language standard. That's why it's important to use backendStdenv.cc.

Torchvision and Torch will probably have other things you might want to look at. If you have any questions, don't hesitate to reach out to me or the other CUDA maintainers on Matrix: https://matrix.to/#/#cuda:nixos.org.

hey @ConnorBaker , thanks a lot for the advices and examples. thanks @bcdarwin for raising this. I am thinking about using torch.cudaCapabilities as well. but I think there is a valid usecase for people who want to overwrite with torch-bin (doesn't require compilation), but doesn't have passthru for cudaCapabilities. would you suggest that I copy the same pattern from torch/default.nix to get cudaCababilities?

as for the odd symbol error, I am already getting that with opencv4, as it is compiled with stdenv.cc gcc 12, and cudatoolkit uses gcc 11. I will give it another go.

also. with following nix-shell, the torchvision now fails to build with cuda on latest master @ConnorBaker. i assume gcc 12.2 is from stdenv

with import ../nixpkgs { config.allowUnfree = true; }; let mypthon = let packageOverrides = self: super: { # packagesOverride, will affect all packages that uses the resulted packages as deps torch = super.torch.override{ magma = magma-cuda; cudaSupport = true; }; # opencv4 = super.opencv4.override { # enableBlas = false; # }; }; in python3.override { inherit packageOverrides; self = mypthon; }; in let pythonEnv = mypthon.withPackages (ps: with ps; [ torchvision ]); in mkShell { packages = [ pythonEnv ]; } raise RuntimeError( RuntimeError: The current installed version of g++ (12.2.0) is greater than the maximum required version by CUDA 11.7 (11.5.0). Please make sure to use an adequate version of g++ (>=6.0.0, <=11.5.0). /nix/store/c3f4jdwzn8fm9lp72m91ffw524bakp6v-stdenv-linux/setup: line 1593: pop_var_context: head of shell_variables not a function context

but I think there is a valid usecase for people who want to overwrite with torch-bin (doesn't require compilation), but doesn't have passthru for cudaCapabilities.

IIRC, newer torch binaries use CUDA redistributables from PyPi. If they haven't moved over to that yet, then their builder is still using bash scripts to manually copy libraries into their binary :(

There's a hardcoded set of supported capabilities for each version of PyTorch. You can find them here: https://github.com/pytorch/pytorch/blob/49444c3e546bf240bed24a101e747422d1f8a0ee/torch/utils/cpp_extension.py#L1751-L1752.

Because it's a binary distributions and not "built" with Nixpkgs, the CUDA capabilities supported by the binary aren't tied to what we specify in our config.

For that reason I'd recommend you use the same logic torch/default.nix has. (Those versions are actually sourced from that same file!)

as for the odd symbol error, I am already getting that with opencv4, as it is compiled with stdenv.cc gcc 12, and cudatoolkit uses gcc 11. I will give it another go.

I have a PR open which should help with that: #221370.

i assume gcc 12.2 is from stdenv

That's right, it is :(

I know that error well. It's from here: https://github.com/pytorch/pytorch/blob/49444c3e546bf240bed24a101e747422d1f8a0ee/torch/utils/cpp_extension.py#L415. (Related discussion on whether checks like that should be removed when included in Nixpkgs: #221564 (comment).)

That's odd that you're getting it though, since the torchvision derivation should correctly set the CC/CXX environment variables to the right compiler:

nixpkgs/pkgs/development/python-modules/torchvision/default.nix

Lines 65 to 72 in e400f93

# NOTE: We essentially override the compilers provided by stdenv because we don't have a hook

# for cudaPackages to swap in compilers supported by NVCC.

+ lib.optionalString cudaSupport ''

export CC=${backendStdenv.cc}/bin/cc

export CXX=${backendStdenv.cc}/bin/c++

export TORCH_CUDA_ARCH_LIST="${lib.concatStringsSep ";" cudaCapabilities}"

export FORCE_CUDA=1

'';

.

With respect to your nix file can you try doing something like the following instead? Substitute whatever capability is appropriate for your device (though using just one does make it a lot faster to rebuild stuff from source!).

with import ../nixpkgs { config = { allowUnfree = true; cudaSupport = true; cudaCapabilities = [ "8.6" ]; }; };

I think it was choosing the wrong compiler because cudaSupport wasn't explicitly set to true.

Let me know how that works!

@ConnorBaker with your config, I fixed the issue. thanks a bunch.

@ConnorBaker, your MR on opencv fixed my symbol error.

Yay! Now it just needs to get merged (somewhat difficult given the large number of rebuilds)!

SuperSandro2000 · 2023-03-23T09:45:08Z

pkgs/development/python-modules/mmcv/default.nix

+  # reason for not using pytestCheckHook is similiar to what
+  # is already mentioned in pkgs/development/python-modules/typed-ast
+  # test_cnn test_ops really requires gpus to be useful.
+  # some of the tests take exceedingly long time.
+  # the rest of the tests are disabled due to sandbox env.


This comment does not justify not using pytestCheckHook at all

Suggested change

# reason for not using pytestCheckHook is similiar to what

# is already mentioned in pkgs/development/python-modules/typed-ast

# test_cnn test_ops really requires gpus to be useful.

# some of the tests take exceedingly long time.

# the rest of the tests are disabled due to sandbox env.

I tried to do

preCheck = '' PYTHONPATH=$out/${python.sitePackages}:$PYTHONPATH ''

which seems to work with a lot of packages with c extension, but it didn't work for me.

Another common way is to rm the the conflicting source directory but pytestCheckHook does just invoke pytest as you do, it only builds the arguments to it.

had to find that out the hard way. lol. will add disabled tests like you suggested.

pkgs/development/python-modules/mmcv/default.nix

SuperSandro2000 · 2023-03-23T09:45:38Z

pkgs/development/python-modules/mmcv/default.nix

+    pytest --ignore=tests/test_cnn \
+           --ignore=tests/test_ops \
+           --ignore=tests/test_fileclient.py \
+           --ignore=tests/test_load_model_zoo.py \
+           --ignore=tests/test_runner/test_checkpoint.py \
+           --ignore=tests/test_video/test_processing.py \
+           --ignore=tests/test_utils/test_hub.py \
+           --ignore=tests/test_video/test_reader.py


Please convert this into disabledTestFiles

SuperSandro2000 · 2023-03-23T09:46:23Z

pkgs/development/python-modules/mmcv/default.nix

+    hash = "sha256-EVu6D6rTeebTKFCMNIbgQpvBS52TKk3vy2ReReJ9VQE=";
+  };
+
+  torch-deps = if cudaSupport then [ torch-bin torchvision-bin ] else [ torch torchvision ];


If someone might do that, they can always overwrite the torch input to this package

pkgs/development/python-modules/mmcv/default.nix

SuperSandro2000 · 2023-03-23T09:46:59Z

pkgs/development/python-modules/mmcv/default.nix

+
+buildPythonPackage rec {
+  pname = "mmcv";
+  version = "1.7.0";


That's 3 months ago. If we introduce a new package and it is a trivial update, we should do that right away unless there is a reason not to

bcdarwin · 2023-03-30T14:45:59Z

pkgs/development/python-modules/mmcv/default.nix

+
+  nativeCheckInputs = [ pytestCheckHook ];
+
+  checkInputs = [ lmdb onnx onnxruntime scipy pyturbojpeg tifffile ];


These should almost certainly be moved to nativeCheckInputs.

nixos-discourse · 2023-03-31T14:38:04Z

This pull request has been mentioned on NixOS Discourse. There might be relevant details there:

https://discourse.nixos.org/t/tweag-nix-dev-update-46/26872/1

bcdarwin · 2023-04-06T14:32:24Z

pkgs/development/python-modules/mmcv/default.nix

+  inherit (torch) cudaCapabilities cudaPackages cudaSupport;
+  inherit (cudaPackages) backendStdenv cudaVersion;
+
+  # for cuda support, we are waiting on opencv: misc CUDA-related updates and fixes; add enableLto #221370


This comment is now out of date.

it was merged in Nixos:Staging not master. so its still relevant

It's currently in staging-next https://nixpk.gs/pr-tracker.html?pr=221370

I would suggest to just leave this open until that is merged which will be very soon.

#221370 has been merged into master. @SuperSandro2000

So you can resolve the comment now?

bcdarwin · 2023-04-06T23:49:34Z

Results of nixpkgs-review on x86_64-NixOS:

2 packages built:
python310Packages.mmcv python310Packages.mmcv.dist

bcdarwin

LGTM

SomeoneSerge · 2023-04-06T23:55:09Z

Results of nixpkgs-review on x86_64-NixOS:

Btw, nixpkgs-review has a --post-result option that uses gh cli to post a comment containing the exact command you used to run it. Makes it easier to reproduce the results

pkgs/development/python-modules/mmcv/default.nix

SuperSandro2000 · 2023-04-29T23:37:12Z

@ofborg build python310Packages.mmcv

github-actions bot added the 6.topic: python label Mar 6, 2023

benxiao force-pushed the add-mmcv branch 2 times, most recently from c4ba1d7 to 567a01e Compare March 6, 2023 10:49

ofborg bot added 8.has: package (new) 11.by: package-maintainer 10.rebuild-darwin: 1-10 10.rebuild-darwin: 1 10.rebuild-linux: 1-10 10.rebuild-linux: 1 labels Mar 6, 2023

benxiao force-pushed the add-mmcv branch from 567a01e to 15d1af5 Compare March 6, 2023 12:00

benxiao requested a review from fabaff March 7, 2023 06:56

fabaff reviewed Mar 9, 2023

View reviewed changes

pkgs/development/python-modules/mmcv/default.nix Outdated Show resolved Hide resolved

pkgs/development/python-modules/mmcv/default.nix Outdated Show resolved Hide resolved

pkgs/development/python-modules/mmcv/default.nix Show resolved Hide resolved

kirillrdy reviewed Mar 9, 2023

View reviewed changes

benxiao requested a review from kirillrdy March 10, 2023 05:28

benxiao force-pushed the add-mmcv branch 2 times, most recently from 03ff9a1 to 3e58cc7 Compare March 10, 2023 05:36

kirillrdy reviewed Mar 10, 2023

View reviewed changes

benxiao force-pushed the add-mmcv branch from 3e58cc7 to 8665072 Compare March 14, 2023 00:02

kirillrdy reviewed Mar 14, 2023

View reviewed changes

pkgs/development/python-modules/mmcv/default.nix Outdated Show resolved Hide resolved

benxiao force-pushed the add-mmcv branch from 8665072 to 6859430 Compare March 14, 2023 00:39

benxiao requested a review from fabaff March 14, 2023 02:00

benxiao force-pushed the add-mmcv branch 2 times, most recently from 61a17c0 to c81b004 Compare March 14, 2023 06:04

benxiao requested a review from bcdarwin March 16, 2023 23:47

bcdarwin suggested changes Mar 17, 2023

View reviewed changes

ConnorBaker reviewed Mar 18, 2023

View reviewed changes

ConnorBaker added the 6.topic: cuda label Mar 18, 2023

SuperSandro2000 reviewed Mar 23, 2023

View reviewed changes

benxiao force-pushed the add-mmcv branch from c81b004 to 0c0f29d Compare March 26, 2023 21:37

ofborg bot requested a review from basvandijk March 26, 2023 21:55

ofborg bot added 10.rebuild-darwin: 1-10 10.rebuild-darwin: 1 10.rebuild-linux: 1-10 10.rebuild-linux: 1 and removed 10.rebuild-darwin: 101-500 10.rebuild-linux: 501+ 10.rebuild-linux: 501-1000 labels Mar 27, 2023

benxiao requested a review from SuperSandro2000 March 29, 2023 11:46

benxiao changed the title ~~python3.pkgs.mmcv: init at 1.7.0~~ python3.pkgs.mmcv: init at 1.7.1 Mar 29, 2023

benxiao requested a review from bcdarwin March 30, 2023 09:05

bcdarwin suggested changes Mar 30, 2023

View reviewed changes

benxiao force-pushed the add-mmcv branch 2 times, most recently from eba6834 to 9275745 Compare April 6, 2023 00:50

benxiao requested a review from kirillrdy April 6, 2023 00:52

benxiao force-pushed the add-mmcv branch from 9275745 to 6ba3596 Compare April 6, 2023 00:59

bcdarwin reviewed Apr 6, 2023

View reviewed changes

bcdarwin approved these changes Apr 6, 2023

View reviewed changes

SuperSandro2000 reviewed Apr 8, 2023

View reviewed changes

pkgs/development/python-modules/mmcv/default.nix Outdated Show resolved Hide resolved

SuperSandro2000 reviewed Apr 8, 2023

View reviewed changes

pkgs/development/python-modules/mmcv/default.nix Outdated Show resolved Hide resolved

benxiao force-pushed the add-mmcv branch from c98ef7a to 1c2ab10 Compare April 11, 2023 11:42

benxiao requested a review from SuperSandro2000 April 14, 2023 12:52

python3.pkgs.mmcv: init at 1.7.1

53feaed

benxiao force-pushed the add-mmcv branch from 1c2ab10 to 53feaed Compare April 19, 2023 09:25

SuperSandro2000 reviewed Apr 23, 2023

View reviewed changes

pkgs/development/python-modules/mmcv/default.nix Show resolved Hide resolved

benxiao requested a review from SuperSandro2000 April 23, 2023 23:30

SuperSandro2000 merged commit cfb4d61 into NixOS:master Apr 30, 2023
21 checks passed

	inherit (torch) cudaCapabilities cudaPackages cudaSupport;
	inherit (cudaPackages) cudatoolkit cudaFlags cudaVersion;

	# NOTE: torchvision doesn't use cudnn; torch does!
	# For this reason it is not included.
	cuda-common-redist = with cudaPackages; [
	cuda_cccl # <thrust/*>
	libcublas # cublas_v2.h
	libcusolver # cusolverDn.h
	libcusparse # cusparse.h
	];

	cuda-native-redist = symlinkJoin {
	name = "cuda-native-redist-${cudaVersion}";
	paths = with cudaPackages; [
	cuda_cudart # cuda_runtime.h
	cuda_nvcc
	] ++ cuda-common-redist;
	};

	cuda-redist = symlinkJoin {
	name = "cuda-redist-${cudaVersion}";
	paths = cuda-common-redist;
	};

	# NOTE: We essentially override the compilers provided by stdenv because we don't have a hook
	# for cudaPackages to swap in compilers supported by NVCC.
	+ lib.optionalString cudaSupport ''
	export CC=${backendStdenv.cc}/bin/cc
	export CXX=${backendStdenv.cc}/bin/c++
	export TORCH_CUDA_ARCH_LIST="${lib.concatStringsSep ";" cudaCapabilities}"
	export FORCE_CUDA=1
	'';


		nativeCheckInputs = [ pytestCheckHook ];

		checkInputs = [ lmdb onnx onnxruntime scipy pyturbojpeg tifffile ];

python3.pkgs.mmcv: init at 1.7.1 #219815

python3.pkgs.mmcv: init at 1.7.1 #219815

Conversation

benxiao commented Mar 6, 2023 • edited

Description of changes

Things done

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

kirillrdy left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

benxiao Mar 17, 2023 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

benxiao Mar 21, 2023 • edited

Choose a reason for hiding this comment

benxiao Mar 21, 2023 • edited

Choose a reason for hiding this comment

ConnorBaker Mar 21, 2023 • edited

Choose a reason for hiding this comment

benxiao Mar 23, 2023 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

benxiao Mar 27, 2023 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

benxiao Apr 5, 2023 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

nixos-discourse commented Mar 31, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bcdarwin commented Apr 6, 2023

bcdarwin left a comment

Choose a reason for hiding this comment

SomeoneSerge commented Apr 6, 2023

SuperSandro2000 commented Apr 29, 2023

benxiao commented Mar 6, 2023 •

edited

benxiao Mar 17, 2023 •

edited

benxiao Mar 21, 2023 •

edited

benxiao Mar 21, 2023 •

edited

ConnorBaker Mar 21, 2023 •

edited

benxiao Mar 23, 2023 •

edited

benxiao Mar 27, 2023 •

edited

benxiao Apr 5, 2023 •

edited