internal assert failure #128564
Labels
module: linear algebra
Issues related to specialized linear algebra operations in PyTorch, including matrix multiplication (matmul)
needs reproduction
Someone else needs to try reproducing the issue given the instructions. No action needed from user
triaged
This issue has been looked at by a team member, then triaged and prioritized into an appropriate module
🐛 Describe the bug
Traceback (most recent call last):
  File "/home/ec2-user/experiments/kan/pykan/kan_exp_1.py", line 25, in <module>
    model.train(dataset, opt="LBFGS", steps=50);
  File "/home/ec2-user/experiments/kan/pykan/kan/KAN.py", line 898, in train
    self.update_grid_from_samples(dataset['train_input'][train_id].to(device))
  File "/home/ec2-user/experiments/kan/pykan/kan/KAN.py", line 244, in update_grid_from_samples
    self.act_fun[l].update_grid_from_samples(self.acts[l])
  File "/home/ec2-user/experiments/kan/pykan/kan/KANLayer.py", line 218, in update_grid_from_samples
    self.coef.data = curve2coef(x_pos, y_eval, self.grid, self.k, device=self.device)
  File "/home/ec2-user/experiments/kan/pykan/kan/spline.py", line 138, in curve2coef
    coef = torch.linalg.lstsq(mat.to(device), y_eval.unsqueeze(dim=2).to(device),
RuntimeError: false INTERNAL ASSERT FAILED at "../aten/src/ATen/native/BatchLinearAlgebra.cpp":1539, please report a bug to PyTorch. torch.linalg.lstsq: (Batch element 0): Argument 6 has illegal value. Most certainly there is a bug in the implementation calling the backend library.
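A possible first diagnostic for anyone trying to reproduce this: the sketch below wraps `torch.linalg.lstsq` and rejects non-finite inputs before they reach the backend. It assumes, without confirmation from the report, that NaN or Inf values in `mat` or `y_eval` could be what triggers the "illegal value" error. The helper name `checked_lstsq` is hypothetical and is not part of pykan or PyTorch.

```python
import torch


def checked_lstsq(mat, rhs, driver=None):
    """Hypothetical helper: validate inputs, then call torch.linalg.lstsq.

    Raises ValueError with a short summary if either tensor contains
    NaN or Inf, instead of letting the backend library fail.
    """
    for name, t in (("mat", mat), ("rhs", rhs)):
        finite = torch.isfinite(t)
        if not finite.all():
            bad = int((~finite).sum().item())
            raise ValueError(
                f"{name} has {bad} non-finite value(s); shape={tuple(t.shape)}"
            )
    # driver=None lets PyTorch choose its default LAPACK driver.
    return torch.linalg.lstsq(mat, rhs, driver=driver).solution


if __name__ == "__main__":
    # Batched shapes chosen arbitrarily for illustration.
    mat = torch.randn(3, 5, 4, dtype=torch.float64)
    rhs = torch.randn(3, 5, 1, dtype=torch.float64)
    print(checked_lstsq(mat, rhs).shape)
```

If the wrapper raises `ValueError` on the failing runs, the problem would lie upstream in how the inputs are computed. If it does not, the inputs are finite and the assert comes from something else.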
=================================== (this code below fails roughly once every three tries)
Versions
wget https://raw.githubusercontent.com/pytorch/pytorch/main/torch/utils/collect_env.py
# For security purposes, please check the contents of collect_env.py before running it.
python collect_env.py
cc @jianyuh @nikitaved @pearu @mruberry @walterddr @xwang233 @lezcano