Please consider adding MIG (MI-rror with G-radient modification) to torch.nn #122680

YagaoDirac · 2024-03-26T02:50:29Z

🚀 The feature, motivation and pitch

https://github.com/YagaoDirac/Pytorch-extension-from-YagaoDirac-v2/blob/main/v2%20with%20basic%20test.py
I implemented this 2 weeks ago. It's probably a better implementation of the Linear layer. It speeds up the training while let people stack much more such layers directly without any trick.

Alternatives

In the code I implemented 3 different types of similar purpose. Each of them are tested and can work individually( if my tests are not too wrong).

Additional context

Also, if you decide to add this to pytorch, remember to rename it.
Notice the "untested workflow" in the file. It's probably a better way, and can be integreted into the layer itself.
More info in the file.

cc @albanD @mruberry @jbschlosser @walterddr @mikaylagawarecki

YagaoDirac · 2024-03-29T11:41:58Z

@albanD Hi, thank you for tagging this issue with "needs research". Finally someone trusts me at least a bit. If you need more info about the code, my twitter, github, gmail and discord, are all the same name. Simply dm me somewhere.

mikaylagawarecki · 2024-04-09T18:22:05Z

Hi @YagaoDirac , could you provide more detail on what MIG is (e.g. research papers where it is used or proposed) and further elaboration on what problem it is meant to solve?

YagaoDirac · 2024-04-11T08:21:48Z

Hi @YagaoDirac , could you provide more detail on what MIG is (e.g. research papers where it is used or proposed) and further elaboration on what problem it is meant to solve?

Hi @mikaylagawarecki . I'm glad people don't ignore my work.
Short answer: I explained it in the python file.

I'm a hobbist. I didn't write a paper for it since I'm recently too busy on some other tasks. MIG and 2 other tools in the code is designed as a better implementation of the FCNN(torch.nn.Linear). It's trainable no matter you stack however many of it in a row. (If you stack 5 FCNN in a row, the training is too slow. Basically I don't do it. But mig and 2 other tools can do the same thing with at least 10 stacked in a row.) According to my tests(if they are not too wrong), it's 1000x to 100_000 times faster in some cases. The test code is also in the code in the link.
Now I'm asking 2 of my friends to coop on a paper for this but they can not begin at the moment. One of them has to wait until sep this year, the other one is bit slower. So if Pytorch team is interested in this, you guys are absolutely much faster, so people will begin to know this tool earlier, which is great.

cpuhrsch added module: nn Related to torch.nn triaged This issue has been looked at a team member, and triaged and prioritized into an appropriate module labels Mar 28, 2024

albanD added the needs research We need to decide whether or not this merits inclusion, based on research world label Mar 28, 2024

YagaoDirac mentioned this issue Apr 1, 2024

🚀 Contributing to Keras 🚀 keras-team/keras#18442

Open

YagaoDirac mentioned this issue Apr 15, 2024

Please consider adding MIG (MI-rror with G-radient modification) tracel-ai/burn#1629

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Please consider adding MIG (MI-rror with G-radient modification) to torch.nn #122680

Please consider adding MIG (MI-rror with G-radient modification) to torch.nn #122680

YagaoDirac commented Mar 26, 2024 •

edited by pytorch-bot bot

Loading

YagaoDirac commented Mar 29, 2024 •

edited

Loading

mikaylagawarecki commented Apr 9, 2024 •

edited

Loading

YagaoDirac commented Apr 11, 2024 •

edited

Loading

Please consider adding MIG (MI-rror with G-radient modification) to torch.nn #122680

Please consider adding MIG (MI-rror with G-radient modification) to torch.nn #122680

Comments

YagaoDirac commented Mar 26, 2024 • edited by pytorch-bot bot Loading

🚀 The feature, motivation and pitch

Alternatives

Additional context

YagaoDirac commented Mar 29, 2024 • edited Loading

mikaylagawarecki commented Apr 9, 2024 • edited Loading

YagaoDirac commented Apr 11, 2024 • edited Loading

YagaoDirac commented Mar 26, 2024 •

edited by pytorch-bot bot

Loading

YagaoDirac commented Mar 29, 2024 •

edited

Loading

mikaylagawarecki commented Apr 9, 2024 •

edited

Loading

YagaoDirac commented Apr 11, 2024 •

edited

Loading