Add linear wrapper and kan feedback #189

klei22 · 2024-06-18T21:28:13Z

No description provided.

This change prevents division by zero. This appears to be the reason for the initial instability of inference, so now we can explore various KAN applications. Note: We should convert this clamp into an argparse param and config for experimentation.

This allows us to set the mean and standard deviation for random weights for both linear modules and for the embedding table.

After experimentation, we don't need to include these after including the dictionary.

Utilize inheritance based interface adaptation.

klei22 · 2024-06-18T21:33:02Z

Previous discussions here:

After these changes wpe is not set if we're using a different strategy for position embedding.

SenmiaoORZ and others added 8 commits June 4, 2024 21:52

replace linear with KAN

82c74d6

Add KAN for explorations

bdbf490

Improve stability of KAN Module

6bc1153

This change prevents division by zero. This appears to be the reason for the initial instability of inference, so now we can explore various KAN applications. Note: We should convert this clamp into an argparse param and config for experimentation.

Add options random init mean and std to train.py

aeca808

This allows us to set the mean and standard deviation for random weights for both linear modules and for the embedding table.

Clean imports section of model.py

6277d35

After experimentation, we don't need to include these after including the dictionary.

Add polymorphic interface for linear variations

45a5b2e

Simplify linear wrapper to inheritance based

6e70517

Utilize inheritance based interface adaptation.

Merge branch 'add_kan_and_hyperparams' into origin_main

df5bd39

klei22 requested a review from gkielian June 18, 2024 21:28

klei22 mentioned this pull request Jun 18, 2024

Add kan and hyperparams #182

Closed

Don't set WPE if not selected

01aef4b

After these changes wpe is not set if we're using a different strategy for position embedding.

gkielian approved these changes Jun 18, 2024

View reviewed changes

gkielian merged commit 0e9ac28 into ReaLLMASIC:master Jun 18, 2024
5 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add linear wrapper and kan feedback #189

Add linear wrapper and kan feedback #189

klei22 commented Jun 18, 2024

klei22 commented Jun 18, 2024

Add linear wrapper and kan feedback #189

Add linear wrapper and kan feedback #189

Conversation

klei22 commented Jun 18, 2024

klei22 commented Jun 18, 2024