
SparseModule

Summary

The Strong Lottery Ticket Hypothesis (Ramanujan et al. 2020, Malach et al. 2020, ...) states that randomly initialized neural networks already contain subnetworks with surprisingly good accuracy. SparseModule lets you find such subnetworks in any neural network architecture.
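
Under the hood, approaches in this line of work (e.g. the edge-popup algorithm of Ramanujan et al. 2020) learn a score for every weight and keep the top-k scores as a binary mask, passing gradients straight through to the scores. Below is a minimal sketch of that idea for a single linear layer. It is illustrative only; the names TopKMask, MaskedLinear, and keep_ratio are assumptions for this sketch, not part of SparseModule's API.

import torch
import torch.nn as nn
import torch.nn.functional as F

class TopKMask(torch.autograd.Function):
    # Binary top-k mask with a straight-through gradient to the scores.
    @staticmethod
    def forward(ctx, scores, k):
        mask = torch.zeros_like(scores)
        _, idx = scores.abs().flatten().topk(k)
        mask.view(-1)[idx] = 1.0
        return mask

    @staticmethod
    def backward(ctx, grad_output):
        # Pass the gradient straight through to the scores; k gets no gradient.
        return grad_output, None

class MaskedLinear(nn.Module):
    # A linear layer whose weights are frozen at init; only the scores train.
    def __init__(self, linear, keep_ratio):
        super().__init__()
        # Frozen random weights, stored as a buffer so that parameters()
        # returns only the scores.
        self.register_buffer("weight", linear.weight.detach().clone())
        self.scores = nn.Parameter(torch.randn_like(self.weight) * 0.01)
        self.k = max(1, int(keep_ratio * self.weight.numel()))

    def forward(self, x):
        mask = TopKMask.apply(self.scores, self.k)
        return F.linear(x, self.weight * mask)

Training a layer like this proceeds exactly as in the SparseModule example below: the optimizer updates the scores, which in turn changes which weights survive the mask.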

How to Use

Wrap your network in SparseModule. That's it!

import torch
import torch.nn as nn
import torch.optim as optim

from sparse_module import SparseModule  # import path may differ

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

net = nn.Linear(7, 5, bias=False)

# All parameters in net are randomly initialized and fixed.
# sparse_net has score parameters, which are latent variables for the subnetwork masks.
sparse_net = SparseModule(net, 0.8)
sparse_net = sparse_net.to(device)

# sparse_net.parameters() returns only the score parameters,
# never the original parameters of net.
optimizer = optim.Adam(sparse_net.parameters(), lr=0.1)

criterion = nn.MSELoss()
for i in range(10):
    optimizer.zero_grad()
    input = torch.randn(3, 7).to(device)
    target = torch.randn(3, 5).to(device)

    # Forward computation with the masked net.
    output = sparse_net(input)
    loss = criterion(output, target)
    loss.backward()

    # Train the score parameters (and thus the masks).
    optimizer.step()
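
Because only the scores are trainable (the wrapped network's weights stay at their random initialization, as noted in the comments above), a check along the following lines should hold. This is a sketch, assuming net remains accessible after wrapping and is moved to device together with sparse_net:

# Snapshot after moving to device, before training:
w_before = net.weight.detach().clone()
s_before = [p.detach().clone() for p in sparse_net.parameters()]

# ... run the training loop above ...

# The original weights are untouched; only the scores have moved.
assert torch.equal(net.weight.detach(), w_before)
assert any(not torch.equal(p.detach(), q)
           for p, q in zip(sparse_net.parameters(), s_before))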

Requirements

We have verified that the code works under the following settings:

  • Python 3.7.7
  • PyTorch 1.5.0
