Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix pca on sparse data reproducibility #1240

Merged
merged 4 commits into from May 21, 2020

Conversation

ivirshup
Copy link
Member

Bugfix for the sparse pca.

It looks like we forgot to pass a random seed when this is used... But we also never had a test that checks if you run the function twice with the same random seed it returns the same result.

This PR fixes both these issues. The new tests are a bit slow, but are definitely needed.

I've also added a fixture for returning a copy of the pbmc3k dataset which has been normalized and had highly_variable_genes run on it. Preparation of the object should only happen once per run of the suite, but a new copy will be provided for each test that uses it. This was done to speed up the new tests.

@ivirshup
Copy link
Member Author

To limit the amount of time we have to wait for rebuilding, I've added the change from #1236 here

@ivirshup ivirshup merged commit ec3a44f into scverse:master May 21, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant