Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve deduplicate function #92

Merged
merged 8 commits into from
Jun 8, 2020
Merged

Conversation

mattijn
Copy link
Owner

@mattijn mattijn commented Jun 8, 2020

This PR improves the _deduplicate() function in the Dedup phase. In #91 was noted that the np.where() function is a bottleneck when used in a loop.

A hinted mapping array method is adopted to replace element-wise values and numpy.digitize() to incorporate for popped linestrings in the bookkeeping.

Close #91

@mattijn
Copy link
Owner Author

mattijn commented Jun 8, 2020

Some commits conflicts with PRs mixing. This PR should only be this single commit: 485914b

@mattijn mattijn merged commit 8a78a37 into master Jun 8, 2020
@mattijn mattijn deleted the improve-deduplicate-function branch July 3, 2020 13:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

optimise deduplicate function
1 participant