How to speed up the process of adding patterns #10

Open
Hunter-Leo opened this issue Nov 15, 2021 · 1 comment

Comments

@Hunter-Leo

  • spikex version: 0.5.0
  • Python version:
  • Operating System: linux

Description

Hey, guys. Your tool is very powerful, thanks for sharing.
I ran into a problem: initializing LabelX with 30 thousand patterns takes a huge amount of time.
Adding the patterns is much slower than in spaCy, so I wonder whether you could propose a solution?

@paoloq
Contributor

paoloq commented Jan 28, 2022

Hi @Hunter-Leo!

The time it takes to index patterns depends on many factors.
A couple of things could help identify where the issue is:

  • You can profile an indexing run, for example with py-spy. That way we can see where most of the time is spent.
  • You can share some of the patterns you're using, to give an idea of how complex they are.
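
As a rough illustration of the profiling suggestion, here is a minimal sketch that uses the standard-library cProfile module instead of py-spy, so it runs without extra installs. The `add_patterns` function is a hypothetical stand-in for the real indexing step, not spikex's API; swap in the code that actually builds LabelX. With py-spy itself, something like `py-spy record -o profile.svg -- python your_script.py` (script name assumed) should produce a flame graph of the same run.

```python
import cProfile
import io
import pstats


def profile_call(func, *args, top=10, **kwargs):
    """Run func under cProfile and return (result, report_text)."""
    profiler = cProfile.Profile()
    result = profiler.runcall(func, *args, **kwargs)
    buffer = io.StringIO()
    stats = pstats.Stats(profiler, stream=buffer)
    # Sort by cumulative time so the most expensive call chains come first.
    stats.sort_stats("cumulative").print_stats(top)
    return result, buffer.getvalue()


def add_patterns(patterns):
    # Hypothetical stand-in for the real indexing step (not spikex code).
    index = {}
    for label, pattern in patterns:
        index.setdefault(label, []).append(pattern)
    return index


if __name__ == "__main__":
    # Synthetic workload of roughly the size mentioned in the issue.
    patterns = [(f"label_{i % 100}", f"pattern_{i}") for i in range(30_000)]
    index, report = profile_call(add_patterns, patterns)
    print(report)
```

The functions at the top of the report, sorted by cumulative time, are the ones worth sharing here.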

Any other suggestions are, of course, welcome!
