Guess how long a Sparv process will take to run #73

Open
anne17 opened this issue Apr 17, 2023 · 0 comments
Labels
enhancement New feature or request

anne17 commented Apr 17, 2023

Many test users have expressed the desire to get an estimate of how long they will have to wait until their corpus is done processing. Can we guess this somehow? How long it takes depends on:

  • place in the queue
  • total corpus size (number of files, file sizes, total size)
  • which annotators are run (stanza, swener and compound analysis take the longest)
  • how clean the data is (a lot of OCR trash often increases processing time)
  • how many long strings the texts contain (more long strings means longer processing time)
  • ...?

If we have a time estimate per corpus, we should be able to get an estimate for a queued corpus by adding up the estimates for the corpora ahead of it in the queue.
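
A minimal sketch of how such an estimate could be put together, assuming the queue processes one corpus at a time in order and that processing time scales roughly linearly with corpus size. Everything here is hypothetical and not part of Sparv: the `QueuedCorpus` class, the function names, and especially the per-megabyte rates, which are placeholders that would have to be measured on real runs. Data cleanliness and long strings are not modelled.

```python
from dataclasses import dataclass

# Hypothetical cost in seconds per megabyte of input for each annotator.
# Placeholder numbers only; real values would have to be measured.
SECONDS_PER_MB = {"stanza": 60.0, "swener": 30.0, "compound": 20.0}
DEFAULT_SECONDS_PER_MB = 5.0  # assumed rate for any other annotator


@dataclass
class QueuedCorpus:
    """Illustrative stand-in for a corpus waiting in the queue."""
    name: str
    total_bytes: int
    annotators: tuple[str, ...]


def estimate_processing_seconds(corpus: QueuedCorpus) -> float:
    """Estimate processing time as size times the summed annotator rates."""
    megabytes = corpus.total_bytes / 1_000_000
    rate = sum(SECONDS_PER_MB.get(a, DEFAULT_SECONDS_PER_MB) for a in corpus.annotators)
    return megabytes * rate


def estimate_finish_seconds(queue: list[QueuedCorpus], position: int) -> float:
    """Estimate seconds until queue[position] is done.

    Sums the estimates of every corpus ahead in the queue plus the corpus itself.
    """
    return sum(estimate_processing_seconds(c) for c in queue[: position + 1])


queue = [
    QueuedCorpus("first", 2_000_000, ("stanza",)),
    QueuedCorpus("second", 1_000_000, ("swener", "other")),
]
wait = estimate_finish_seconds(queue, 1)
print(f"Estimated time until 'second' is done: {wait:.0f} s")
```

In practice the rates could be fitted from logged run times of earlier corpora, and the estimate refined with extra features such as file count.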

@anne17 anne17 added the enhancement New feature or request label Dec 12, 2023