Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How do I know the progress of Training? #60

Closed
tvvignesh opened this issue Aug 20, 2018 · 6 comments
Closed

How do I know the progress of Training? #60

tvvignesh opened this issue Aug 20, 2018 · 6 comments

Comments

@tvvignesh
Copy link

First of all, great work with this library. While it works for small data sets. I wanted to do the same for large datasets. It looks like it is stuck in trainbatch and I am not sure about the progress of the training completed so far.

How do I know how many records have been trained so far or get some feedback that its processing?

@erelsgl
Copy link
Owner

erelsgl commented Aug 20, 2018 via email

@tvvignesh
Copy link
Author

Hmm. I managed for now by passing debug as true to the options like so:

var TextClassifier = limdu.classifiers.multilabel.BinaryRelevance.bind(0, {
        binaryClassifierType: limdu.classifiers.SvmJs.bind(0, { C: 1.0 }),
        debug: true
});

And doing that I noticed that SvmJs gets stuck with the first input itself if the size of the input of dataset is more than 250 rows or so and does not train further (waited for around 15-20 minutes).

Hence, I had to go far WordExtractor + Winnow classifier like so:

var TextClassifier = limdu.classifiers.multilabel.BinaryRelevance.bind(0, {
        binaryClassifierType: limdu.classifiers.Winnow.bind(0, { retrain_count: 10 }),
        debug: true
});

Thanks anyways for your reply.

@erelsgl
Copy link
Owner

erelsgl commented Aug 20, 2018 via email

@tvvignesh
Copy link
Author

Not sure if its a bug (since it did not throw any error or so) but there is an issue which has been raised and a pull request merged here: karpathy/svmjs#5

@tvvignesh
Copy link
Author

@erelsgl Tried with SvmPerf and it was faster and worked great for my use case. Thanks.

Sorry for bothering you, but had few more questions to ask:

The SvmPerf application is generating a lot of model and bin files in my directory. What should I do with that? Is it possible to reuse the same without retraining?

Also, was curious how this will scale in a production environment. Any suggestions?

Btw, Great work with the library. Works like a breeze. (Needs some minor cleanups though - will see if I can help when I get the time. For instance, it was not able to find localstorage file in bayesian classifier)

@erelsgl
Copy link
Owner

erelsgl commented Aug 28, 2018 via email

@erelsgl erelsgl closed this as completed Apr 29, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants