-
Notifications
You must be signed in to change notification settings - Fork 77
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
No database dependency during inference/serving #316
Comments
One step further towards this goal, I'd like to make all the child class of Currently, the To make these methods unaware of the database,
For example, @senwu , @lukehsiao thoughts? |
At a high level, this sounds like an excellent idea to me. I wonder if/how this might affect performance as well. Perhaps we can get less lock contention if the different UDFs are not using the session object directly. |
Good point. Also we have to be careful not to run out of memory when a large corpus is processed. As you have noticed, the architecture looks like a Map-Reduce (mapper: each UDF and reducer: UDFRunner). |
Is your feature request related to a problem? Please describe.
I think a Fonduer-based app lifecycle has three phases: development, training, and serving.
During development, you may have many iterations of re-definition of mention/candidate subclasses, labeling functions, which involve re-parsing, re-extraction, re-labeling, etc.
I understand that it is important to persist intermediate results to a database to save time.
Depending on the size of training dataset, you may need to persist intermediate results during training too.
On the other hand, there is no need to persist intermediate results to a database during inference/serving.
The dependency on database (postgres) makes a Fonduer-based app less portable and less scalable.
Describe the solution you'd like
I'd like no database dependency during inference/serving.
Describe alternatives you've considered
sqlite is easier to install than postgres, but still there is no need to persist intermediate results to a database.
Additional context
Related to #137.
The text was updated successfully, but these errors were encountered: