multitenancy support #36
Comments
Personally I prefer the idea of having several models served by one server, although I think the implications of that for spaCy and MITIE would have to be well investigated. So basically you would have a /models/flights or /models/restaurants, right? Each of them would then be able to answer queries at /models/flights/parse?q=flight from Tokyo to Munich
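The URL scheme proposed above can be sketched with plain standard-library code, independent of any web framework. The model names and parse functions here are illustrative stand-ins, not the rasa NLU API:

```python
from urllib.parse import urlparse, parse_qs

# Hypothetical per-tenant parse functions; in rasa NLU each would be a
# trained interpreter for that tenant's domain.
MODELS = {
    "flights": lambda q: {"model": "flights", "text": q},
    "restaurants": lambda q: {"model": "restaurants", "text": q},
}

def dispatch(url):
    """Route /models/<name>/parse?q=... to the matching model."""
    parsed = urlparse(url)
    parts = parsed.path.strip("/").split("/")
    if len(parts) != 3 or parts[0] != "models" or parts[2] != "parse":
        raise ValueError("expected a path of the form /models/<name>/parse")
    name = parts[1]
    if name not in MODELS:
        raise KeyError("no model named %r" % name)
    # parse_qs returns a list per key; take the first "q" value.
    query = parse_qs(parsed.query).get("q", [""])[0]
    return MODELS[name](query)
```

In a real server the same lookup would sit behind a framework route such as Flask's `/models/<name>/parse`, with `MODELS` populated from trained model files at start-up.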
Just an idea: use multiprocessing's Process class to spawn processes from a "main" process, then use its Queue class to implement a facade over spaCy, MITIE and anything else that has a heavy memory footprint or a long start-up time. That way those things only run in the "main" process, as long as they can be shared. EDIT: on second thought, it would be cleaner and just as performant to make spaCy and MITIE microservices themselves.
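The facade idea above can be sketched as a worker that loads the heavy resource exactly once and answers requests over queues. To keep the sketch runnable anywhere, it uses `multiprocessing.dummy` (thread-backed, but the same `Process` API); for real process isolation you would swap in `multiprocessing.Process` and `multiprocessing.Queue`, paying the cost of serialising messages. The class and callback names are illustrative:

```python
from multiprocessing.dummy import Process  # thread-backed, same API as multiprocessing.Process
from queue import Queue


class HeavyBackendFacade:
    """Keep a heavy resource (e.g. a spaCy or MITIE model) in one
    long-lived worker and serve callers through request/response queues."""

    def __init__(self, load_heavy_model):
        self.requests = Queue()
        self.responses = Queue()
        self.worker = Process(target=self._serve, args=(load_heavy_model,))
        self.worker.daemon = True
        self.worker.start()

    def _serve(self, load_heavy_model):
        model = load_heavy_model()  # the expensive load happens exactly once
        while True:
            text = self.requests.get()
            if text is None:  # shutdown sentinel
                break
            self.responses.put(model(text))

    def parse(self, text):
        self.requests.put(text)
        return self.responses.get()

    def close(self):
        self.requests.put(None)
```

With real processes the queue traffic adds pickling overhead per request, which is part of why the EDIT above suggests standalone microservices instead.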
Thanks for the input @baregawi. You guys are on AWS, right? So would you run this on Elastic Beanstalk or just a regular VPS?
For our main servers, we use Flask as our application framework and deploy through AWS CodeDeploy at the moment since our application is not simple enough for Heroku or Elastic Beanstalk. But I imagine that if we contribute models to rasa_nlu they will be our individual ML/NLP modules which are all Python classes at the moment. |
This suggestion also comes from @3x14159265.

Idea is to have rasa NLU provide multiple apps, e.g. have several models loaded into memory and serve requests based on them (routed by the URL).

The simplest approach is to start a separate server for each model, and use a supervisor. But each process will have word vectors loaded into memory, which means you can't fit very many on a server.
A better way would be to have several models loaded within one server, although I think only the spaCy backend would actually be able to share the large memory component between them. Would probably be doable to modify MITIE to support that as well.

To help plan this out, it would be really helpful if people wrote their intended deployment setup here, so we can discuss various trade-offs.
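The shared-memory-component idea can be sketched as follows: each per-tenant model stays small because it holds a reference to one shared vector table rather than its own copy. The class names and toy vectors are illustrative stand-ins for spaCy's shared vocab/word vectors, not real rasa NLU or spaCy API:

```python
class SharedVectors:
    """Stand-in for the large word-vector table, loaded once per server."""

    def __init__(self):
        # Toy data; in practice this is the multi-gigabyte component.
        self.vectors = {"tokyo": [0.1, 0.9], "munich": [0.8, 0.2]}


class TenantModel:
    """Per-tenant model: small tenant-specific state plus a *reference*
    to the shared vectors, so N tenants cost one vector table, not N."""

    def __init__(self, name, shared):
        self.name = name
        self.shared = shared

    def parse(self, text):
        tokens = text.lower().split()
        known = [t for t in tokens if t in self.shared.vectors]
        return {"model": self.name, "known_tokens": known}


# One shared component, many tenant models routed by name.
shared = SharedVectors()
models = {name: TenantModel(name, shared) for name in ("flights", "restaurants")}
```

This is the property that makes the single-server approach attractive over one-process-per-model: memory grows with the number of tenants' small models, not with the number of copies of the vector table.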