Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add tags attribute to models #906

Merged
merged 3 commits into from Aug 8, 2017
Merged

Add tags attribute to models #906

merged 3 commits into from Aug 8, 2017

Conversation

gunthercox
Copy link
Owner

This change is going to be a prerequisite to making ChatterBot's response selection process more efficient. Using categorized training data from the chatterbot corpus, it will be possible to do text classification on input statements to place them in a group based on the existing training data. Once tagged, a strong assumption can be made by the chat bot that the appropriate response to the input statement exists within the set of statements that have the same tag, or category as the input statement.

Because filtering the results in the database based on a named attribute can be made to be fast and efficient, this will help solve a large portion of the performance problems that the current version of ChatterBot faces (~0.7).

Note: this pull request will only add the tag attributes to statements. Future pull requests will be required to:

  1. Allow storage adapters to filter results based on tags
  2. Modify the the training processes to add tags based on the:
    a. Categories for data in the ChatterBot corpus
    b. File names in the Ubuntu Dialog corpus

@gunthercox gunthercox merged commit 0668809 into master Aug 8, 2017
@gunthercox gunthercox deleted the tags branch August 8, 2017 04:17
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant