Clean Up and Update Single Exchange Dialogs #96

Closed
NirantK opened this issue Jan 9, 2018 · 16 comments

Comments

@NirantK
Collaborator

NirantK commented Jan 9, 2018

The Single Exchange Dialogs section is ambiguous, too broad and out of date. Here is how you can help us improve this:

  1. Remove links that you think do not fit in the section. Don't worry about damaging this repository; we can discuss it on the PR you raise.
  2. Consider adding 2-3 code examples and datasets.
  3. Consider adding 2-3 examples from slot filling (sequence mining for text is welcome too) and other approaches in chatbots.
@anu0012
Contributor

anu0012 commented Jan 16, 2018

Hey @NirantK, for the code examples, can I add how to use CountVectorizer and TfidfVectorizer? And how pre-trained vectors can be used in NLP?

@NirantK
Collaborator Author

NirantK commented Jan 16, 2018

Hey @anu0012, if the vectorization methods are specific to dialogs and chatbots, feel free to add them to that section. I guess they are not.

If you are asking whether you can add them in general, consider adding them to the tutorial section, if they are not already covered.

@anu0012
Contributor

anu0012 commented Jan 16, 2018

https://github.com/anu0012/Predict_the_happiness_challenge/blob/master/notebook.ipynb

In this notebook, I used the TF-IDF vectorizer along with several concepts such as text cleaning, lemmatization, and stemming. Can I add this?

@NirantK
Collaborator Author

NirantK commented Jan 16, 2018

No, @anu0012, that does not meet our requirements just yet. Please refer to the tutorials section to get a sense of the quality needed to be included here.

I am sure you can polish it to make it awesome and help the community in the process!

@the-ethan-hunt
Collaborator

@NirantK, I first checked the links here.

  1. The RNNLM toolkit link is broken.
  2. The other papers have working links.

New stuff worth adding:

  • SPMF, a Java library for pattern mining
  • A sequential pattern mining tutorial and a 'hands-on' thingy
  • This code repo is a dual LSTM encoder for dialog response generation from the Ubuntu corpus.

Anything I am missing out or mistaking for something?

@NirantK
Collaborator Author

NirantK commented Jan 17, 2018 via email

@the-ethan-hunt
Collaborator

I understand. 😄 I mistook it for something. No problem, I will open up the needed issue and look more into the DialogCI and Ubuntu corpus thing.

@anu0012
Contributor

anu0012 commented Jan 17, 2018

https://www.tidytextmining.com

I think this can be added to the reading section. What do you think, @NirantK?

@NirantK
Collaborator Author

NirantK commented Jan 18, 2018

@anu0012 good find. Since this is an entire book and not a one-off tutorial, let's create a new section, Books, under tutorials and add it there.

This becomes our excuse to make some progress on #5 as well.

@NirantK
Collaborator Author

NirantK commented Jan 18, 2018

Thank you @the-ethan-hunt.

I have fixed the broken link and closed #105.

As a quick note, Dialogflow is a tool for building Human-Computer Interaction (HCI) systems. In layman's terms, it is a tool for making chatbots.

@NirantK
Collaborator Author

NirantK commented Feb 4, 2018

Hey @the-ethan-hunt, do consider continuing to contribute to awesome-nlp. Take a look at this issue if you'd like :)

@the-ethan-hunt
Collaborator

Sure @NirantK ! But is there any other issue I might possibly work on? 😅

@NirantK
Collaborator Author

NirantK commented Feb 4, 2018

Sure @the-ethan-hunt.

Thanks for adding Korean from #98, but we did not make enough progress on Chinese, Japanese, or any European languages for that matter. It'd be awesome if we could take that issue to its due conclusion.

It saves a lot of time for the community to have all of the best tools for a particular language in one place.

@NirantK
Collaborator Author

NirantK commented Feb 14, 2018

Hey @anu0012, are you still interested in working on this? We would really appreciate a hand here :)

@anu0012
Contributor

anu0012 commented Feb 14, 2018

Sure @NirantK. For the second point you mentioned, what type of code examples and datasets can be added?

@NirantK
Collaborator Author

NirantK commented Feb 15, 2018

@anu0012 Chatbots, virtual assistants, and any other popular form of conversational interface are a good starting point.

E.g. there is some work on chatbots from both Microsoft and Facebook; check what datasets they've used and whether we can mention them here. Similarly, there is some work on intent detection etc.; maybe check whether that is relevant?

If, at the end of all this searching, we are still unsatisfied with the quality and breadth of coverage, maybe we can merge this section with Conversational Q&A, which has similar technical challenges imho. I'd mostly be going by your (and the community's) recommendations and findings on this.

@NirantK NirantK closed this as completed Sep 28, 2018
3 participants