Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Write a topic evaluation. #4

Closed
summerlight opened this issue Mar 18, 2016 · 2 comments
Closed

Write a topic evaluation. #4

summerlight opened this issue Mar 18, 2016 · 2 comments

Comments

@summerlight
Copy link
Owner

At the last meeting, several selected project topics are assigned to each member. The evaluation text is supposed to answer the below questions:

  1. What is the use of this application? Any research already done on this?
  2. Is there any dataset available?
  3. What are the brief steps and procedure that might be needed to achieve the application?

This set of questions is basically a gist of the most important part of the corresponding wiki page. So it is good to think about those detailed questions while writing a topic evaluation.

@Samualkrish
Copy link
Collaborator

As per my conversation with the TA. The project that we have selected are good but there are certain things we need to elaborate more.

Listing down the order in which the TA and Professor need the abstract.

  1. Explain the problem in a very good detail
  • what is the current issue you are addressing
  • what will be the outcome if your solution works
  • why is the problem different in case there is an existing solution
  • which part of NLP application will this fall into e.x. text classification, machine translation, dialogue systems
  1. What is the source language other than English, target language (optional) based on your project.
  2. How is your corpora implemented?
    • Are you implementing a manual way to train your vocabulary?
    • Are you using WordNet or any other known vocabulary?
  3. How is your data set defined?
    • What is your source for your data set?
    • How is data set modeled?
      • How will your system understand the data set.
      • Example of data set model : ʾaʿadda pos=V,mood=SBJV,gen=FEM,per=1,voice=PASS,num=PL nuʿadda
      • pos - V => Verb
      • mood=SBJV => Subjective
      • gen=FEM => Gender-Feminine
      • per=1 => 1st Person
      • voice=PASS
      • num=PL => Plural
    • Will your data set be sufficient to handle all the vocabulary or will there be some words which might not be there? What will you do in that case?
  4. How is the model going to be implemented?
    • What are the features which you would be using ex. words or phrases or sentences.
    • What are the classes (a rough estimate is fine) for classification?
    • What are the morphology, phonemes, stems and nltk packages you would use?
    • What are the steps involved during the phase, detail your phases at each step?
      • Like Part of Speech tagging, what would be the method you will use
      • What stemmer would you use for other language.
      • What are the components in your model
  5. What is your evaluation metric?
    • Define a baseline (a minimum accepted solution)
      • Example a literal word to word translation can be used as base.
    • How would you define the upper bound
      • An best solution with good accuracy and F1 score.
      • If your output is nearly equal to a human translated text then your program has passed
      • If your program runs at the baseline, then you will lose points in the project.
  6. Related Research and work
    • What are the works that are done about your project?
    • Links to the components that your are going to use?
    • Research papers on how to train your data if available?

Update your topic with these information, and we will post it on blackboard

@summerlight
Copy link
Owner Author

Because we don't have much time before the deadline, we're gonna write a proposal.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants