Added documentation for AFINN, returned objects and tokenization. #124

rishpandey · 2017-10-10T12:38:44Z

No description provided.

thisandagain

Looks great! A few really minor comments.

thisandagain · 2017-10-11T12:17:36Z

README.md

@@ -79,7 +79,62 @@ Yelp:    0.69 (+2%)
 ```

 ---
+### How it works
+#### AFINN 
+AFINN is a list of words rated for valence with an integer between minus five (negative) and plus five (positive). Sentiment analysis is performed by cross-checking the string tokens( words, emojis) with the AFINN list and getting their respective scores. The comparative score is simply: sum of each token / number of tokens. So for example let's take the following:


Couple minor nitpicks here:

Fix spacing / formatting in parenthetical "tokens (e.g. words and emojis)"

Format "sum of each token / number of tokens" as code by wrapping it in backticks.
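
For reference while revising this paragraph, the calculation it describes could be sketched roughly as follows. This is only an illustration, not the library's implementation: `comparativeScore` and `sampleLexicon` are hypothetical names, and the two lexicon entries are simply the values quoted elsewhere in this PR's README text.

```javascript
// Hypothetical miniature lexicon; the values are the ones quoted in this PR's README text.
const sampleLexicon = { love: 3, allergic: -2 };

// Look up each token and divide the summed scores by the token count.
// Tokens that are not in the lexicon contribute 0.
function comparativeScore(tokens, lexicon) {
  if (tokens.length === 0) return 0; // avoid dividing by zero
  let sum = 0;
  for (const token of tokens) {
    // hasOwnProperty guards against inherited keys such as "constructor".
    if (Object.prototype.hasOwnProperty.call(lexicon, token)) {
      sum += lexicon[token];
    }
  }
  return sum / tokens.length;
}

console.log(comparativeScore(['love', 'cats'], sampleLexicon)); // 1.5
```

Returning 0 for an empty token list is a choice made for this sketch; the README text does not say how that case is handled.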

thisandagain · 2017-10-11T12:19:20Z

README.md

+(5 * 200) / 200 = 5
+
+#### Tokenization
+Tokenization works by splitting the lines of input string, then removing the special characters and finally splitting it using spaces. This is used to get list of words in the string. 


Another really minor nitpick:

Please use an Oxford comma for "characters, and finally".
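
It may also help readers to see the three steps from that paragraph side by side. The sketch below is an assumption-laden illustration rather than the contents of `tokenize.js`: the function name, the exact punctuation set, the lowercasing step, and the filtering of empty tokens are all choices made here for the example.

```javascript
// Sketch only: the punctuation set, lowercasing, and empty-token filter are assumptions.
function tokenize(input) {
  return input
    .toLowerCase()
    // Step 1: join the lines of the input by turning newlines into spaces.
    .replace(/\r?\n/g, ' ')
    // Step 2: remove special characters (emojis and letters are left alone).
    .replace(/[.,\/#!$%^&*;:{}=_`"~()?]/g, '')
    // Step 3: split on runs of whitespace and drop any empty tokens.
    .split(/\s+/)
    .filter((token) => token.length > 0);
}

console.log(tokenize('I love cats,\nbut I am allergic to them.'));
```

For that input the sketch yields nine tokens, starting with `i`, `love`, `cats`.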

thisandagain · 2017-10-11T12:23:09Z

README.md

@@ -79,7 +79,62 @@ Yelp:    0.69 (+2%)
 ```

 ---
+### How it works


I think this whole section should probably be before "Benchmarks". Also, please match the spacing between the H3 and the divider line as shown in the other sections:

    ... lorem ipsum dolor sit amet.

    ---

    ### Some title
    Lorem ipsum dolor sit amet...

thisandagain · 2017-10-11T12:23:55Z

README.md

+
+This approach leaves you with a mid-point of 0 and the upper and lower bounds are constrained to positive and negative 5 respectively (the same as each token! 😸). For example, let's imagine an incredibly "positive" string with 200 tokens and where each token has an AFINN score of 5. Our resulting comparative score would look like this:
+
+(max positive score * number of tokens) / number of tokens


Please wrap this and the line below it in a code block
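
As a side note on the bounds claimed in this paragraph: they follow from the fact that an average can never leave the range of the values being averaged. A short derivation, using notation introduced here rather than in the README ($s_i$ is the AFINN score of token $i$, and $n$ is the number of tokens):

```latex
-5 \le s_i \le 5 \ \text{for all } i
\quad\Longrightarrow\quad
-5 = \frac{n \cdot (-5)}{n}
\;\le\; \frac{1}{n}\sum_{i=1}^{n} s_i
\;\le\; \frac{n \cdot 5}{n} = 5
```

The 200-token example in the diff is the case where every $s_i = 5$, which reaches the upper bound exactly.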

thisandagain · 2017-10-11T12:24:13Z

README.md

+    * __Negative__: List of negative words in input string that were found in AFINN list.
+
+In this case, love has a value of 3, allergic has a value of -2, and the remaining tokens are neutral with a value of 0. Because the string has 9 tokens the resulting comparative score looks like:
+(3 + -2) / 9 = 0.111111111


Please wrap this line in a code block
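
If it is useful for the returned-objects part of the section, the worked example could be tied to the result fields along these lines. This is a hedged sketch: `analyze`, the field names other than the "Negative" list mentioned in the diff, and the nine-token input are assumptions made for illustration; only the values for `love` and `allergic` come from the README text.

```javascript
// Hypothetical two-entry lexicon using the values quoted in the README text.
const lexicon = { love: 3, allergic: -2 };

// Build a result object with the total score, the comparative score,
// and the lists of positive and negative words that were found.
function analyze(tokens, lexicon) {
  const result = { score: 0, comparative: 0, positive: [], negative: [] };
  for (const token of tokens) {
    // Tokens that are not in the lexicon are neutral and contribute 0.
    if (!Object.prototype.hasOwnProperty.call(lexicon, token)) continue;
    const value = lexicon[token];
    result.score += value;
    if (value > 0) result.positive.push(token);
    if (value < 0) result.negative.push(token);
  }
  result.comparative = tokens.length > 0 ? result.score / tokens.length : 0;
  return result;
}

// Assumed nine-token input containing "love" and "allergic".
const tokens = ['i', 'love', 'cats', 'but', 'i', 'am', 'allergic', 'to', 'them'];
const result = analyze(tokens, lexicon);
console.log(result.score, result.comparative.toFixed(9)); // 1 0.111111111
console.log(result.positive, result.negative);
```

With these inputs the score is `3 + -2 = 1` and the comparative score is `1 / 9`, matching the line under review.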

rishpandey · 2017-10-15T12:13:46Z

I fixed these in #125. Please take a look.

rishpandey added 6 commits September 20, 2017 15:32

Added newline replace for Tokenizer

60ac760

fixed indentation

33af9d3

Update tokenize.js

1d4ea35

CI fail fix

Update tokenize.js

ae59676

Indent fix

Update tokenize.js

1803eb9

Update README.md

de18f87

Added how it works; AFINN, tokenization and returned objects info.

rishpandey mentioned this pull request Oct 11, 2017

Improve documentation #112

Closed

3 tasks

thisandagain requested changes Oct 11, 2017

thisandagain assigned rishpandey Oct 11, 2017

thisandagain added the pr - needs work label Oct 11, 2017

thisandagain reviewed Oct 11, 2017

rishpandey closed this Oct 15, 2017
