Add test cases with hashtags containing Japanese Ditto mark and Turkish 'i'.
1 parent 69069e5 commit 6b7a34a1d25d5f9cd8117b1d6923d26cec55ba8c keita committed Dec 13, 2011
Showing with 6 additions and 2 deletions.
  1. +6 −2 extract.yml
@@ -639,13 +639,17 @@ tests:
expected: ["日本語ハッシュタグ"]
- description: "Hashtag with ideographic iteration mark"
- text: "#云々 #学問のすゝめ #いすゞ #各〻"
- expected: ["云々", "学問のすゝめ", "いすゞ", "各〻"]
+ text: "#云々 #学問のすゝめ #いすゞ #各〻 #〃"
+ expected: ["云々", "学問のすゝめ", "いすゞ", "各〻", "〃"]
- description: "Hashtags with ş (U+015F)"
text: "Here’s a test tweet for you: #Ateş #qrşt #ştu #ş"
expected: ["Ateş", "qrşt", "ştu", "ş"]
+ - description: "Hashtags with İ (U+0130) and ı (U+0131)"
+ text: "Here’s a test tweet for you: #İn #ın"
+ expected: ["İn", "ın"]
+
- description: "Hashtag before punctuations"
text: "#hashtag: #hashtag; #hashtag, #hashtag. #hashtag! #hashtag?"
expected: ["hashtag", "hashtag", "hashtag", "hashtag", "hashtag", "hashtag"]
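
The two new cases can be exercised with a minimal sketch like the one below. It is not the extraction logic these conformance tests target: `HASHTAG`, `extract_hashtags`, `CASES`, and `run_cases` are hypothetical names, and the pattern is a deliberately simplified stand-in that assumes a hashtag is `#` followed by word characters. It does illustrate why the ditto mark needs a dedicated case: U+3003 is classified as punctuation in Unicode, so a plain `\w` class would not match it, whereas İ (U+0130) and ı (U+0131) are ordinary letters.

```python
import re

# Simplified, hypothetical hashtag pattern: '#' followed by word characters.
# U+3003 (ditto mark) is punctuation, not a letter, so \w alone would skip it;
# it is therefore listed explicitly in the character class.
HASHTAG = re.compile(r"#([\w\u3003]+)")


def extract_hashtags(text):
    """Return the hashtag bodies (without '#') found in text."""
    return HASHTAG.findall(text)


# Inputs and expected outputs copied from the test cases added in this commit.
CASES = [
    (
        "#云々 #学問のすゝめ #いすゞ #各〻 #〃",
        ["云々", "学問のすゝめ", "いすゞ", "各〻", "〃"],
    ),
    (
        "Here’s a test tweet for you: #İn #ın",
        ["İn", "ın"],
    ),
]


def run_cases():
    """Return one boolean per case: True when extraction matches expected."""
    return [extract_hashtags(text) == expected for text, expected in CASES]
```

Dropping `\u3003` from the character class makes the first case fail on its last hashtag, which is the behavior the added `#〃` test is meant to catch.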
