Skip to content
This repository has been archived by the owner on Sep 18, 2021. It is now read-only.

Commit

Permalink
Add test cases for hashtag with Zero-Width Non-Joiner
Browse files Browse the repository at this point in the history
  • Loading branch information
keita committed Feb 23, 2012
1 parent def9095 commit df24c10
Showing 1 changed file with 6 additions and 2 deletions.
8 changes: 6 additions & 2 deletions extract.yml
Expand Up @@ -709,14 +709,18 @@ tests:
text: "#سیاست #ایران #السياسة #السياح #لغات #اتمی #کنفرانس #العربية #الجزيرة #فارسی"
expected: ["سیاست", "ایران", "السياسة", "السياح", "لغات", "اتمی", "کنفرانس", "العربية", "الجزيرة", "فارسی"]

- description: "Extract Arabic hashtags with dash"
- description: "Extract Arabic hashtags with underscore"
text: "#برنامه_نویسی #رییس_جمهور #رئيس_الوزراء, #ثبت_نام. #لس_آنجلس"
expected: ["برنامه_نویسی", "رییس_جمهور", "رئيس_الوزراء", "ثبت_نام", "لس_آنجلس"]

- description: "Extract Thai hashtags with dash"
- description: "Extract Thai hashtags"
text: "#ผู้เริ่ม #การเมือง #รายละเอียด #นักท่องเที่ยว #ของขวัญ #สนามบิน #เดินทาง #ประธาน"
expected: ["ผู้เริ่ม", "การเมือง", "รายละเอียด", "นักท่องเที่ยว", "ของขวัญ", "สนามบิน", "เดินทาง", "ประธาน"]

- description: "Extract Arabic hashtags with Zero-Width Non-Joiner"
text: "#أي‌بي‌إم #می‌خواهم"
expected: ["أي‌بي‌إم", "می‌خواهم"]

hashtags_with_indices:
- description: "Extract a hastag at the start"
text: "#hashtag here"
Expand Down

0 comments on commit df24c10

Please sign in to comment.