Merge remote-tracking branch 'keitaf/fix_russian_add_iteration' into update_bundle_2011_08_05

Conflicts:
	src/com/twitter/Regex.java
commit 89057a8a1ceaaffd9c72c81dd68e03f06897ec59 2 parents e47131e + c142635
@mzsanford mzsanford authored
Showing with 8 additions and 6 deletions.
  1. +8 −6 src/com/twitter/Regex.java
14 src/com/twitter/Regex.java
@@ -12,13 +12,15 @@
private static String LATIN_ACCENTS_CHARS = "\\u00c0-\\u00d6\\u00d8-\\u00f6\\u00f8-\\u00ff\\u015f";
private static final String HASHTAG_ALPHA_CHARS = "a-z" + LATIN_ACCENTS_CHARS +
- "\\u0400-\\u04ff\\u0500-\\u0527" + // Cyrillic
+ "\\u0400-\\u04ff\\u0500-\\u0527" + // Cyrillic
+ "\\u2de0-\\u2dff\\ua640-\\ua69f" + // Cyrillic Extended A/B
"\\u1100-\\u11ff\\u3130-\\u3185\\uA960-\\uA97F\\uAC00-\\uD7AF\\uD7B0-\\uD7FF" + // Hangul (Korean)
- "\\p{InHiragana}\\p{InKatakana}" + // Japanese Hiragana and Katakana
- "\\p{InCJKUnifiedIdeographs}\\u3005" + // Japanese Kanji / Chinese Han
- "\\uff21-\\uff3a\\uff41-\\uff5a" + // full width Alphabet
- "\\uff66-\\uff9f" + // half width Katakana
- "\\uffa1-\\uffdc"; // half width Hangul (Korean)
+ "\\p{InHiragana}\\p{InKatakana}" + // Japanese Hiragana and Katakana
+ "\\p{InCJKUnifiedIdeographs}" + // Japanese Kanji / Chinese Han
+ "\\u3005\\u303b" + // Kanji/Han iteration marks
+ "\\uff21-\\uff3a\\uff41-\\uff5a" + // full width Alphabet
+ "\\uff66-\\uff9f" + // half width Katakana
+ "\\uffa1-\\uffdc"; // half width Hangul (Korean)
private static final String HASHTAG_ALPHA_NUMERIC_CHARS = "0-9\\uff10-\\uff19_" + HASHTAG_ALPHA_CHARS;
private static final String HASHTAG_ALPHA = "[" + HASHTAG_ALPHA_CHARS +"]";
private static final String HASHTAG_ALPHA_NUMERIC = "[" + HASHTAG_ALPHA_NUMERIC_CHARS +"]";
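
The effect of the added ranges can be checked with a small standalone sketch. It assumes a trimmed-down character set rather than the full HASHTAG_ALPHA_CHARS, and the class name HashtagAlphaSketch and the helper isAlphaRun are hypothetical names used only for illustration; they are not part of Regex.java. Ranges are written with the ASCII hyphen, since the regex engine reads any other dash inside a character class as a literal character.

```java
import java.util.regex.Pattern;

// Illustrative sketch only: a reduced version of the character class built in
// Regex.java, used to check the Cyrillic Extended ranges and the iteration marks.
final class HashtagAlphaSketch {
    // Assumption: a subset of the real HASHTAG_ALPHA_CHARS, kept short for the example.
    private static final String ALPHA_CHARS = "a-z" +
        "\\u0400-\\u04ff\\u0500-\\u0527" +   // Cyrillic
        "\\u2de0-\\u2dff\\ua640-\\ua69f" +   // Cyrillic Extended A/B
        "\\p{InCJKUnifiedIdeographs}" +      // Japanese Kanji / Chinese Han
        "\\u3005\\u303b";                    // Kanji/Han iteration marks

    // One or more characters from the class; a-z is matched case-insensitively.
    private static final Pattern ALPHA_RUN =
        Pattern.compile("[" + ALPHA_CHARS + "]+", Pattern.CASE_INSENSITIVE);

    // Returns true when the whole string consists of characters in the class.
    static boolean isAlphaRun(String s) {
        return ALPHA_RUN.matcher(s).matches();
    }

    public static void main(String[] args) {
        // U+4EBA followed by the iteration mark U+3005
        System.out.println(isAlphaRun("\u4eba\u3005"));
        // A code point inside the Cyrillic Extended-B range
        System.out.println(isAlphaRun("\ua650"));
        // Digits are not part of the alpha-only class
        System.out.println(isAlphaRun("abc1"));
    }
}
```

Digits are rejected here because the sketch mirrors only the alpha class; in the diff they are added separately through HASHTAG_ALPHA_NUMERIC_CHARS.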