GitHub - rookielzy/learn-regex: Learn regex the easy way

What is Regular Expression? 什么是正则表达式？

Regular expression is a group of characters or symbols which is used to find a specific pattern from a text.

正则表达式是一组用来从一段文字中匹配一个特定模式的字符或符号。

A regular expression is a pattern that is matched against a subject string from left to right. The word "Regular expression" is a mouthful, you will usually find the term abbreviated as "regex" or "regexp". Regular expression is used for replacing a text within a string, validating form, extract a substring from a string based upon a pattern match, and so much more.

正则表达式是从左至右匹配主体字符串的一种模式。"Regular expression"是正则表达式的全程，很多时候，你会经常看到其缩写 "regex" 或 "regexp"。正则表达式用于从一个字符串中替换一段文字，表单验证，根据指定的匹配模式从字符串中提取出一段子字符串等等。

Imagine you are writing an application and you want to set the rules when user choosing their username. We want the username can contains letter, number, underscore and hyphen. We also want to limit the number of characters in username so it does not look ugly. We use the following regular expression to validate a username:

假设你正在开发一个应用，你想要用户在设置用户名的时候遵循特定的规则。我们希望用户名包含字母，数字，下划线和连接符，同时还希望限制用户名的长度使其看起来更加美观；我们就可以使用以下正则表达式来验证用户名:

Above regular expression can accept the strings john_doe, jo-hn_doe and john12_as. It does not match Jo because that string contains uppercase letter and also it is too short.

上述正则表达式可以匹配如下字符串：john_doe, jo-hn_doe 和 john12_as，但是它不匹配 Jo，因为字符串长度太短，并且含有大写字母。

1. Basic Matchers 普通匹配

A regular expression is just a pattern of letters and digits that we use to perform search in a text. For example, the regular expression cat means: the letter c, followed by the letter a, followed by the letter t.

用于在文本中搜寻字母和数字的一种匹配模式。例如，正则表达式 cat 表示为：字母 c 紧跟着字母 a 在紧跟着字母 t。

"cat" => The cat sat on the mat

Name		Name	Last commit message	Last commit date
Latest commit History 53 Commits
LICENSE		LICENSE
README.md		README.md

Meta character	Description
.	Period matches any single character except a line break.
[ ]	Character class. Matches any character contained between the square brackets.
[^ ]	Negated character class. Matches any character that is not contained between the square brackets
*	Matches 0 or more repetitions of the preceding symbol.
+	Matches 1 or more repetitions of the preceding symbol.
?	Makes the preceding symbol optional.
{n,m}	Braces. Matches at least "n" but not more than "m" repetitions of the preceding symbol.
(xyz)	Character group. Matches the characters xyz in that exact order.
\|	Alternation. Matches either the characters before or the characters after the symbol.
\	Escapes the next character. This allows you to match reserved characters `[ ] ( ) { } . * + ? ^ $ \ \|`
^	Matches the beginning of the input.
$	Matches the end of the input.

元字符	具体描述
.	匹配除了换行符以外的所有字符。
[ ]	字符类。匹配任何包含在方括号内的所有字符。
[^ ]	反字符类。匹配任何不包括在方括号内的所有字符。
*	匹配 * 前的字符0次或者多次。
+	匹配 + 前的字符至少1次或者多次。
?	? 前的字符为匹配可选条件。
{n,m}	大括号，匹配 {} 前的字符至少 n 次但不超过 m 次。
(xyz)	字符组，按照括号内的字符顺序来匹配。
\|	或，匹配 \| 前的字符或其后的字符。
\	转义符，允许你匹配类似一下字符 `[ ] ( ) { } . * + ? ^ $ \ \|`
^	匹配的开头。
$	匹配的结尾。

Shorthand	Description
.	Any character except new line
\w	Matches alphanumeric characters: `[a-zA-Z0-9_]`
\W	Matches non-alphanumeric characters: `[^\w]`
\d	Matches digit: `[0-9]`
\D	Matches non-digit: `[^\d]`
\s	Matches whitespace character: `[\t\n\f\r\p{Z}]`
\S	Matches non-whitespace character: `[^\s]`

Symbol	Description
?=	Positive Lookahead
?!	Negative Lookahead
?<=	Positive Lookbehind
?<!	Negative Lookbehind

Flag	Description
i	Case insensitive: Sets matching to be case-insensitive.
g	Global Search: Search for a pattern throughout the input string.
m	Multiline: Anchor meta character works on each line.

License

rookielzy/learn-regex

Folders and files

Latest commit

History

LICENSE

LICENSE

README.md

README.md

Repository files navigation

What is Regular Expression? 什么是正则表达式？

Table of Contents 目录

1. Basic Matchers 普通匹配

2. Meta Characters 元字符

2.1 Full stop 结尾符

2.2 Character set 字符集

2.2.1 Negated character set 反字符集

2.3 Repetitions 重复

2.3.1 The Star 星号

2.3.2 The Plus 加号

2.3.3 The Question Mark 问号

2.4 Braces 大括号（花括号）

2.5 Character Group 字符组

2.6 Alternation 或

2.7 Escaping special character 转义特殊字符

2.8 Anchors 锚

2.8.1 Caret 插入符

2.8.2 Dollar 美元符号

3. Shorthand Character Sets

4. Lookaround

4.1 Positive Lookahead

4.2 Negative Lookahead

4.3 Positive Lookbehind

4.4 Negative Lookbehind

5. Flags

5.1 Case Insensitive

5.2 Global search

5.3 Multiline

Bonus

Contribution

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Packages