Permalink
Cannot retrieve contributors at this time
Fetching contributors…
| <!DOCTYPE html> | |
| <html> | |
| <head> | |
| <meta http-eqiv="content-type" content="text/html;charset=utf-8"> | |
| <title>Go by Example: Regular Expressions</title> | |
| <link rel=stylesheet href="site.css"> | |
| </head> | |
| <script type="text/javascript"> | |
| if (window.location.host == "gobyexample.com") { | |
| var _gaq = _gaq || []; | |
| _gaq.push(['_setAccount', 'UA-34996217-1']); | |
| _gaq.push(['_trackPageview']); | |
| (function() { | |
| var ga = document.createElement('script'); ga.type = 'text/javascript'; ga.async = true; | |
| ga.src = ('https:' == document.location.protocol ? 'https://ssl' : 'http://www') + '.google-analytics.com/ga.js'; | |
| var s = document.getElementsByTagName('script')[0]; s.parentNode.insertBefore(ga, s); | |
| })(); | |
| } | |
| </script> | |
| <body> | |
| <div class="example" id="regular-expressions"> | |
| <h2><a href="./">Go by Example</a>: Regular Expressions</h2> | |
| <table> | |
| <tr> | |
| <td class="docs"> | |
| <p>Go offers built-in support for <a href="http://en.wikipedia.org/wiki/Regular_expression">regular expressions</a>. | |
| Here are some examples of common regexp-related tasks | |
| in Go.</p> | |
| </td> | |
| <td class="code empty leading"> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre><span class="kn">package</span> <span class="nx">main</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre><span class="kn">import</span> <span class="s">"bytes"</span> | |
| <span class="kn">import</span> <span class="s">"fmt"</span> | |
| <span class="kn">import</span> <span class="s">"regexp"</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre><span class="kd">func</span> <span class="nx">main</span><span class="p">()</span> <span class="p">{</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>This tests whether a pattern matches a string.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">match</span><span class="p">,</span> <span class="nx">_</span> <span class="o">:=</span> <span class="nx">regexp</span><span class="p">.</span><span class="nx">MatchString</span><span class="p">(</span><span class="s">"p([a-z]+)ch"</span><span class="p">,</span> <span class="s">"peach"</span><span class="p">)</span> | |
| <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">match</span><span class="p">)</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>Above we used a string pattern directly, but for | |
| other regexp tasks you’ll need to <code>Compile</code> an | |
| optimized <code>Regexp</code> struct.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">r</span><span class="p">,</span> <span class="nx">_</span> <span class="o">:=</span> <span class="nx">regexp</span><span class="p">.</span><span class="nx">Compile</span><span class="p">(</span><span class="s">"p([a-z]+)ch"</span><span class="p">)</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>Many methods are available on these structs. Here’s | |
| a match test like we saw earlier.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">.</span><span class="nx">MatchString</span><span class="p">(</span><span class="s">"peach"</span><span class="p">))</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>This finds the match for the regexp.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">.</span><span class="nx">FindString</span><span class="p">(</span><span class="s">"peach punch"</span><span class="p">))</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>The also finds the first match but returns the | |
| start and end indexes for the match instead of the | |
| matching text.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">.</span><span class="nx">FindStringIndex</span><span class="p">(</span><span class="s">"peach punch"</span><span class="p">))</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>The <code>Submatch</code> variants include information about | |
| both the whole-pattern matches and the submatches | |
| within those matches. For example this will return | |
| information for both <code>p([a-z]+)ch</code> and <code>([a-z]+)</code>.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">.</span><span class="nx">FindStringSubmatch</span><span class="p">(</span><span class="s">"peach punch"</span><span class="p">))</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>Similarly this will return information about the | |
| indexes of matches and submatches.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">.</span><span class="nx">FindStringSubmatchIndex</span><span class="p">(</span><span class="s">"peach punch"</span><span class="p">))</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>The <code>All</code> variants of these functions apply to all | |
| matches in the input, not just the first. For | |
| example to find all matches for a regexp.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">.</span><span class="nx">FindAllString</span><span class="p">(</span><span class="s">"peach punch pinch"</span><span class="p">,</span> <span class="o">-</span><span class="mi">1</span><span class="p">))</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>These <code>All</code> variants are available for the other | |
| functions we saw above as well.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">.</span><span class="nx">FindAllStringSubmatchIndex</span><span class="p">(</span> | |
| <span class="s">"peach punch pinch"</span><span class="p">,</span> <span class="o">-</span><span class="mi">1</span><span class="p">))</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>Providing a non-negative integer as the second | |
| argument to these functions will limit the number | |
| of matches.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">.</span><span class="nx">FindAllString</span><span class="p">(</span><span class="s">"peach punch pinch"</span><span class="p">,</span> <span class="mi">2</span><span class="p">))</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>Our examples above had string arguments and used | |
| names like <code>MatchString</code>. We can also provide | |
| <code>[]byte</code> arguments and drop <code>String</code> from the | |
| function name.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">.</span><span class="nx">Match</span><span class="p">([]</span><span class="nb">byte</span><span class="p">(</span><span class="s">"peach"</span><span class="p">)))</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>When creating constants with regular expressions | |
| you can use the <code>MustCompile</code> variation of | |
| <code>Compile</code>. A plain <code>Compile</code> won’t work for | |
| constants because it has 2 return values.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">r</span> <span class="p">=</span> <span class="nx">regexp</span><span class="p">.</span><span class="nx">MustCompile</span><span class="p">(</span><span class="s">"p([a-z]+)ch"</span><span class="p">)</span> | |
| <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">)</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>The <code>regexp</code> package can also be used to replace | |
| subsets of strings with other values.</p> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre> <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nx">r</span><span class="p">.</span><span class="nx">ReplaceAllString</span><span class="p">(</span><span class="s">"a peach"</span><span class="p">,</span> <span class="s">"<fruit>"</span><span class="p">))</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>The <code>Func</code> variant allows you to transform matched | |
| text with a given function.</p> | |
| </td> | |
| <td class="code"> | |
| <div class="highlight"><pre> <span class="nx">in</span> <span class="o">:=</span> <span class="p">[]</span><span class="nb">byte</span><span class="p">(</span><span class="s">"a peach"</span><span class="p">)</span> | |
| <span class="nx">out</span> <span class="o">:=</span> <span class="nx">r</span><span class="p">.</span><span class="nx">ReplaceAllFunc</span><span class="p">(</span><span class="nx">in</span><span class="p">,</span> <span class="nx">bytes</span><span class="p">.</span><span class="nx">ToUpper</span><span class="p">)</span> | |
| <span class="nx">fmt</span><span class="p">.</span><span class="nx">Println</span><span class="p">(</span><span class="nb">string</span><span class="p">(</span><span class="nx">out</span><span class="p">))</span> | |
| <span class="p">}</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| </table> | |
| <table> | |
| <tr> | |
| <td class="docs"> | |
| </td> | |
| <td class="code leading"> | |
| <div class="highlight"><pre><span class="gp">$</span> go run regular-expressions.go | |
| <span class="go">true</span> | |
| <span class="go">true</span> | |
| <span class="go">peach</span> | |
| <span class="go">[0 5]</span> | |
| <span class="go">[peach ea]</span> | |
| <span class="go">[0 5 1 3]</span> | |
| <span class="go">[peach punch pinch]</span> | |
| <span class="go">[[0 5 1 3] [6 11 7 9] [12 17 13 15]]</span> | |
| <span class="go">[peach punch]</span> | |
| <span class="go">true</span> | |
| <span class="go">p([a-z]+)ch</span> | |
| <span class="go">a <fruit></span> | |
| <span class="go">a PEACH</span> | |
| </pre></div> | |
| </td> | |
| </tr> | |
| <tr> | |
| <td class="docs"> | |
| <p>For a complete reference on Go regular expressions check | |
| the <a href="http://golang.org/pkg/regexp/"><code>regexp</code></a> package docs.</p> | |
| </td> | |
| <td class="code empty"> | |
| </td> | |
| </tr> | |
| </table> | |
| <p class="next"> | |
| Next example: <a href="json">JSON</a>. | |
| </p> | |
| <p class="footer"> | |
| <a href="https://twitter.com/gobyexample">@gobyexample</a> | <a href="mailto:mmcgrana@gmail.com">feedback</a> | <a href="https://github.com/mmcgrana/gobyexample/blob/master/examples/regular-expressions">source</a> | <a href="https://github.com/mmcgrana/gobyexample#license">license</a> | |
| </p> | |
| </div> | |
| </body> | |
| </html> |