`xre` is to `sam` as `grep` is to `ed`

xre exists to bring the awesome power of Rob Pike's Structural Regular Expressions beyond the reach of the sam editor (approriately/coincidentally/ironically it is implemented in Go, yielding more Rob Pike reference).

WARNING: It is still in a primordial / experimental phase, but works well as a proof of concept.

What?

A short comparison to the grep/ed model:

a new x/re/ command extracts structure matched by a regular expression
... x[ x{ x( and x< extract a balanced pair of braces
a new y/re/ command extracts structure delimited by a regular expression
... y"delim" extracts structure between occurrences of a static delimiter, e.g. y"\n" for classic UNIX line-orientation
... y/start/end/ extracts structure between two regular expressions
... y[ y{ y( and y< extract content within a balanced pair of braces
the g/re/ command filters the current buffer (as extracted by x or y) if the given pattern matches
the v/re/ command filters the current buffer (as extracted by x or y) if the given pattern doesn't matches
the p command prints
... p"delim" prints with a delimiter, e.g. p"\n" to return to the warm embrace of classic UNIX tools
... p%"format" prints with a format pattern, e.g. p"%q\n" is particularly useful while developing an xre program

Why?

Loosely quoting from Structural Regular Expressions:

...if the interesting quantum of information isn’t a line, most of the (UNIX) tools don’t help, or at best do poorly

Example: counting Go heap allocations

For example, it is sometimes useful to deal with things like paragraphs (bytes that are delimited by a blank line, i.e. "\n\n"). For maximal self reference, such a data set can be had from your nearest Go program form either its /debug/pprof/heap?debug=1 endpoint, or by calling pprof.Lookup("heap").WriteTo(f, 1) yourself.

For example, the following xre program extracts just the allocation bytes from heap allocations involving a call to bytes.makeSlice (i.e. when a bytes.Buffer needs to grow):

xre 'y"\n\n" v/bytes.makeSlice/ y"\n" v/^#|^$/ x[x/^\d: (\d+)/ p"\n"'

Breaking down the above command

extract paragraphs (buffers defined delimited by blank lines)
keep only the paragraphs that mention "bytes.makeSlice"
extract lines within those paragraphs
and keep only the lines that aren't blank and don't start with a "#"
on those lines, extract the contents of the first balanced [ ] pair
and then extract the "MMM" in a "NNN: MMM" match within it
finally, print those numbers delimited by new lines (the classic UNIX paradigm)

As always, summing a stream of numbers is left as an exercise to the reader.

Name		Name	Last commit message	Last commit date
Latest commit History 145 Commits
cmd		cmd
internal/cmdutil		internal/cmdutil
.gitignore		.gitignore
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
balance.go		balance.go
balance_test.go		balance_test.go
between.go		between.go
between_test.go		between_test.go
command.go		command.go
command_test.go		command_test.go
environment.go		environment.go
environment_test.go		environment_test.go
extract.go		extract.go
extract_test.go		extract_test.go
filter.go		filter.go
filter_test.go		filter_test.go
go.mod		go.mod
go.sum		go.sum
integration_test.go		integration_test.go
join.go		join.go
join_test.go		join_test.go
matcher.go		matcher.go
matcher_test.go		matcher_test.go
output.go		output.go
output_test.go		output_test.go
predicate.go		predicate.go
processor.go		processor.go
readbuf.go		readbuf.go
split.go		split.go
split_test.go		split_test.go

License

jcorbin/xre

Folders and files

Latest commit

History

Repository files navigation

xre is to sam as grep is to ed

What?

Why?

Example: counting Go heap allocations

About

Resources

License

Stars

Watchers

Forks

Languages

`xre` is to `sam` as `grep` is to `ed`