rbook

the why

Staying in flow; for speed of exploration, research, and development. For fast, in-flow data analysis, nothing beats the power trio of emacs, ESS, and R. Together they run circles around the competition.
Ease of iteration on graphics. With graphics in the browser rather than in X11 or VNC, we have a snapshot at each step in development, and we can see every graph we ever made. Each step is clear and repeatable. An rbook can hold hundreds of PNG images, far more than R/X11 can keep open at once.
Recovery from memory exhaustion and crashes. By recording exactly what we did, even if we went off script and down a deep rabbit hole, we can reconstruct our findings. We can readily continue our analysis after an R crash: we just keep appending to the rbook, creating a seamless log.
Replication and documentation. Priority can be critical in a research setting. A microsecond-precision timestamp is recorded for every line of R executed. By committing the single binary rbook file to git, research progress over years can be precisely reconstructed if need be.

the what

Project rbook provides R notebooks, affectionately known as rbooks.

The server binary itself is called simply rbook and is used as a drop-in replacement for R inside emacs. Of course it can be run outside of emacs on the command line too.

During an interactive R session, the rbook binary also serves a live web view of the session to one or more web browsers.

As in a Jupyter notebook, rbook plots are saved and shown inline with the R code.

R output is logged and free-form comments can be appended to the log.

Since all graphics, comments, code, and output are logged, rbooks form a simple, compact, and append-only digital lab notebook for R. Each command is timestamped internally. These timestamps can be displayed with the rbook -dumpts option.

detail

rbook is written in Go for use with R. It is designed for use with emacs and ESS.

flags quick reference

$ rbook -h
Usage of rbook:

  -display string
      X11 display number (example: -display :99) on which to
      display our X11 plots. Defaults to :10 but can be the string
      'xvfb' (without quotes) if you want to start a new Xvfb based
      display to run on; however this can conflict with other Xvfb
      client programs (for unknown reasons) and so is not
      recommended. Use 'png' to just save directly to png files,
      skipping x11/windowing.
  -dump
      write script version of the -path binary book to
      standard out, then exit.
  -dumpts
      like -dump but print the timestamp beside each line,
      showing when it was entered.
  -help
      show this help given rbook -h
  -host string
      host/ip to server on (optional)
  -path string
      path to the .rbook file to read and append to. this
      is also the default command line argument, so -path
      can be omitted in front of the path (default is
      my.rbook.hostname in the current dir)
  -port int
      port to serve index.html for images/R updates on (optional;
      if -port is taken or 0, defaults to the first free port
      at or above 8888)
  -rhome string
      value of R_HOME to start R with. This directory should have
      contents: bin  COPYING  etc  lib  library  modules  site-library  SVN-REVISION
      (default "/usr/lib/R")
  -v	show rbook version and exit
  -version
      show rbook version and exit
  -viewonly
      for viewing .png in this directory; skip starting
      R session.
  -wall string
      path or symlink to wallpaper to set on the Xvfb/x11vnc
      (default "/home/jaten/.wallpaper")

old notes, mostly historical interest, showing the design path

goal:

We want to enable saving and recording of an R session, including plots, commands, and command output to the rbook. The rbook is displayed in a web browser and updated as the user's R session progresses. Should be usable under an ESS/emacs environment. Thus our work sessions can survive R running out of memory or crashing. We can easily repeat our work, and visualize the same sequence of plots and analysis.

approach:

To capture graphs, we run under X11 or Xvfb and use the R savePlot() call. This happens automatically for plot() and hist() calls, and other graphics can be saved on demand. The user calls sv() at the R prompt to save the current graph to the browser; or svv() in the middle of code.

Interactive graph development is followed in a web browser. x11vnc can also be used, of course, as we are writing to an X11 environment. https://www.realvnc.com/en/ is a free VNC viewer. There are multiple free alternatives.

approach to show history in the browser:

All top level commands are captured by using R's addTaskCallback mechanism. Our C code then deparses each command, passes it to Go, and Go conveys it to the listening browser over a websocket.

approach to showing command output (prints, etc):

To capture the output of commands, we use the R sink() facility. Like plots, printed command output is automatically written to the browser. The same facility captures the last value seen at the R top level.

Push communication with listening browsers:

The rbook program provides a web server to serve the rbook file to browsers. It uses websockets to push updates to web browsers as each new line is entered or printed, or when there is a new plot to display.
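
The push model described above is essentially a fan-out: each new command, output line, or plot is sent to every connected viewer. Below is a minimal Go sketch of that pattern using only channels. The Hub and Update types are hypothetical, and the channels stand in for websocket connections; the actual websocket handling in rbook is not shown here.

```go
package main

import (
	"fmt"
	"sync"
)

// Update is a hypothetical message pushed to every viewer.
type Update struct {
	Kind string // "command", "output", or "plot"
	Body string // the text, or the path of a png
}

// Hub fans each update out to all subscribed viewers. In a real server,
// each channel would be drained by a goroutine writing to one browser's
// websocket connection.
type Hub struct {
	mu   sync.Mutex
	subs map[chan Update]struct{}
}

func NewHub() *Hub { return &Hub{subs: make(map[chan Update]struct{})} }

// Subscribe registers a new viewer and returns its buffered channel.
func (h *Hub) Subscribe() chan Update {
	ch := make(chan Update, 16)
	h.mu.Lock()
	h.subs[ch] = struct{}{}
	h.mu.Unlock()
	return ch
}

// Unsubscribe removes a viewer and closes its channel.
func (h *Hub) Unsubscribe(ch chan Update) {
	h.mu.Lock()
	if _, ok := h.subs[ch]; ok {
		delete(h.subs, ch)
		close(ch)
	}
	h.mu.Unlock()
}

// Broadcast offers u to every viewer without blocking and returns how many
// accepted it. A viewer whose buffer is full misses the update.
func (h *Hub) Broadcast(u Update) int {
	h.mu.Lock()
	defer h.mu.Unlock()
	n := 0
	for ch := range h.subs {
		select {
		case ch <- u:
			n++
		default: // slow viewer; skip rather than stall the R session
		}
	}
	return n
}

func main() {
	hub := NewHub()
	a, b := hub.Subscribe(), hub.Subscribe()
	hub.Broadcast(Update{Kind: "command", Body: "hist(x)"})
	fmt.Println((<-a).Body, (<-b).Body)
}
```

The non-blocking send is a design choice made for this sketch: a stalled browser should not hold up the interactive session, and because the history is persistent, a viewer that falls behind can reload it.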

Comments from the prompt into the book

Comments are created by having R evaluate a string literal that starts with the hash symbol # or the semicolon ;.

For example, at the rbook prompt[1]:

> "# start a comment line
+ that can span multiple lines
+ and is finished by ending the string literal"

is then rendered in the browser with a beige background and ### in front of each line.

[1] In the above example, the user does not type the '+' signs. They are automatically added by the R REPL after the user presses enter in the middle of a string literal, to indicate that a multi-line string is being typed.

The example above could equally have been written with single quotes, since those also delimit string literals in R. Single quotes may be easier to type, since they typically do not require the shift key. The example below also uses the semicolon form, which likewise avoids the shift key.

> '; start a comment line
+ that can span multiple lines
+ and is finished by ending the string literal'

In either case, the output is the same:

### start a comment line
### that can span multiple lines
### and is finished by ending the string literal

R's evaluation engine simply ignores string literals at the command prompt. They are legal values, but are not assigned to anything and so change no state. The rbook callbacks notice these special string literals and display them nicely as left-aligned ### blocks, as is the convention for comments in some places such as ESS.
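
The recognition and rendering step described above can be sketched in Go as follows. The function renderComment is a hypothetical helper written for this example: it takes the value of a top-level string literal, decides whether it is a comment (starts with # or ;), and produces the ### lines. How rbook's callbacks actually implement this is not shown in this README.

```go
package main

import (
	"fmt"
	"strings"
)

// renderComment reports whether s, the value of a top-level string
// literal, is an rbook-style comment (it starts with '#' or ';').
// If so, it returns the comment's lines, each prefixed with "### ".
// Hypothetical helper for illustration only.
func renderComment(s string) ([]string, bool) {
	if !strings.HasPrefix(s, "#") && !strings.HasPrefix(s, ";") {
		return nil, false // an ordinary string value, not a comment
	}
	body := strings.TrimLeft(s, "#;") // drop the leading marker character(s)
	var out []string
	for _, line := range strings.Split(body, "\n") {
		out = append(out, "### "+strings.TrimSpace(line))
	}
	return out, true
}

func main() {
	lit := "; start a comment line\nthat can span multiple lines\nand is finished by ending the string literal"
	if lines, ok := renderComment(lit); ok {
		fmt.Println(strings.Join(lines, "\n"))
	}
}
```

Running this prints each line of the comment with ### in front, for both the # and the ; form of the marker.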

From the ESS manual [2][3]:

Comments are also handled specially by ESS, using an idea borrowed from the Emacs-Lisp indentation style. By default, comments beginning with ‘###’ are aligned to the beginning of the line. Comments beginning with ‘##’ are aligned to the current level of indentation for the block containing the comment. Finally, comments beginning with ‘#’ are aligned to a column on the right (the 40th column by default, but this value is controlled by the variable comment-column,) or just after the expression on the line containing the comment if it extends beyond the indentation column. You turn off the default behavior by adding the line (setq ess-indent-with-fancy-comments nil) to your .emacs file.

references

[2] http://ess.r-project.org/Manual/ess.html#Indenting

[3] https://stackoverflow.com/questions/780796/emacs-ess-mode-tabbing-for-comment-region

finished sub-tasks

[x] done. make history (the notebook) persistent so that browsers can reload history; even after R or the browser has been restarted.

[x] have browser get the BookID and CreateTm from the server.

[x] done: keeping the browser state in sync

[x] mechanism to add comments into the stream.

[x] done. pick the next highest unused websocket port, embed into index.html before sending. to avoid collisions with multiple rbooks running at once.

[x] done. add configuration command line options for setting options/ the name of the rbook file to save into.

[x] done. a parallel script/text version of the session is also written for easy/quick review; without needing to open the browser. And if you have only the binary, rbook -dump will regenerate the text form.

[x] done: Automate the startup of the Xvfb, the window manager, and x11vnc server. Rbook should start them if they are not already running.

[x] Ctrl-c will interrupt R code. (The interrupt happens as soon as interpreted R code gets to run. If C or Fortran code is running without a callback, we have to wait for it to finish.)

[ ] authentication: deferred/todo. There is no HTTPS or auth at the moment. Since this was developed for personal use, a private, personal VPN sufficed.

login.go contains a simple cookie-based login example that would need to be made persistent to disk with greenpack or other means. But we'll defer login until needed.

installation

System: developed under Linux. Also works under macOS. Windows support has not yet been implemented.

Preparation: (Xvfb and x11vnc are no longer the defaults, but they are supported, so we build against them.)

apt install xvfb x11vnc icewm

This installs the dependencies. Any X window manager can be used. icewm seems nice, but there's nothing special about it.

The Go toolchain must already be installed, so that make can call go.

git clone https://github.com/glycerine/embedr
cd embedr
make

cd ..
git clone https://github.com/glycerine/rbook
cd rbook
make

Running make should build the rbook binary.

For emacs configuration -- how to get ESS to run rbook instead of default R:

(defun rbook ()
  (interactive)
  (let ((inferior-R-program-name "~/go/bin/rbook"))
    (R)))

To set inferior-R-program-name manually:

Ctrl-h v inferior-R-program-name -> position cursor over the customize link and press enter.

The emacs variable setting screen is shown:

inferior-R-program-name is a variable defined in ‘ess-custom.el’.
Its value is "/home/jaten/go/bin/rbook"
Original value was "R"

Documentation:
Program name for invoking an inferior ESS with M-x R.

You can customize this variable.

howto - notes on figuring out what worked.

xvfb-run R

By running Xvfb, we can be sure that there is always a local X environment to write to. This can also be viewed in real time with x11vnc:

Xvfb :99 -screen 0 3000x2000x16 &
export DISPLAY=:99
icewm &
feh --bg-scale ~/pexels-ian-turnell-709552.jpg
x11vnc -display :99 -forever -nopw -quiet -xkb

Now a vnc client connecting to port 5900 will show the xvfb frame buffer.

Update: we avoid Xvfb by default now, and just use the (real) X server already running on the default display :10 for everything. This is much more reliable.

earlier notes

In R

savePlot() # writes current plot to Rplot.png or filename=

We can invoke savePlot() to assign a filename, and then send the filename to rbook.

Our rbook may then wish to copy the file version of plots for safe keeping into the archive. Update: yes, it copies them into a directory, my.rbook.plots, in the current directory (assuming my.rbook is the file name).

Since plots are sometimes built up interactively, rbook waits until the plot is ready. The user then gives the final sv() command (which rbook adds to R) to tell rbook to consolidate into just the finished plot.

how to get the code snippets

We embedded R in our Go program 'rbook', so R and Go are all in one process. This avoids the problems of having two processes. In particular, having one crash while the other stays up is a pain, and normally requires extra monitoring and retries. A single-process solution is robust.

We have embedr already working and it provides an API for embedding R and executing arbitrary R code from Go. It was based on my earlier rmq proof of concept (which is public); https://github.com/glycerine/rmq

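package main

import (
   "github.com/glycerine/embedr"
)

func main() {
   embedr.InitR()              // start the embedded R interpreter
   defer embedr.EndR()         // shut R down when main returns
   embedr.EvalR("x <- 1 + 1")  // any R code can go here
}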

R_ReplDLLinit() and embedr.ReplDLLdo1() were the key to getting a nice REPL experience with R loaded as a DLL.

https://cran.r-project.org/doc/manuals/R-exts.html#index-Rf_005finitEmbeddedR

https://rstudio.github.io/r-manuals/r-exts/Linking-GUIs-and-other-front-ends-to-R.html

more notes

max.deparse.length limits how much of each expression is echoed. It is an option to source(), and there is a way to raise it:

https://stackoverflow.com/questions/54872060/what-does-truncated-mean-in-the-tinn-r-console/55292384#55292384

https://emacs.stackexchange.com/questions/69220/ess-turn-off-truncated-in-ess-r-session

quoting a comment there:

It looks like I want to set max.deparse.length = echo() rather than an integer. The relevant code seems to be in: ess/etc/ESSR/R/.basic.R: Specifically,

.ess.eval <- function(string, visibly = TRUE, output = FALSE,
                      max.deparse.length = 300,
                      file = tempfile("ESS"), local = NULL) { ...

and 

.ess.source <- function(file, visibly = TRUE, output = FALSE,
                        max.deparse.length = 300, local = NULL,
                        fake.source = FALSE, keep.source = TRUE,
                        message.prefix = "") { ...

mikemtnbikes
2022 Oct 25

author

License: MIT
