A highly experimental code editor where you edit code, not text.
Cheeky plug: I will be streaming some of the development of Sapling over on my YouTube channel on 2pm EST on Saturdays. Smash the subscribe button to not miss upcoming streams!
Most of the ideas for this project come from my friend Shtanton's blog post. The concept of directly editing syntax trees is called 'structured editing' and is not new; the purpose of Sapling is to use ideas from structured editing to speed up moment-to-moment code editing, much how editors like Vim and Emacs speed up editing. Sapling's editing model will be largely inspired by Vim/NeoVim and kakoune. Sapling also aims to be general purpose - Sapling should be able to edit any language, given that a suitable grammar is provided.
Contributions of all kinds are very welcome!
It is worth noting that Sapling is primarily an experiment to determine whether or not such an editor could work. Therefore, for the time being, Sapling can be expected to change at any time. Hopefully the design of Sapling will converge over time - its current state is similar to how pre-1.0 Rust was continually evolving and making potentially-breaking changes so that post-1.0 Rust could be as useful as possible.
- But Why?
- Goals of Sapling
- Inspirations
- Quick Start
- Pros of AST-based Editing
- 'Extra Fun Challenges'
- What's an AST?
When writing code with any text editor, you are usually only interested in a tiny subset of all the possible strings of text - those that correspond to valid programs in whatever language you're writing. In a text editor, you will spend the overwhelming majority of your time with the text in your editor being invalid as you make edits to move between valid programs. This is inefficient for the programmer, and causes lots of issues for software like Language Servers which have to cope as best they can with these invalid states.
To be fair, editors like Vim, Emacs and Kakoune do better than most by providing shortcuts to do
common text manipulations, which is a step in the right direction. Interestingly, though, the most
useful of these shortcuts are those correspond to modifications of the syntax tree (e.g. ci)
to
remove the replace the contents of ()
in Vim), and so it seems logical to apply modal editing to
directly modifying the syntax trees of programs.
Sapling takes the idea of keystrokes primarily modifying text, but instead applies those keystrokes as actions to the syntax tree of your program. I have no idea if this will be useful, but it seems worth a try.
These goals are roughly in order of importance, with the most important first:
- Editing Speed: Sapling should be an editor that allows power users to edit code as close to their thinking speed as possible. Flattening the learning curve is also important, but Sapling is not trying to be an editor for every single developer and is designed primarily with power users in mind.
- Stability: Sapling should not, under any circumstances, corrupt the user's data or crash. Either of these are considered critical bugs and should be reported.
- Generality: Sapling should, in theory, be able to edit any language. This will likely be done with making the language plug-in-able and probably specified by some kind of grammar.
- Familiarity: Sapling should feel familiar to people who are used to modal editors such as Vim and Kakoune. However, some alterations are required for Sapling to edit ASTs and not just text.
- Interactivity: Sapling should always give the user immediate feedback about their actions. Kakoune is a model example of this, and Vim/NeoVim does pretty well too.
- Performance: The user should not have to wait for Sapling to do anything. Sapling should also have a small resource footprint - an editor should not have to use several hundred megabytes of RAM when idling.
- Vim, NeoVim and Kakoune: 'Modal' editors where keystrokes can correspond to actions on the text rather than always inserting directly to the text buffer. Shoutout in particular to Kakoune for its beautiful multi-selection based editing model.
- Tree Sitter: A generic, flexible, error-handling parser that is not language specific. Designed primarily to provide better syntax highlighting for the Atom text editor.
- grasp: A regex-like language for searching JavaScript ASTs.
- r/nosyntax: A subreddit for strutured and projectional editors. They also have a list of such projects.
- Barista: A structured editor that allows the user to fall back on text editing if required, which is something I'd like to explore for Sapling. The source code is here, but since this was a research project it seems to be unmaintained.
Sapling is not yet on crates.io and is very much still in early development, but if you want to play around with Sapling as it currently stands, the best way is to clone the repository and build from source (you'll need Rust installed in order to do this):
git clone https://github.com/kneasle/sapling.git
cargo run 2> log
Note that Sapling will not compile for Windows. Windows support is absolutely intended, but Sapling
currently uses tuikit as a terminal abstraction, which does
not yet have Windows support. PRs to Sapling or tuikit
to add support for Windows would be very
much appreciated.
q
: Quit Saplingu
: Undo a changeR
: Redo a change
h
/k
: Move the cursor to the previous sibling of the current nodej
/l
: Move the cursor to the next sibling of the current nodec
: Move the cursor to the first child of the current node (if it exists)p
: Move the cursor to the parent of the node it's currently at
r*
: Replace the node under the cursor with the node represented by the key*
x
: Delete the node under the cursoro*
: Insert a new node represented by*
as a child of the cursora*
/i*
: Insert a new node represented by*
before or after the cursor respectively
As with Vim, all commands can be repeated by inserting a count before them. For example, 3u
will
undo 3 steps in one go.
Sapling can currently only edit JSON with the following keys: [a]rray
, [o]bject
, [t]rue
,
[f]alse
, [n]ull
, [s]tring
. There is currently no way to insert text into a string or to open
and close files (yet!).
Sapling handle multiple nodes in one go by adding a count before the node name, for example i3t
will insert 3 true
s before the cursor.
- Because the editor already knows the syntactic structure of your program, the following are
much easier to implement for every language supported by Sapling:
- Syntax highlighting
- Code folding
- Auto-formatting of code (in fact, this is nearly automatic and elegantly preserving code formatting is hard)
- It will hopefully be FAST to edit code
- It might actually be more intuitive than text-based editing
Because the editor has to hold a valid program, the following things that other editors take for granted are hard to implement:
-
Searching a file - because only syntax tree nodes can be selected, we need a way to concisely search for nodes in a tree. grasp seems like it'd be good inspiration for this.
-
Just opening a file - opening a syntactically correct file is essentially the same as writing a compiler-esque parser for every language you want to load (not an easy task but there's plenty of literature/libraries already existing for this). The real issue is that Sapling has to at least attempt to open any file, regardless of syntactic correctness, and this essentially boils down to building an error-correcting parser that's generic enough to parse any language.
Tree Sitter has already had a good crack at this problem, but Tree Sitter is geared towards providing accurate syntax highlighting and has a few missing features that Sapling needs:
- Sapling needs comments to be preserved when parsing (but whitespace is perhaps not so essential)
- Sapling needs to be able to render ASTs back to text, which I don't think Tree Sitter's grammars can handle
For the sake of pragmatism, I think we should initially write a wrapper around tree-sitter for parsing/reading files so that Sapling at least works whilst we decide if a custom grammar is required (and if it is, how it should work).
AST stands for 'Abstract Syntax Tree', and in essence it is a tree-like representation of only the structure of a program, without any details about formatting.
For example, the following Rust code:
fn foo(y: u64, z: u32) {
let x = y * 3 + z as u64;
combine(x, y);
}
would correspond to a syntax tree something like the following (simplified for demonstration purposes). Notice how each 'element' of the code corresponds to one 'node' in the syntax tree: