Org Mode parser re-write in Rust
! Warning, this project is in early development !
Org is probably the best and most complete plain text organizational system known to mankind. It has countless applications like authoring, publishing, task and time tracking, journal, blog, agenda, wiki etc...
Unfortunately Org was originally developed for Emacs and therefore available only inside Emacs. It is a huge limiting factor for Org's development and popularization. Because of that it is not as popular outside of Emacs community as it should be.
Many attempts were made to fix this. It all starts with a parser. But because Org's syntax is not trivial and in fact most of it is context-sensitive with only a few context-free elements, it is quite easy to get it wrong. Some Org parsers have chosen to focus on a restricted subset of Org's syntax like org-ruby or pandoc. More ambitious projects try to cover all features but since Org does not have a formal specification1 they rely on observed Org's behavior in Emacs or author's intuition. As a result they rarely get finished.
But the absence of a good a spec and the complexity of the grammar are not show stoppers. Why reinventing the wheel when we can just copy it! This project takes the only surefire way to get it right - use the original elisp parser implementation as a blueprint!
- Be fast. Speed matters. Period.
- Be feature-complete, compliant with original implementation. Nobody wants Nth competing standard.
- Be standalone, embeddable and reusable. User must not be locked into one particular editor or ecosystem. Integrations with language servers, editors and plugins should be encouraged.
These are the choices that were made to achieve the goals:
Rust. Because it is fast, memory safe and provides C FFI. And most importantly it is cool.
Original elisp algorithm. While using the original elisp source as a guideline might result in less idiomatic Rust code it has its advantages:
Scope of work is well-defined and finish line is visible. This should encourage contributions even from people who want to get started with Rust.
Getting "feature-complete" is just a matter of getting to the finish line.
element - parser crate is currently the main and only focus. It should perform just 2 tasks. Generate concrete syntax tree and serialize it back to canonical Org representation.
The rest of the roadmap is not fully flashed out. Feature-complete parser opens a lot of possibilities, here are just a few of my ideas:
Parse tree manipulation tools (like exporting to other formats).
Language server - a way to solve "the matrix" problem. Enabling other editors to have their own org-mode would be a logical next step.
CLI tools. I'd love to get integration with TaskWarrior and maybe even use Org as TaskWarrior's DOM.
Any contributions are welcome. If you want to help check out contribution guide.
Got a question? Stop by for a chat at gitter
More about Org Mode
- Org-Mode Is One of the Most Reasonable Markup Languages to Use for Text
- Awesome guide about org-mode
- Some attempts were made to formalize the syntax. This project uses them as supplementary materials. See contribution guide for details.