bearded-bear

A parser combinator generator library for parsing ambiguous left-recursive grammars in polynomial time and space.

When finished, this library will read a grammar in some standardized form and output a combinatorial parser. I like combinatorial parsers because they are easy to understand and maintain. They aren't always the best choice, but this library helps make them a good choice for a wider variety of use cases.

Built to learn more about Python's set types, memoization, and combinatorial parsing.

Background

This library is based on "Parser Combinators for Ambiguous Left-Recursive Grammars" (Frost, Hafiz and Callaghan). The paper is available online here:

http://davinci.newcs.uwindsor.ca/~richard/PUBLICATIONS/PADL_08.pdf

Frost et al basically find that we can using memoization to get polynomial runtime and space efficiency for combinatorially parsing ambiguous left-recursive grammars. The authors note that Norvig found a way to achieve sub-exponential runtime for top-down parser combinators. They extend Norvig's work and find a way to do that for the grammars in question.

Motivation / options for parsing ambiguous left-recursive grammars

Why not just avoid left recursion by left-factoring the grammar? It turns out that left-factoring a left-recursive grammar makes the grammar generate different parse trees! If you're doing natural language processing, this can be problematic. See page 4 of the paper for more details.

Why not use a parsing expression grammar (PEG)? PEGs cannot handle ambiguous grammars - or, more precisely, they define away ambiguity by deciding which parse trees to use. The decision process can be implicit in PEG libraries, leading to confusion and even unexpected behavior for users of PEGs.

Questions I have

Is it really the case that we cannot factor out left recursion from some PEGs without "[complicating] the integration of semantic actions"? This Microsoft Research paper suggests it may be possible for many context-free natural languages grammars: http://acl.eldoc.ub.rug.nl/mirror/A/A00/A00-2033.pdf

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
src		src
tests/data		tests/data
.gitignore		.gitignore
README.md		README.md
bear.py		bear.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

bearded-bear

Background

Motivation / options for parsing ambiguous left-recursive grammars

Questions I have

About

Releases

Packages

Languages

zdexter/bearded-bear

Folders and files

Latest commit

History

Repository files navigation

bearded-bear

Background

Motivation / options for parsing ambiguous left-recursive grammars

Questions I have

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages