railParse

This is an alternative to regex with the goal being human readability / comprehension instead of limiting the number of characters typed.

Status of This Project

Unfinished and On Hold. This project is not finished and is currently not being further developed at this time. Work on this project should resume at a later date.

Matching Rules

Matching rules may be combined to create more complex rules Matching rules have multiple functions

parse(stringToParse, startingPoints=set([0]))
returns a set of ending points that match the rule when starting from each starting point
match(stringToMatch)
matches(stringToMatch) returns True if the entire string matches the rule
exact(stringToMatch)
exactMatch(stringToMatch)
exactlyMatches(stringToMatch)
returns True if the entire string matches the rule and no other substrings starting at zero match.
toRegex(outputType="Regex")
Converts the parserule to a regex.Pattern or String

Comparisons

==
True if the self.parse(stringToParse) will always yield the same result as the rule as other.parse(stringToParse).
!=
Opposite of ==.
<
True if other.parse(stringToParse) will always yield atleast every result that self.parse(stringToParse) yields, but there is at least one stringToParse that self.parse(stringToParse) will not yield all the results as other.parse(stringToParse)

Same as < except "self" and "other" are flipped

<=
S< or ==
=
S> or ==

Operators

+
+=
creates a new Sequence() of the original and the ruleOrString
&
&=
creates a new And() of the original and the ruleOrString
|
|=
creates a new Or() of the original and the ruleOrString

Example

# .match() or .matches returns True / False based on if the entire
# string matches the rules
Sequence("abc", "d", Or(" ", "e"), "f").match("abcdef") #True
Sequence("abc", "d", Or(" ", "e"), "f").match("abcd f") #True
Sequence("abc", "d", Or(" ", "e"), "f").match("abcd!f") #False
Sequence("abc", "d", Or(" ", "e"), "f").match("abcd")   #False

# .parse() returns a set of all possible ending points that match
# the string when starting from the beginning
Sequence("abc", "d", Or(" ", "e"), "f").parse("abcdef") #{6}        #"abcdef" can be found once in "abcdef"
Sequence("abc", "d", Or(" ", "e"), "f").parse("abc")    #set()      #Empty set, neither "abcdef" nor "abcd f"
                                                                    #can be found in "abc"
Or("a", "ab").parse("ab")                               #{1, 2}     #both "a" and "ab" can be found in "ab"

Rules

Once(rule)
One(rule)
Used internally. It checks if a string matches the rule.
Chain(*rules)
Sequence(*rules)
A set of rules that have to all be matched in order.
And(*rules)
A set of rules that all have to match in order for characters to be added.
Or(*rules)
Choice(*rules)
A set of alternate rules where atleast one choice has to match.
Optional(rule)
ZeroOrOne(rule)
A rule that matches 0 - 1 time.
OneOrMore(rule)
A rule that matches 1+ times.
ZeroOrMore(rule)
A rule that matches 0+ times.
Next(rule)
LookAhead(rule)
"Positive Look Ahead", a rule that checks the next characters in the string to determine if the current ending character should still be an ending character.
NotNext(rule) - "Negative Look Ahead"
Previous(rule)
LookBehind(rule)
"Positive Look Behind" - starts at every point prior to the starting points and checks that from one atleast one of those points the starting point can be reached by following the rule.
Min(rule)
Lazy(rule)
matches only the first (smallest) match
Max(rule)
Greedy(rule)
matches only the last (largest) match
Not - TODO (INCOMPLETE)

Find TODO (INCOMPLETE)

FindStart
FindEnd

Special (Predefined) Cases

nl
nL
newline
newLine
matches a single newline character
wsc
wSC
whiteSpaceChar
whitespacechar
matches a single whitespacecharacter character
lower
lowercase
matches a single lowercase character
alpha
matches a single alphabetical character
alnum
alphaNum
alphanum
matches a single alphanumeric character
digit
num
matches a single numeric character (digit)
wildChar
wildchar
wildCard
wildcard
wildCardChar
wildcardchar
matches any single character
exclude(charsToExlcude)

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
LICENSE		LICENSE
README.md		README.md
railParse.py		railParse.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

railParse

Status of This Project

Matching Rules

Comparisons

Operators

Example

Rules

Find TODO (INCOMPLETE)

Special (Predefined) Cases

About

Releases

Packages

Languages

License

allenretz/railParse

Folders and files

Latest commit

History

Repository files navigation

railParse

Status of This Project

Matching Rules

Comparisons

Operators

Example

Rules

Find TODO (INCOMPLETE)

Special (Predefined) Cases

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages