feat: implement the first version of parser #5

tisonkun · 2025-01-06T03:59:29Z

This refers to #4.

Signed-off-by: tison <wander4096@gmail.com>

tisonkun · 2025-01-06T04:04:09Z

spath/src/parser/token.rs

Hello @epage! I tried to use winnow to parse JSONPath expressions with a custom token kind produced by Logos, but it was really hard to understand how to finish the integration.

How can I bridge the Span/Located functions so that we can report the failure position accurately?

My Token type carries a span, while equality only compares the kind, and possibly the text() result. The demo on this page assumes Token is an owned struct without the source/span that Logos tokens typically have.
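
To make the setup concrete, here is a minimal sketch of such a token type. The names `TokenKind`, `source`, and `span`, and the three example kinds, are assumptions for illustration and not the actual definitions in this PR.

```rust
use std::ops::Range;

/// Hypothetical token kinds for a JSONPath-like lexer (illustrative only).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum TokenKind {
    Dollar,
    Dot,
    Identifier,
}

/// A borrowed token: it keeps the whole source plus the byte span it covers.
#[derive(Debug, Clone)]
pub struct Token<'a> {
    pub source: &'a str,
    pub kind: TokenKind,
    pub span: Range<usize>,
}

impl<'a> Token<'a> {
    /// The slice of the source covered by this token.
    pub fn text(&self) -> &'a str {
        &self.source[self.span.clone()]
    }
}

/// Equality deliberately ignores `source` and `span`: two tokens are equal
/// when their kinds match, wherever they appear in the input.
impl PartialEq for Token<'_> {
    fn eq(&self, other: &Self) -> bool {
        self.kind == other.kind
    }
}
```

With this shape, two `Identifier` tokens at different positions compare equal even though `text()` returns different slices, which is why a matcher that checks the text needs a separate comparison.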

I'm fine with the handmade parser for now, and this is not a request. I'm just sharing a use case where I failed to use winnow even though I think it may be a fit. If you have some time, maybe you could try to work out an integration demo for winnow + logos. It could help with winnow's goal of supporting custom token types.

FWIW, I once worked out a solution to integrate logos with nom. Something like:

```rust
use std::mem::size_of;
use std::ops::{Range, RangeFrom, RangeFull, RangeTo};

use nom::{InputLength, Slice};

// `Token`, `TokenKind`, `Error`, `ErrorKind`, and `IResult` are assumed to be
// defined elsewhere in the crate (not shown here).

/// Parser input: a borrowed slice of tokens produced by the lexer.
#[derive(Debug, Clone, Copy)]
pub struct Input<'a> {
    pub tokens: &'a [Token<'a>],
}

impl<'a> std::ops::Deref for Input<'a> {
    type Target = [Token<'a>];

    fn deref(&self) -> &Self::Target {
        self.tokens
    }
}

impl nom::InputLength for Input<'_> {
    fn input_len(&self) -> usize {
        self.tokens.input_len()
    }
}

impl nom::Offset for Input<'_> {
    // Offset measured in tokens, derived from the slice start addresses.
    fn offset(&self, second: &Self) -> usize {
        let fst = self.tokens.as_ptr();
        let snd = second.tokens.as_ptr();
        (snd as usize - fst as usize) / size_of::<Token>()
    }
}

impl nom::Slice<Range<usize>> for Input<'_> {
    fn slice(&self, range: Range<usize>) -> Self {
        Input {
            tokens: &self.tokens[range],
        }
    }
}

impl nom::Slice<RangeTo<usize>> for Input<'_> {
    fn slice(&self, range: RangeTo<usize>) -> Self {
        Input {
            tokens: &self.tokens[range],
        }
    }
}

impl nom::Slice<RangeFrom<usize>> for Input<'_> {
    fn slice(&self, range: RangeFrom<usize>) -> Self {
        Input {
            tokens: &self.tokens[range],
        }
    }
}

impl nom::Slice<RangeFull> for Input<'_> {
    fn slice(&self, _: RangeFull) -> Self {
        *self
    }
}

/// Match the next token by its text.
pub fn match_text(text: &'static str) -> impl FnMut(Input) -> IResult<&Token> {
    move |i| match i.tokens.first().filter(|token| token.text() == text) {
        Some(token) => Ok((i.slice(1..), token)),
        _ => Err(nom::Err::Error(Error::from_error_kind(
            i,
            ErrorKind::ExpectText(text),
        ))),
    }
}

/// Match the next token by its kind.
pub fn match_token(kind: TokenKind) -> impl FnMut(Input) -> IResult<&Token> {
    move |i| match i.tokens.first().filter(|token| token.kind == kind) {
        Some(token) => Ok((i.slice(1..), token)),
        _ => Err(nom::Err::Error(Error::from_error_kind(
            i,
            ErrorKind::ExpectToken(kind),
        ))),
    }
}
```

The rest follows nom-rule.
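
As a rough illustration of how a token-slice input can still report an accurate failure position, the sketch below uses only the standard library rather than nom or winnow. The `ParseError` type, the `source_len` parameter, and the token layout are hypothetical and not taken from this PR; the idea is simply that the span of the first unconsumed token (or the end of the source) marks where matching failed.

```rust
use std::ops::Range;

/// Hypothetical token kinds (illustrative only).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum TokenKind {
    Dollar,
    Dot,
    Identifier,
}

/// Hypothetical borrowed token with a byte span into `source`.
#[derive(Debug, Clone)]
pub struct Token<'a> {
    pub source: &'a str,
    pub kind: TokenKind,
    pub span: Range<usize>,
}

/// Hypothetical error: what was expected and the byte span where it failed.
#[derive(Debug, PartialEq, Eq)]
pub struct ParseError {
    pub expected: TokenKind,
    pub span: Range<usize>,
}

/// Consume one token of `kind`, returning the remaining input and the token.
/// On failure the error points at the offending token, or at the end of the
/// source when the input is exhausted.
pub fn match_token<'t, 'a>(
    kind: TokenKind,
    input: &'t [Token<'a>],
    source_len: usize,
) -> Result<(&'t [Token<'a>], &'t Token<'a>), ParseError> {
    match input.split_first() {
        Some((token, rest)) if token.kind == kind => Ok((rest, token)),
        Some((token, _)) => Err(ParseError {
            expected: kind,
            span: token.span.clone(),
        }),
        None => Err(ParseError {
            expected: kind,
            span: source_len..source_len,
        }),
    }
}
```

Because each token already knows its span, no pointer arithmetic on the original string is needed to locate the error; the remaining slice is enough.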

epage · 2025-01-06

> How can I bridge the Span/Located functions so that we can report the failure position accurately?

This has had some discussion at winnow-rs/winnow#591 (comment) (and the discussion linked to from that). I've already merged the first part of the fix for the 0.7.0 release which I'm actively working on.

> My Token type carries a span, while equality only compares the kind, and possibly the text() result. The demo on this page assumes Token is an owned struct without the source/span that Logos tokens typically have.

This is discussed at winnow-rs/winnow#591 (comment), and I'll be working on this soon for 0.7.

I will note that your solution for Nom will work just as well with Winnow until those changes are made. I hadn't considered the need for match_text. I'll need to keep that in mind.

tisonkun added 6 commits January 6, 2025 09:25

feat: implement the first version of parser

517ddb3

Signed-off-by: tison <wander4096@gmail.com>

refactor json value converter

0025d6d

Signed-off-by: tison <wander4096@gmail.com>

clippy

4e51d77

Signed-off-by: tison <wander4096@gmail.com>

normalized path

bacdf46

Signed-off-by: tison <wander4096@gmail.com>

impl parser

42142cb

Signed-off-by: tison <wander4096@gmail.com>

handmade parser

c72ffe0

Signed-off-by: tison <wander4096@gmail.com>

tisonkun added 10 commits January 6, 2025 13:03

handmade more

1981181

Signed-off-by: tison <wander4096@gmail.com>

parse integer

2a34f65

Signed-off-by: tison <wander4096@gmail.com>

parse string

ecd3050

Signed-off-by: tison <wander4096@gmail.com>

slice

e73bc25

Signed-off-by: tison <wander4096@gmail.com>

parse rest

048276b

Signed-off-by: tison <wander4096@gmail.com>

Descendant

026f570

Signed-off-by: tison <wander4096@gmail.com>

Clippy

9632b92

Signed-off-by: tison <wander4096@gmail.com>

binder

b02500d

Signed-off-by: tison <wander4096@gmail.com>

fixu

6dbac7e

Signed-off-by: tison <wander4096@gmail.com>

simplest case

22bb3b8

Signed-off-by: tison <wander4096@gmail.com>

tisonkun merged commit 0e66b80 into main Jan 6, 2025
9 checks passed

tisonkun deleted the impl-parser branch January 6, 2025 07:29

tisonkun mentioned this pull request Jan 6, 2025

How to match all identifier that beyond ASCII? maciejhirsz/logos#460

Closed

epage mentioned this pull request Jan 6, 2025

Mark parsing higher order tokens easier. (perhaps implement Compare for &[T] where T:Compare ?) winnow-rs/winnow#637

Closed

2 tasks
