feat(napi/parser)!: add `range` option #11728

bacarybruno · 2025-06-15T15:49:27Z

Heyo team 👋!

I took a shot at #10307 to add loc and range fields. I followed this suggestion #10307 (comment) by making the fields configurable.

I needed to get this information from oxc-parser for different use cases, including writing an ESLint parser. If this gets merged, I would also like to add a tokens option (to be discussed).

The JSDoc comments are based on https://typescript-eslint.io/packages/typescript-estree/#parsecode-options

P.S: I'm new to Rust and this is my first contrib on this repo, so feel free to suggest improvements. That would be really appreciated!

graphite-app · 2025-06-15T15:49:32Z

How to use the Graphite Merge Queue

Add either label to this PR to merge it via the merge queue:

0-merge - adds this PR to the back of the merge queue
hotfix - for urgent hot fixes, skip the queue and merge this PR next

You must have a Graphite account in order to use the merge queue. Sign up using this link.

An organization admin has enabled the Graphite Merge Queue in this repository.

Please do not merge from GitHub as this will restart CI on PRs being processed by the merge queue.

codspeed-hq · 2025-06-16T13:07:33Z

CodSpeed Instrumentation Performance Report

Merging #11728 will not alter performance

Comparing bacarybruno:oxc-10307-ast-loc-range (b02d526) with main (d991fed)

Summary

✅ 38 untouched benchmarks

overlookmotel

Thanks very much for tackling this. Given that you say you're new to Rust, very impressive that you managed to figure out the codegen as well as everything else!

I've made a few comments below, but the larger problem is that, while the implementation for ranges is good, I don't think loc is going to work correctly.

`loc`

The problem is UTF-8 vs UTF-16 offsets. Rust works with UTF-8 strings, whereas JS works in UTF-16. So there's an AST pass that happens before serialization which converts all the offsets from UTF-8 to UTF-16:

oxc/napi/parser/src/lib.rs

Lines 91 to 92 in ea6ce9d

    let mut comments =
        convert_utf8_to_utf16(&source_text, &mut program, &mut module_record, &mut errors);

oxc/crates/oxc_napi/src/lib.rs

Lines 12 to 59 in ea6ce9d

    /// Convert spans to UTF-16
    pub fn convert_utf8_to_utf16(
        source_text: &str,
        program: &mut Program,
        module_record: &mut ModuleRecord,
        errors: &mut [OxcError],
    ) -> Vec<Comment> {
        let span_converter = Utf8ToUtf16::new(source_text);
        span_converter.convert_program(program);

        // Convert comments
        let mut offset_converter = span_converter.converter();
        let comments = program
            .comments
            .iter()
            .map(|comment| {
                let value = comment.content_span().source_text(source_text).to_string();
                let mut span = comment.span;
                if let Some(converter) = offset_converter.as_mut() {
                    converter.convert_span(&mut span);
                }
                Comment {
                    r#type: match comment.kind {
                        CommentKind::Line => String::from("Line"),
                        CommentKind::Block => String::from("Block"),
                    },
                    value,
                    start: span.start,
                    end: span.end,
                }
            })
            .collect::<Vec<_>>();

        // Convert spans in module record to UTF-16
        span_converter.convert_module_record(module_record);

        // Convert spans in errors to UTF-16
        if let Some(mut converter) = span_converter.converter() {
            for error in errors {
                for label in &mut error.labels {
                    converter.convert_offset(&mut label.start);
                    converter.convert_offset(&mut label.end);
                }
            }
        }

        comments
    }

During serialization, you're using Rope to convert offsets to line/column. But Rope expects UTF-8 offsets, and by this point we've already converted them to UTF-16.

If the whole source text is ASCII, then there's no difference, but I think you'll find loc is incorrect for source text like this:

[
"🍄🍄🍄",
123
]
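To make the mismatch concrete, here is a minimal sketch using only the Rust standard library. It does not use Rope or any oxc API; the helper names `utf8_to_utf16_offset` and `line_of_utf8_offset` are hypothetical, and the line lookup is only a stand-in for a lookup that expects UTF-8 offsets. It takes the source text above, computes the offset of `123` in both encodings, and shows what happens when the UTF-16 offset is fed to the UTF-8-based line lookup.

```rust
/// Convert a UTF-8 byte offset into a UTF-16 code unit offset.
/// `byte_offset` must lie on a char boundary of `source`.
/// (Hypothetical helper, for illustration only.)
fn utf8_to_utf16_offset(source: &str, byte_offset: usize) -> usize {
    source[..byte_offset].encode_utf16().count()
}

/// 1-based line number for a UTF-8 byte offset, found by counting the
/// `\n` bytes before it. Stand-in for a lookup that expects UTF-8 offsets.
fn line_of_utf8_offset(source: &str, byte_offset: usize) -> usize {
    source.as_bytes()[..byte_offset]
        .iter()
        .filter(|&&b| b == b'\n')
        .count()
        + 1
}

fn main() {
    // Each mushroom is 4 bytes in UTF-8 but 2 code units in UTF-16.
    let source = "[\n\"🍄🍄🍄\",\n123\n]";

    let utf8 = source.find("123").unwrap();
    let utf16 = utf8_to_utf16_offset(source, utf8);
    println!("utf8 offset = {utf8}, utf16 offset = {utf16}"); // 18 and 12

    // Correct: the UTF-8 offset points at line 3.
    println!("line from utf8 offset: {}", line_of_utf8_offset(source, utf8));

    // Wrong: the UTF-16 offset, misread as UTF-8, lands inside the string
    // literal on line 2.
    println!(
        "line from utf16 offset misread as utf8: {}",
        line_of_utf8_offset(source, utf16)
    );
}
```

With ASCII-only input the two offsets coincide, which is why the problem only shows up once the source contains characters outside the ASCII range.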

Solving this problem is not so easy (especially making it work and performant).

I propose:

Scale back the scope of this PR to only add ranges.
I'll write up on #10307 the steps I think we'd need to take to support loc too, and we can discuss.

Custom serializers

This point applies to `range` too: the `range` field also needs to be added manually in all the hand-written `ESTree` impls in the `serialize` directory.

We'll also need to support it in raw transfer, but you can leave that to me - raw transfer is a bit of a minefield.

Ditto we'll need conformance testing for range, but I can handle that.
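As a rough illustration of what adding the field "manually" could look like, here is a self-contained sketch. It is not the oxc serializer API: `Span`, `serialize_identifier`, and the `ranges` flag as a plain `bool` parameter are all hypothetical, and the JSON is built by hand only to keep the example dependency-free. The idea is that each hand-written serializer emits `range` as a `[start, end]` array only when the option is enabled.

```rust
/// Hypothetical stand-in for an AST span (offsets already converted to UTF-16).
#[derive(Clone, Copy)]
struct Span {
    start: u32,
    end: u32,
}

/// Hypothetical hand-written serializer for a single node type.
/// `name` is assumed to be a plain identifier, so no JSON escaping is done.
fn serialize_identifier(name: &str, span: Span, ranges: bool) -> String {
    let mut out = format!(
        r#"{{"type":"Identifier","start":{},"end":{},"name":"{}""#,
        span.start, span.end, name
    );
    // Emit `range` only when the option is on, mirroring `start` / `end`.
    if ranges {
        out.push_str(&format!(r#","range":[{},{}]"#, span.start, span.end));
    }
    out.push('}');
    out
}

fn main() {
    let span = Span { start: 0, end: 3 };
    println!("{}", serialize_identifier("foo", span, false));
    println!("{}", serialize_identifier("foo", span, true));
}
```

The second call should include `"range":[0,3]`, while the first should not contain a `range` key at all.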

Tests

In absence of conformance test coverage, could you please add some tests for the range option to https://github.com/oxc-project/oxc/blob/main/napi/parser/test/parse.test.ts.

No need for anything fancy, just 3 tests which check the output with ranges: false, ranges: true, and ranges: undefined.

tasks/ast_tools/src/derives/estree.rs

crates/oxc_estree/src/lib.rs

crates/oxc_estree/src/serialize/mod.rs

napi/parser/index.d.ts

overlookmotel · 2025-06-17T13:13:01Z

I see you're making changes after my feedback, so marking as draft for now. Please let me know when it's ready for review again.

And thank you for your efforts!

crates/oxc_ast/src/serialize/mod.rs

overlookmotel · 2025-06-20T18:01:31Z

I know this is still a work in progress, so apologies for making comments prematurely. I just wanted to give feedback while I had the chance, as I'm going to be away from keyboard over the weekend.

bacarybruno · 2025-06-20T19:07:55Z

@overlookmotel I think I've gotten closer to what you had in mind. I'll continue when you get back from the weekend. Have a good weekend and thanks again for your help 🙏
I'm keeping the PR in draft in the meantime.

overlookmotel

I hope this doesn't come across as rude, but I'm going away tomorrow for a week, and didn't want to leave your PR hanging any longer, so I've pushed some commits to finish it off. Going to merge it now.

We have some follow-up work to complete after this is merged. I'll make a comment on #10307 in case you want to continue on with this work (please do!)

napi/parser/test/parse.test.ts

bacarybruno added 3 commits June 15, 2025 16:41

feat: add loc and range fields

d65061e

chore: cleanup

cb6fa21

chore: cleanup

2407a0c

bacarybruno requested a review from overlookmotel as a code owner June 15, 2025 15:49

github-actions bot added A-parser A-ast A-ast-tools C-enhancement labels Jun 15, 2025

Merge branch 'main' into oxc-10307-ast-loc-range

b2ad337

Boshen assigned overlookmotel Jun 16, 2025

bacarybruno and others added 2 commits June 16, 2025 14:58

Merge branch 'main' into oxc-10307-ast-loc-range

dbfc16c

[autofix.ci] apply automated fixes

1e9a466

overlookmotel changed the title ~~feat(parser): add loc and range~~ feat(ast/estree): add loc and range Jun 16, 2025

overlookmotel reviewed Jun 16, 2025

tasks/ast_tools/src/derives/estree.rs

crates/oxc_estree/src/lib.rs

crates/oxc_estree/src/serialize/mod.rs

overlookmotel mentioned this pull request Jun 16, 2025

AST nodes with loc and/or range fields. #10307

Open

chore: remove loc

208bd1f

bacarybruno changed the title ~~feat(ast/estree): add loc and range~~ feat(ast/estree): add range Jun 16, 2025

Merge branch 'main' into oxc-10307-ast-loc-range

363ac7c

graphite-app bot reviewed Jun 16, 2025

napi/parser/index.d.ts

bacarybruno added 2 commits June 17, 2025 01:39

feat: move range configuration from serializer to config types

4012c79

chore: move to config

7663b54

overlookmotel marked this pull request as draft June 17, 2025 13:13

bacarybruno added 2 commits June 18, 2025 01:25

chore: range as array

274a335

chore: format

d1ecb1c

bacarybruno force-pushed the oxc-10307-ast-loc-range branch from 1690b28 to d1ecb1c Compare June 17, 2025 23:26

chore: start cleanup

2a63d18

overlookmotel reviewed Jun 20, 2025

crates/oxc_ast/src/serialize/mod.rs

bacarybruno and others added 5 commits June 20, 2025 20:41

chore: avoid multiplying the configs

09daca0

chore: fmt

fe16f2c

chore: try to simplify

26e67db

chore: revert generated code

1088610

Merge branch 'main' into oxc-10307-ast-loc-range

f168938

overlookmotel added 8 commits June 24, 2025 01:51

Update generated code

ffb1396

Restore line break

8a37a12

Rename range to ranges

4938016

Reformat quote! macro usage

fc46eb8

Restore comment

461753c

Restore comment 2

to_estree_js_json etc take ranges param

8e7fa9c

CompactJSSerializer etc new and with_capacity take ranges option

336b7cf

oxc_estree do not expose Config trait and types

c0a4687

overlookmotel changed the title ~~feat(ast/estree): add range~~ feat(napi/parser)!: add range option Jun 24, 2025

overlookmotel added 4 commits June 24, 2025 02:05

Tweak comment

010c67e

Regenerate NAPI parser types

fb9be85

ESTreeStructSerializer call parent serializer ranges method

333b6fe

Ignore TS errors in tests

afcb64f

overlookmotel force-pushed the oxc-10307-ast-loc-range branch from f8e4f58 to afcb64f Compare June 24, 2025 01:06

overlookmotel approved these changes Jun 24, 2025

overlookmotel marked this pull request as ready for review June 24, 2025 01:10

graphite-app bot reviewed Jun 24, 2025

napi/parser/test/parse.test.ts

Fix test name

b02d526

overlookmotel merged commit 9a2548a into oxc-project:main Jun 24, 2025
25 checks passed

bacarybruno mentioned this pull request Jun 25, 2025

feat(ast): add range field to custom serializers #11890

Merged

oxc-bot mentioned this pull request Jun 25, 2025

release(crates): v0.75.0 #11897

Merged

overlookmotel pushed a commit that referenced this pull request Jul 1, 2025

feat(ast): add range field to custom serializers (#11890)

79c93e3

To continue on this work #11728 Context: #10307 (comment)
