No parsing of text inside <ref>? #67

wetneb · 2017-09-23T19:03:37Z

Hi!
After some investigation, it looks like content enclosed in XML tags is not parsed any further. For instance:

Groundbreaking claim.<ref>see {{cite book|author=Chuck Norris|title=The Truth}}</ref>

The citation template will not be parsed at all: the content of the <ref>...</ref> is just represented as a String.

I understand this might be desirable for tags like <nowiki/>, but I'm not sure why it should apply to every tag. How could I modify the parser to recurse inside the <ref>?

hannesd · 2017-09-25T09:32:47Z

The behavior you describe only applies to tags that were configured as tag extensions in the parser (through the config). Tag extensions are basically functions that are free to do whatever they want with their content. The parser therefore cannot simply parse that content, because there's a good chance it isn't even wikitext.
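To make the distinction concrete, here is a minimal toy model of that behavior. It is not Sweble code and does not use the Sweble API: the class `TagExtensionDemo`, the method `findTemplates` and both regular expressions are hypothetical, and they only handle simple, non-nested tags and templates. The sketch assumes that a tag registered as an extension keeps its body opaque, while any other tag has its body scanned again.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Toy model only: NOT the Sweble parser, just an illustration of the rule described above.
class TagExtensionDemo {
    // A simple paired element without attributes, e.g. <ref>...</ref>.
    private static final Pattern ELEMENT =
            Pattern.compile("<(\\w+)>(.*?)</\\1>", Pattern.DOTALL);
    // A non-nested template call, e.g. {{cite book|author=...}}; group 1 is the name.
    private static final Pattern TEMPLATE =
            Pattern.compile("\\{\\{([^{}|]+)[^{}]*\\}\\}");

    /** Returns the names of all templates this toy parser can see in the given text. */
    public static List<String> findTemplates(String wikitext, Set<String> tagExtensions) {
        List<String> found = new ArrayList<>();
        Matcher m = ELEMENT.matcher(wikitext);
        int pos = 0;
        while (m.find()) {
            // Plain text in front of the element is always scanned.
            collect(wikitext.substring(pos, m.start()), found);
            if (!tagExtensions.contains(m.group(1))) {
                // Unknown element: its body is treated as wikitext and scanned recursively.
                found.addAll(findTemplates(m.group(2), tagExtensions));
            }
            // Registered tag extension: the body stays an opaque string and is skipped.
            pos = m.end();
        }
        collect(wikitext.substring(pos), found);
        return found;
    }

    // Adds the trimmed name of every template found in plain text to the output list.
    private static void collect(String text, List<String> out) {
        Matcher m = TEMPLATE.matcher(text);
        while (m.find()) {
            out.add(m.group(1).trim());
        }
    }

    public static void main(String[] args) {
        String text = "Groundbreaking claim."
                + "<ref>see {{cite book|author=Chuck Norris|title=The Truth}}</ref>";
        // "ref" registered as a tag extension: the citation template stays hidden.
        System.out.println(findTemplates(text, Set.of("ref", "math")));
        // "ref" not registered: the body is scanned and the template is found.
        System.out.println(findTemplates(text, Set.of("math")));
    }
}
```

With `ref` in the set of tag extensions the template inside the reference is never seen; once `ref` is left out of that set, the same input yields the `cite book` template.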

I took a look at the engine code, and the <math> and <ref> extensions have nonsensical implementations (they return null). You have two options here:

a) Remove the math and ref tag extensions from the configuration. They should then be treated as unknown XML elements and their content would get parsed.

b) Properly implement the extension you require. Here's an example of how one could parse the extension's content in the MathTagExtImpl.invoke(...) method of the tag extension:

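try
{
	// Parse the raw body of the tag extension as wikitext,
	// using the title of the page that is currently being processed.
	EngProcessedPage processed = frame.getEngine().parseAndPostprocess(
			new PageId(frame.getTitle(), -1L),
			body.getContent(),
			null);
	EngPage page = processed.getPage();
	// Hand the parsed content back in place of the tag extension.
	return nf().unwrap(page);
}
catch (EngineException e)
{
	// Report the failure as a soft error node instead of propagating the exception.
	return nf().softError("Processing of <math> tag failed");
}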

I cannot guarantee that this code actually works as I have not tested it.

wetneb · 2017-09-25T09:34:31Z

@hannesd awesome, thanks a lot!

wetneb closed this as completed Sep 25, 2017

wetneb added a commit to OpenRefine/OpenRefine that referenced this issue Oct 20, 2017

Forbid pipe characters in URL references to ease parsing.

e2a22a6

This is a temporary fix before we do full Wikitext parsing inside references (this needs a change upstream). See sweble/sweble-wikitext#67 .

wetneb mentioned this issue Oct 20, 2017

Forbid pipe characters in URL references to ease parsing. OpenRefine/OpenRefine#1275

Merged
