Fixing links in print.html #866

steveklabnik · 2019-01-14T21:02:35Z

This is only a failing test, not an implementation. Opening it now for two reasons:

this fails two tests right now. @Michael-F-Bryan or someone else, how do I properly regenerate the search index json fixture? I tried to do it manually and it didn't seem to work.
does this test look reasonable?

working on a patch now cc @cetra3 if you have any advice

cetra3 · 2019-01-14T22:39:53Z

The search index is changed due to the addition of an extra chapter, which means you have to regenerate the searchindex_fixture.json file by:

Setting GENERATE_FIXTURE to true on line 469
Running cargo test
Setting GENERATE_FIXTURE back to false

Unfortunately you can't just copy it from a normal book build, as it's not pretty printed.

The current code basically concatenates all the files into one big html file and does it as it's rendering all the files. I.e, it renders a file, outputs it to the location it's at, then appends the html into print.html.

I was going to use another project I did (https://github.com/cetra3/mdcollate) to generate a correct print.md file.

With mdcollate, I was going to:

Traverse from SUMMARY.md
Generate a single print.md file
Use this as print.html

Unfortunately there would need to be a few enhancements to mdcollate to support this. Namely, dealing with windows file systems (backslashes!) & converting it into a library.

steveklabnik · 2019-01-15T16:00:30Z

which means you have to regenerate the searchindex_fixture.json file by:

Oh wow, I just missed this, thanks :(

The current code basically concatenates all the files into one big html file and does it as it's rendering all the files. I.e, it renders a file, outputs it to the location it's at, then appends the html into print.html.

Yep, that's the root cause here. The strategy I'm attempting to take is to check if links are relative, and then re-base them. We'll see...

steveklabnik · 2019-01-15T21:10:06Z

Okay! While this hasn't built here, things pass locally, as well as testing in the main Rust repository (cc rust-lang/rust#57495)

Can someone review this?

ehuss · 2019-01-15T23:48:01Z

I don't know the mdbook source code well enough to provide a review, but I have kicked the tires on this PR and it seems to work for me.

I tossed together a quick script to convert mdbook 0.1 to 0.2: https://gist.github.com/ehuss/b2a647eb69aa8f57bc4fb397656dfac1 I ran it against the reference and I found some curious issues (unrelated to this PR) that maybe someone can answer:

Previously you could have a link like #foo and it would be relative to the same page. Is it possible to do that in mdbook 0.2? Or does it have to include the page name like page.html#foo?
The .md to .html translation doesn't seem to work for links with # fragments that have md in the middle. For example, foo.html#phantomdata gets rewritten as foo.html#phant.html. Am I doing something wrong, or is this just a bug?

steveklabnik · 2019-01-16T16:29:56Z

@ehuss oh that's super cool!

I have to re-read the code, but I think those are both bugs; I'd like to support them, at least. Maybe I can fix those issues quickly and make them a part of this PR as well. I'll investigate later today and report back.

@ehuss

In a regex, `.` means `[0-9a-zA-Z]`, not a period character. This means that this regex worked, unless a link anchor contained `md` inside of it. This fixes the regex to match a literal period. Reported by @ehuss in #866 (comment)

steveklabnik · 2019-01-16T20:47:33Z

@ehuss I've pushed a commit to fix the second issue, nice catch!

The first issue, #foo, is hard though: when we make everything one page, we can't guarantee that these are all accurate. Imagine you have two different pages that contain #foo headings and anchors... fixing that problem is hard. Does linkcheck fail on these links, do you know?

ehuss · 2019-01-16T22:08:05Z

The print.html page just ends up with a few links that point to the wrong section. The headers get unique anchor names (#foo #foo-1 #foo-2), but the links will all point to #foo. I believe this has always been the case (see #542).

The issue I was specifically referring to is if a fragment-only link can elide the page name or not. It's not a big deal, just a minor convenience. The script I posted inserts the page name, so it's an easy workaround.

steveklabnik · 2019-01-17T21:40:20Z

I plan on merging this tomorrow and cutting a new release.

steveklabnik · 2019-01-18T14:54:23Z

Thanks folks!

@ehuss

In a regex, `.` means `[0-9a-zA-Z]`, not a period character. This means that this regex worked, unless a link anchor contained `md` inside of it. This fixes the regex to match a literal period. Reported by @ehuss in rust-lang#866 (comment)

Fixing links in print.html

steveklabnik force-pushed the gh828 branch 2 times, most recently from 0e01d71 to 5d93b14 Compare January 15, 2019 19:07

failing test

a481735

steveklabnik force-pushed the gh828 branch from 5d93b14 to 6c4fa05 Compare January 15, 2019 19:09

Fix the bug

2ddbb37

steveklabnik force-pushed the gh828 branch from 6c4fa05 to 2ddbb37 Compare January 15, 2019 19:10

steveklabnik mentioned this pull request Jan 15, 2019

Support multiple versions of MdBook for docs rust-lang/rust#57495

Merged

Fix a bug in link re-writing

23ac06e

In a regex, `.` means `[0-9a-zA-Z]`, not a period character. This means that this regex worked, unless a link anchor contained `md` inside of it. This fixes the regex to match a literal period. Reported by @ehuss in #866 (comment)

steveklabnik merged commit 25c1ca1 into master Jan 18, 2019

steveklabnik deleted the gh828 branch January 18, 2019 14:54

steveklabnik mentioned this pull request Jan 29, 2019

Upgrades for mdbook 0.2.0, and link style cleanup. rust-lang/reference#391

Closed

ehuss mentioned this pull request May 17, 2019

I cannot get the new links to work #828

Closed

Ruin0x11 pushed a commit to Ruin0x11/mdBook that referenced this pull request Aug 30, 2020

Merge pull request rust-lang#866 from rust-lang-nursery/gh828

37814dd

Fixing links in print.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixing links in print.html #866

Fixing links in print.html #866

steveklabnik commented Jan 14, 2019

cetra3 commented Jan 14, 2019 •

edited

Loading

steveklabnik commented Jan 15, 2019

steveklabnik commented Jan 15, 2019

ehuss commented Jan 15, 2019

steveklabnik commented Jan 16, 2019

steveklabnik commented Jan 16, 2019

ehuss commented Jan 16, 2019

steveklabnik commented Jan 17, 2019

steveklabnik commented Jan 18, 2019

Fixing links in print.html #866

Fixing links in print.html #866

Conversation

steveklabnik commented Jan 14, 2019

cetra3 commented Jan 14, 2019 • edited Loading

steveklabnik commented Jan 15, 2019

steveklabnik commented Jan 15, 2019

ehuss commented Jan 15, 2019

steveklabnik commented Jan 16, 2019

steveklabnik commented Jan 16, 2019

ehuss commented Jan 16, 2019

steveklabnik commented Jan 17, 2019

steveklabnik commented Jan 18, 2019

cetra3 commented Jan 14, 2019 •

edited

Loading