Skip to content
This repository has been archived by the owner on Jun 15, 2023. It is now read-only.

Cuts off links that span two lines #40

Open
marshalmiller opened this issue Feb 16, 2020 · 3 comments
Open

Cuts off links that span two lines #40

marshalmiller opened this issue Feb 16, 2020 · 3 comments

Comments

@marshalmiller
Copy link

Links that span spill over onto the second line are cut off when being recognized and thus reported as dead.

@almereyda
Copy link

Some links sometimes also span over multiple lines > 2 and are equally not recognised.

@metachris
Copy link
Owner

Thanks for the report/request. But it's hard to detect... Maybe someone is interested and can come up with a regex that works?

@maximiliancw
Copy link

Replacing all line breaks (e.g. \n) in the text before passing it to the regex should work?
Specifically, we could do so in the extract_url function, I believe. Will try this out and submit a PR, if it works.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants