Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Doesn't extracts from markdown format properly #15

Closed
ankitxjoshi opened this issue Apr 9, 2018 · 1 comment
Closed

Doesn't extracts from markdown format properly #15

ankitxjoshi opened this issue Apr 9, 2018 · 1 comment
Assignees
Milestone

Comments

@ankitxjoshi
Copy link

Code:

from urlextract import URLExtract

extractor = URLExtract()
text="""
[http://httpbin.org/status/200](http://httpbin.org/status/200)
"""
urls = extractor.find_urls(text)
print(urls)

Output:
['[http://httpbin.org/status/200](http://httpbin.org/status/200)', '[http://httpbin.org/status/200](http://httpbin.org/status/200)']

@lipoja lipoja self-assigned this Apr 9, 2018
lipoja added a commit that referenced this issue Apr 18, 2018
…#14).  Proper parsing of URLs form markdown (issue #15)

* devel:
  Updating changelog (WIP)
  Improving extracting of URL from enclosure of two characters.
  Updating removing remove_enclosure from URL. Take last occurrence of right enclosure character.
  fixing extracting URLs form markdown (issue #15)
  Updating extraction of URL from enclosure characters.
  Correcting README - Known issues (typo)
  Adding extracting removing URL from enclosure of two characters (e.g. brackets). Adding enclosure removal, add_enclosure and remove_enclosure testing.
  Update after_tld_chars. (add ">") Move hostname regexp to definition as variable.
  Code clean up and stylization.
  Removing '@' from left stop characters. User Information part of Authority of URL will be returned (e.g. email)
  Creating unit tests for find_urls() method from doctests. Using pytest for testing.
@lipoja
Copy link
Owner

lipoja commented Apr 19, 2018

I've added method for proper parsing of markdown.

@lipoja lipoja closed this as completed Apr 19, 2018
@lipoja lipoja added this to the 0.9 milestone Apr 22, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants