-
Notifications
You must be signed in to change notification settings - Fork 285
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
QSURLExtractor needs improving #1146
Comments
This isn't available by default on users’ systems. Not a deal breaker, but it complicates things quite a bit.
I looked into this a bit the other day. I think the best solution is to just use a list and manually prevent duplicates.
I don’t remember, but more likely, it doesn’t convert any of them. Properly or otherwise. :-) |
I used the assumption that since we're already packaging BeautifulSoup.py We could probably just pinch the method (or are they functions in Python?) Let me know if you're looking into these Pythony things, or I will ;-) On 28 September 2012 13:54, Rob McBroom notifications@github.com wrote:
|
Yeah, but Beautiful Soup was designed to be one file. That other thing appears to be a more traditional "module". I'm sure there's a way to include it if it's important enough, though.
It's interesting, but no, probably not any time soon. |
A couple of things:
<a></a>
tag usingthisLink['title'] = link.contents
What I've done previously is to then strip out all the HTML tags from this using the common file from the webscraping import
so
link
var should be an ordered set if possible. It's a bit unnerving having results show up in any order on the pagehttp://www.wordreference.com/fren/grand
in QS's 1st pane, then right arrow into it. Notice how there are lots of things like"
etc.The text was updated successfully, but these errors were encountered: