Skip to content

Commit

Permalink
Update scraper.py
Browse files Browse the repository at this point in the history
  • Loading branch information
paulbradshaw committed Jan 25, 2018
1 parent 9168cbd commit 662ecfc
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions scraper.py
Expand Up @@ -18,8 +18,9 @@
print link
print link.text
print link.text_content()
print link.attrib['href']
fullurl = baseurl+link.attrib['href'].split("?")[0]
cleanurl = link.attrib['href'].split("?")[0]
print cleanurl
fullurl = baseurl+cleanurl
linkslist.append(fullurl)

for url in linkslist:
Expand Down

0 comments on commit 662ecfc

Please sign in to comment.