public
Description: Small RDF python scraper for gumtree ads (include geocoding)
Clone URL: git://github.com/moustaki/flatscrap.git
Search Repo:
Beginning of a gumtree scrapper
moustaki (author)
Thu Mar 06 09:50:04 -0800 2008
commit  42068eb2abd4a916f4d6083592caed9deec21fed
tree    4e83daa2c760ebe31ab0ae392acf549ba15f0f71
parent  a57f635d03cec29e13201a7897c7ceccb1001129
...
 
 
 
 
 
 
 
 
 
 
 
 
 
 
...
1
2
3
4
5
6
7
8
9
10
11
12
13
14
0
@@ -1 +1,15 @@
0
+#!/usr/bin/python
0
+import urllib
0
+import sys
0
+from BeautifulSoup import BeautifulSoup
0
+
0
+print "Scrapping "+sys.argv[1]
0
+
0
+f = urllib.urlopen(sys.argv[1])
0
+html = f.read()
0
+f.close()
0
+
0
+soup = BeautifulSoup(html)
0
+
0
+print soup('span','fieldlabel')

Comments

    No one has commented yet.