Swedish news sites
fivefilters committed Sep 18, 2013
1 parent 19c0d2a commit 1074737
Showing 10 changed files with 74 additions and 4 deletions.
4 changes: 3 additions & 1 deletion dn.se.txt
@@ -23,4 +23,6 @@ author: //div[@id="byline"]/div/p/strong

# Date
date: substring(substring-after(//p[@class="published"], 'Publicerad '), 0, 11)
test_url: http://www.dn.se/nyheter/varlden/landade-flygplan-mitt-i-villaomrade

test_url: http://www.dn.se/nyheter/varlden/landade-flygplan-mitt-i-villaomrade
test_url: http://www.dn.se/m/rss/senaste-nytt
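
The `date` rule in this file relies on XPath 1.0 string functions, whose 1-based indexing is easy to misread: `substring(s, 0, 11)` yields ten characters, not eleven, because position 0 does not exist. The Python sketch below re-implements the two functions for plain strings to make that visible. The sample text `Publicerad 2013-09-18 10:15` is an assumed shape for the page's "published" line, not something taken from dn.se, and NaN or infinite arguments are not handled.

```python
import math


def substring_after(text: str, marker: str) -> str:
    """XPath 1.0 substring-after: text following the first occurrence of marker."""
    if marker == "":
        return text  # XPath returns the whole string for an empty marker
    _, sep, tail = text.partition(marker)
    return tail if sep else ""  # empty string when the marker is absent


def xpath_round(value: float) -> int:
    """XPath 1.0 round(): halves round towards positive infinity."""
    return math.floor(value + 0.5)


def xpath_substring(text: str, start: float, length: float) -> str:
    """XPath 1.0 substring with a length argument.

    Characters are numbered from 1; the character at position p is kept
    when round(start) <= p < round(start) + round(length).
    """
    first = xpath_round(start)
    end = first + xpath_round(length)
    return "".join(ch for p, ch in enumerate(text, start=1) if first <= p < end)


# Hypothetical content of //p[@class="published"]; the real page may differ.
published = "Publicerad 2013-09-18 10:15"
date_part = xpath_substring(substring_after(published, "Publicerad "), 0, 11)
print(date_part)
```

Under that assumed input, the expression keeps positions 1 through 10, which is exactly the width of a `YYYY-MM-DD` date.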
8 changes: 8 additions & 0 deletions fria.nu.txt
@@ -0,0 +1,8 @@
body: //div[contains(@class, 'layout__inner')]//div[contains(@class, 'file-image') or contains(@class, 'node__content')]
author: //article//div[contains(@class, 'field-byline')]
strip_id_or_class: rekommenderade
strip_id_or_class: disqus
strip_id_or_class: annonser

test_url: http://www.fria.nu/artikel/112079
test_url: http://www.fria.nu/taxonomy/term/1928/all/feed
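
The `body` and `author` rules for the Fria sites use `contains(@class, '...')`, which is a plain substring test on the whole attribute value rather than a match on one class token. The sketch below contrasts the two behaviours in Python. The class attribute values are made up for illustration, and `has_class_token` is a hypothetical helper, not part of any library.

```python
def xpath_contains(haystack: str, needle: str) -> bool:
    """XPath 1.0 contains(): true when needle occurs anywhere in haystack."""
    return needle in haystack


def has_class_token(class_attr: str, token: str) -> bool:
    """Stricter check: token must be one of the whitespace-separated classes.

    The usual XPath 1.0 idiom for this is
    contains(concat(' ', normalize-space(@class), ' '), ' token ').
    """
    return token in class_attr.split()


# Hypothetical class attribute values, not copied from the sites.
samples = ["node__content", "field node__content clearfix", "node__content-teaser"]

results = [
    (value, xpath_contains(value, "node__content"), has_class_token(value, "node__content"))
    for value in samples
]
for value, loose, strict in results:
    print(f"{value!r}: contains={loose} token={strict}")
```

The substring form is more tolerant of markup changes, at the cost of also matching longer class names that merely start with or embed the same text.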
7 changes: 7 additions & 0 deletions friatidningen.se.txt
@@ -0,0 +1,7 @@
body: //div[contains(@class, 'layout__inner')]//div[contains(@class, 'file-image') or contains(@class, 'node__content')]
author: //article//div[contains(@class, 'field-byline')]
strip_id_or_class: rekommenderade
strip_id_or_class: disqus
strip_id_or_class: annonser

test_url: http://www.friatidningen.se/artikel/112074
7 changes: 7 additions & 0 deletions goteborgsfria.se.txt
@@ -0,0 +1,7 @@
body: //div[contains(@class, 'layout__inner')]//div[contains(@class, 'file-image') or contains(@class, 'node__content')]
author: //article//div[contains(@class, 'field-byline')]
strip_id_or_class: rekommenderade
strip_id_or_class: disqus
strip_id_or_class: annonser

test_url: http://www.goteborgsfria.se/artikel/112079
11 changes: 11 additions & 0 deletions gp.se.txt
@@ -0,0 +1,11 @@
body: //div[@id='articleContainer']
author: //div[@id='articleContent']//div[contains(@class, 'byline')]//span[contains(@class, 'name fn')]
strip_id_or_class: toolbar
strip_id_or_class: ADad
strip_id_or_class: articleSerieWrapper
strip_id_or_class: articleFloatContainer
strip: //div[contains(@class, 'byline')]//img
prune: no

test_url: http://www.gp.se/nyheter/bohuslan/1.2045564-styckade-mannen-hade-mordat-hustrun
test_url: http://www.gp.se/1.16560
7 changes: 7 additions & 0 deletions landetsfria.se.txt
@@ -0,0 +1,7 @@
body: //div[contains(@class, 'layout__inner')]//div[contains(@class, 'file-image') or contains(@class, 'node__content')]
author: //article//div[contains(@class, 'field-byline')]
strip_id_or_class: rekommenderade
strip_id_or_class: disqus
strip_id_or_class: annonser

test_url: http://www.landetsfria.se/artikel/112070
7 changes: 7 additions & 0 deletions skanesfria.se.txt
@@ -0,0 +1,7 @@
body: //div[contains(@class, 'layout__inner')]//div[contains(@class, 'file-image') or contains(@class, 'node__content')]
author: //article//div[contains(@class, 'field-byline')]
strip_id_or_class: rekommenderade
strip_id_or_class: disqus
strip_id_or_class: annonser

test_url: http://www.skanesfria.se/artikel/112045
7 changes: 7 additions & 0 deletions stockholmsfria.nu.txt
@@ -0,0 +1,7 @@
body: //div[contains(@class, 'layout__inner')]//div[contains(@class, 'file-image') or contains(@class, 'node__content')]
author: //article//div[contains(@class, 'field-byline')]
strip_id_or_class: rekommenderade
strip_id_or_class: disqus
strip_id_or_class: annonser

test_url: http://www.stockholmsfria.nu/artikel/112068
13 changes: 10 additions & 3 deletions sydsvenskan.se.txt
@@ -2,10 +2,17 @@ title: //h1

author: //a[contains(@href, '/sok/?')]/text()

date: substring-after(//span[@class='date'], 'Publicerad ')
date: //meta[@name='bi3dPubDate']/@content

body: //div[@class='two_column_left']
body: (//div[contains(@class, 'slider_wrapper')])[1] | //div[@id='article_image' or @class='two_column_left']
strip_id_or_class: story
strip_id_or_class: article_body_ad
strip: //div[@class='leadText saplo:lead']/h5

test_url: http://www.sydsvenskan.se/kultur-och-nojen/-jag-vill-garna--stanna--
replace_string(<br />): <br /><br />

prune: no

test_url: http://www.sydsvenskan.se/malmo/allt-jag-ager-ligger-pa-botten/
test_url: http://www.sydsvenskan.se/kultur-och-nojen/-jag-vill-garna--stanna--
test_url: http://www.sydsvenskan.se/rss.xml
7 changes: 7 additions & 0 deletions uppsalafria.se.txt
@@ -0,0 +1,7 @@
body: //div[contains(@class, 'layout__inner')]//div[contains(@class, 'file-image') or contains(@class, 'node__content')]
author: //article//div[contains(@class, 'field-byline')]
strip_id_or_class: rekommenderade
strip_id_or_class: disqus
strip_id_or_class: annonser

test_url: http://www.uppsalafria.se/artikel/97167
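
The files in this commit share a simple line format: `key: value` pairs, `#` comments, blank separators, and keys such as `strip_id_or_class` and `test_url` that may repeat. A minimal Python sketch of reading that format follows. It is inferred only from the files shown here and is not the parser the extraction tool itself uses; `parse_site_config` is a hypothetical name.

```python
from collections import defaultdict


def parse_site_config(text: str) -> dict:
    """Collect every 'key: value' line into a dict of lists.

    Splitting on the first colon keeps URLs and XPath values intact, and it
    also handles parameterised keys such as 'replace_string(<br />)'.
    A key that itself contains a colon is not handled in this sketch.
    """
    rules = defaultdict(list)
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        key, sep, value = line.partition(":")
        if not sep:
            continue  # ignore lines without a key/value separator
        rules[key.strip()].append(value.strip())
    return dict(rules)


# A few lines taken from gp.se.txt and sydsvenskan.se.txt in this commit.
sample = """\
# Body
body: //div[@id='articleContainer']
strip_id_or_class: toolbar
strip_id_or_class: ADad
replace_string(<br />): <br /><br />
prune: no

test_url: http://www.gp.se/1.16560
"""

config = parse_site_config(sample)
for key, values in config.items():
    print(key, values)
```

Keeping every value in a list preserves the order of repeated rules, which matters if strip rules are meant to be applied in sequence.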
