You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to add a publisher to the CH class, and the publisher I'm attempting to add is aargauerzeitung.ch. However, I'm encountering an issue with retrieving the topics or keywords. Here's the code for the class:
import datetime
from typing import List, Optional
from lxml.cssselect import CSSSelector
from fundus.parser import ArticleBody, BaseParser, ParserProxy, attribute
from fundus.parser.utility import (
extract_article_body_with_selector,
generic_author_parsing,
generic_date_parsing,
)
class AARGAUERZEITUNGParser(ParserProxy):
class V1(BaseParser):
_paragraph_selector = CSSSelector(
"p.headline__lead")
Question
Hey there,
I'm trying to add a publisher to the
CH
class, and the publisher I'm attempting to add is aargauerzeitung.ch. However, I'm encountering an issue with retrieving the topics or keywords. Here's the code for the class:import datetime
from typing import List, Optional
from lxml.cssselect import CSSSelector
from fundus.parser import ArticleBody, BaseParser, ParserProxy, attribute
from fundus.parser.utility import (
extract_article_body_with_selector,
generic_author_parsing,
generic_date_parsing,
)
class AARGAUERZEITUNGParser(ParserProxy):
class V1(BaseParser):
_paragraph_selector = CSSSelector(
"p.headline__lead")
Here's an example of an article from the publisher: https://www.aargauerzeitung.ch/aargau/baden/baden-kreativitaet-liegt-in-der-familie-schwester-von-esc-superstar-nemo-hat-kampagne-fuer-das-grand-casino-baden-produziert-ld.2619716?reduced=true
This is the current code structure for adding a publisher:
Appenzeller_Zeitung = PublisherSpec(
name="Appenzeller Zeitung",
domain="https://www.appenzellerzeitung.ch/",
sources=[
RSSFeed("https://www.appenzellerzeitung.ch/schweiz.rss"),
NewsMap(" https://www.appenzellerzeitung.ch/sitemap.xml"),
Sitemap("https://www.appenzellerzeitung.ch/sitemap.xml"),
],
parser=APPENZELLERZEITUNGParser
)
I appreciate any help or guidance you can provide!
The text was updated successfully, but these errors were encountered: