## Automatic News Scraping with Python, Newspaper and Feedparser

*The goal is to extract key details from news articles: the title, authors, publish date, and main text. This information can feed tasks such as building a personal news feed, analyzing trends, or assembling NLP datasets. This notebook uses Python with Feedparser to read an RSS feed and Newspaper (the `newspaper3k` package) to download and parse each linked article.*
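
Feedparser exposes each feed item as a dict-like entry, and some of the details listed above (title, link, author, publish time) are often present in the feed itself, before any article is downloaded. The following is a minimal sketch of collecting those fields from one entry. The helper name `entry_metadata` and the sample entry are hypothetical. `published_parsed` is the UTC `time.struct_time` that feedparser provides when the feed includes a date, and any of these fields may be absent.

```python
import calendar
from datetime import datetime, timezone


def entry_metadata(entry):
    """Collect feed-level metadata from one feedparser-style entry.

    `entry` is assumed to be dict-like (feedparser entries support .get()).
    Any field the feed omits comes back as None.
    """
    published = None
    parsed = entry.get('published_parsed')  # UTC time.struct_time, if present
    if parsed is not None:
        # timegm() interprets the time tuple as UTC, matching feedparser's convention
        published = datetime.fromtimestamp(calendar.timegm(parsed), tz=timezone.utc)
    return {
        'title': entry.get('title'),
        'link': entry.get('link'),
        'author': entry.get('author'),
        'publish_date': published,
    }


# Hypothetical entry, shaped like what feedparser produces for one RSS item
sample = {
    'title': 'Example headline',
    'link': 'https://example.com/story',
    'published_parsed': (2025, 1, 15, 12, 30, 0, 2, 15, 0),
}
meta = entry_metadata(sample)
print(meta['publish_date'].isoformat())  # 2025-01-15T12:30:00+00:00
```

With a real feed, the same helper could be applied to each item in `feedparser.parse(feed_url).entries`, for instance to supply a publish date when the article page itself yields none.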

In [1]:
!pip install newspaper3k
!pip install feedparser



In [2]:
import newspaper
import feedparser

In [3]:
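def scrape_news_from_feed(feed_url):
    """Return one dict (title, author, publish_date, content) per feed entry."""
    articles = []
    feed = feedparser.parse(feed_url)
    for entry in feed.entries:
        # create a newspaper article object for the entry's URL
        article = newspaper.Article(entry.link)
        try:
            # download and parse the article
            article.download()
            article.parse()
        except newspaper.ArticleException:
            # skip articles that could not be downloaded or parsed
            continue
        # extract relevant information
        articles.append({
            'title': article.title,
            'author': article.authors,  # list of names; may be empty
            'publish_date': article.publish_date,  # datetime, or None if not detected
            'content': article.text
        })
    return articles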


feed_url = 'http://feeds.bbci.co.uk/news/rss.xml'
articles = scrape_news_from_feed(feed_url)

# print the extracted articles
for article in articles:
    print('Title:', article['title'])
    print('Author:', article['author'])
    print('Publish Date:', article['publish_date'])
    print('Content:', article['content'])
    print()


Title: this is what the victims of LA fires went back to get
Author: []
Publish Date: None
Content: Pearls, a power saw and a lawnmower toy: What LA fire survivors went back for

BBC Peter Mitchell built the furniture and cabinets in his home from scratch - now they're gone

A toy lawnmower: Henry Giles

Henry Giles hid his toys in the bushes, hoping they'd be safe

When Henry Giles, age six, returned to his home in Altadena with his parents, he knew exactly where to look. The front gate and wall on the property is all that has survived untouched, with intact mail still in the letter box. Behind it, two burned out cars sit in the driveway, and in the back a swing set is destroyed. But tucked into some bushes was a bucket and a toy lawn mower. "Mommy look, they survived! We hid them in the bushes because we knew there would be a fire!" Henry excitedly yelled Henry and his brother Lucas inspected the remaining plants: "This one survived! This didn't. Our tomato plant is dead." Few other 