news-please - an integrated web crawler and information extractor for news that just works
Updated Jul 31, 2024 - Python
ChatWeb can crawl web pages, read PDF, DOCX, and TXT files, and extract their main content, then answer your questions based on that content or summarize its key points.
A Python-based web app that extracts and summarizes news using NewsAPI, newspaper3k, spaCy, and the Pegasus and T5 models from Hugging Face. It categorizes news articles and uses a graph-based summary feature to summarize multiple documents. The app works with news in any language supported by NewsAPI.
News Extractor