## Table of Contents

- [About the Project](#about-the-project)
- [Getting Started](#getting-started)
- [Endpoints](#endpoints)
- [License](#license)
- [Contact](#contact)
## About the Project

A Python application that crawls a dynamic web page whose content is generated by asynchronous JavaScript calls after the page loads. The Scrapy spider drives a Selenium WebDriver internally to handle those asynchronous JS calls (see the sketch after the list below) and covers:
- collecting NBS articles across multiple pages;
- validating the collected data;
- saving the collected, valid articles to an SQLite database;
- exposing the collected data through endpoints built with the FastAPI framework.
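As a rough illustration of how a Scrapy spider can drive Selenium internally, here is a minimal sketch. The start URL, CSS selectors, and wait condition are placeholders, not the project's actual values:

```python
# Sketch of a Scrapy spider that renders a JS-heavy page with Selenium.
# The URL and selectors below are hypothetical; only the spider name
# "article" comes from the project (scrapy crawl article).
import scrapy
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC


class ArticleSpider(scrapy.Spider):
    name = "article"
    start_urls = ["https://example.com/press/articles"]  # placeholder URL

    def __init__(self, *args, **kwargs):
        super().__init__(*args, **kwargs)
        options = webdriver.ChromeOptions()
        options.add_argument("--headless")
        self.driver = webdriver.Chrome(options=options)

    def parse(self, response):
        # Open the page in a real browser so the asynchronous JavaScript
        # calls can fire and render the article list.
        self.driver.get(response.url)
        WebDriverWait(self.driver, 10).until(
            EC.presence_of_element_located((By.CSS_SELECTOR, ".article"))
        )
        # Hand the fully rendered DOM back to Scrapy's selectors.
        rendered = scrapy.Selector(text=self.driver.page_source)
        for node in rendered.css(".article"):
            yield {
                "title": node.css("h2::text").get(),
                "date": node.css(".date::text").get(),
            }

    def closed(self, reason):
        self.driver.quit()
```

Waiting for a concrete element to appear, rather than sleeping for a fixed interval, keeps the crawl robust when the asynchronous calls take a variable amount of time.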
## Getting Started

- Clone the repo
  ```sh
  git clone https://github.com/TanyaAng/Articles_API.git
  ```
- Install the Python dependencies
  ```sh
  pip install -r requirements.txt
  ```
- Make nbs_articles your working directory
- Run the spider from the terminal (serving the API afterwards is sketched below):
  ```sh
  (venv) ..\nbs_articles> scrapy crawl article
  ```
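Once the spider has populated the database, the API can be served locally. Assuming the FastAPI app object is defined in main.py (the actual module name may differ):

```sh
(venv) ..\nbs_articles> uvicorn main:app --reload
```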
## Endpoints

| Endpoint | HTTP Method | Description |
| --- | --- | --- |
| /articles/ | GET | get all crawled articles and their properties |
| /articles/?label={label} | GET | get the list of articles with the given label |
| /articles/?date={date} | GET | get the list of articles published on the given date |
| /article/{article_id} | GET | get a single article |
| /article/{article_id} | DELETE | delete a single article |
| /article/{article_id} | PUT | update a single article |
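For illustration, here is a minimal sketch of how the filtered GET /articles/ endpoint from the table above could be implemented; the database file, table name, and column names are assumptions:

```python
# Sketch of the GET /articles/ endpoint with optional label/date filters.
# DB_PATH, the "articles" table, and its columns are hypothetical names.
import sqlite3
from typing import Optional

from fastapi import FastAPI

app = FastAPI()
DB_PATH = "articles.db"  # assumed database file


@app.get("/articles/")
def list_articles(label: Optional[str] = None, date: Optional[str] = None):
    query = "SELECT id, title, label, date FROM articles"
    filters = []
    params = []
    if label is not None:
        filters.append("label = ?")
        params.append(label)
    if date is not None:
        filters.append("date = ?")
        params.append(date)
    if filters:
        query += " WHERE " + " AND ".join(filters)
    with sqlite3.connect(DB_PATH) as conn:
        conn.row_factory = sqlite3.Row  # return rows as dict-like objects
        rows = conn.execute(query, params).fetchall()
    return [dict(row) for row in rows]
```

Building the WHERE clause from parameterized fragments keeps the optional label and date filters composable without exposing the query to SQL injection.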
## License

MIT License
## Contact

Tanya Angelova - LinkedIn - t.j.angelova@gmail.com

Project Link: https://github.com/TanyaAng/Articles_API