Economist Scraping Assessment

The assignment

We would like you to create a web application that shows data collected by the Economist website (https://www.economist.com/).

The data should be provided by an API infrastructure.
To earn extra credit you need to provide a scraping system that allows us to retrieve data from the Economist website in realtime.
The features it should include:
- Show in a list of articles from the Economist website
- Create an authentication system with a simple signup and login setup
- Create an API infrastructure that gives back the list of article and single article information (only to logged-in users)
Extra credit features:
- Website scraping of the Economist website
- Put the project online using Heroku or similar

Build and Run with Docker and docker-compose

It is possible execute ./run.sh script, in order to run the application and configure MongoDb instance to persist the data.

or, to launch the single commands:

Build Docker image by:

$ docker build -t economist_scraping:latest .

Run docker-compose command:

$ docker-compose up -d

Build and Run on your Local Machine

In alternative, you can build and run the application on your local machine.

Install dependencies by:

$ npm install

Build application by:

$ npm run build

Run unit tests and integration tests, in order to understand if everything is well:

$ npm test

Finally, run the application:

$ npm start

Description

The application provides three different endpoint, in order to retrieve and persist the main info about each article from Economist newspaper website.

In particular:

Trigger the parser of the Article homepage, retrieve title and subtitle and persist them
- POST <HOST>/api/v1/articles
The above API provides the list of ids of each article.
Retrieve all articles by:
- GET <HOST>/api/v1/articles
Retrieve the single article, providing the article identifier
- GET <HOST>/api/v1/articles/<ARTICLE_ID>

For example, you can verify the above endpoint by following cURL commands:

curl -X POST http://localhost:3000/api/v1/articles

curl -X GET http://localhost:3000/api/v1/articles

curl -X GET http://localhost:3000/api/v1/articles/<ARTICLE_ID>

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
mongo-initdb		mongo-initdb
src		src
tests		tests
.env		.env
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
docker-compose.yml		docker-compose.yml
jest.config.js		jest.config.js
package.json		package.json
run.sh		run.sh
tsconfig.json		tsconfig.json
tslint.json		tslint.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Economist Scraping Assessment

The assignment

Build and Run with Docker and docker-compose

Build and Run on your Local Machine

Description

About

Releases

Packages

Languages

NicoMincuzzi/economist_scraping

Folders and files

Latest commit

History

Repository files navigation

Economist Scraping Assessment

The assignment

Build and Run with Docker and docker-compose

Build and Run on your Local Machine

Description

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages