# web_crawler

A Gleam project.

## Quick start

Install Erlang and Gleam.

Run the script with your input:

```sh
. ./run.sh http://www.yourdomain.com
```

Run the tests:

```sh
# Run the eunit tests
rebar3 eunit
# Run the Erlang REPL
rebar3 shell
```

## Details

Please complete the user story below.

- Your code should compile and run in one step.
- Write it as you would write a production-ready feature.
- Feel free to use whatever frameworks/languages/libraries/packages you like.

As a user running the application,
I can enter a web URL to be crawled,
So that I can generate and view a visual representation of the static assets each page depends on and the links between the pages.

### Conditions of Satisfaction

The crawler should be limited to one domain - so when crawling https://elixir-lang.org/ it would crawl all pages within the domain, but not follow external links, for example to the Facebook and Twitter accounts.
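
As a rough sketch of that same-domain check, assuming a current Gleam toolchain and the `gleam/uri` and `gleam/option` modules from the standard library (the `same_domain` function is an illustrative name, not part of this project):

```gleam
// Illustrative sketch only; not code from this repository.
import gleam/option
import gleam/uri

/// True when `link` resolves to the same host as `base`, so external
/// links (Facebook, Twitter, etc.) are not followed.
pub fn same_domain(base: String, link: String) -> Bool {
  case uri.parse(base), uri.parse(link) {
    Ok(uri.Uri(host: option.Some(a), ..)), Ok(uri.Uri(host: option.Some(b), ..)) ->
      a == b
    _, _ -> False
  }
}
```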

Given a URL, it should output a site map, showing which static assets each page depends on, and the links between pages.
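
One possible shape for that output, as a rough sketch: a record per crawled page plus a plain-text renderer. The `Page` type and `render` function are hypothetical names, and `gleam/string` from the standard library is assumed:

```gleam
// Hypothetical output shape; not code from this repository.
import gleam/string

pub type Page {
  Page(url: String, assets: List(String), links: List(String))
}

/// Render one crawled page of the site map as indented plain text.
pub fn render(page: Page) -> String {
  page.url
  <> "\n  assets:\n    "
  <> string.join(page.assets, "\n    ")
  <> "\n  links:\n    "
  <> string.join(page.links, "\n    ")
}
```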

The generated site map can be as simple as a text file or as snazzy as a detailed webpage report.

Bonus points for tests and making it as fast as possible!

You’ll be asked to talk through your code during the next interview round.
