Project Title

Real Estate Web Scraper in Haskell

Description

A simple concurrent real estate web scraper. This scraper is written in Haskell. It is a simple web scraper that uses the Haskell Conduit Downloader library to make HTTP requests and the Scalpel library to extract information from HTML pages. ATwo implementations of a cache and datalake, using STM and postgresql using Postgres Simple exist. The scraper is concurrent with each work started in a seperate thread.

Currently the scraper can only scrape the PropertyBook ZW website. The scraper can easily be extended to scrape other websites.

Getting Started

Dependencies

Should work anywhere you can use cabal
Haskell
Cabal
Postgres
Postgres Simple
Haskell Conduit Downloader
Scalpel
STM
UnliftIO

Executing program

Still a work in progress
You might need to take a look at the code to see how to use it in case you're early here.

$ cabal repl
λ> pbc <- newPropertyBookInMemoryCrawler
λ> import Crawler.Simple
λ>
λ> runCrawler pbc "https://www.propertybook.co.zw/"
λ>
λ> stopCrawler pbc

Help

All help is welcome. If you have any suggestions, please open an issue or a pull request.

Authors

Contributors names and contact info

ex. Trevor Sibanda ex. @trevorsibanda

Version History

0.0.1
- Initial Release

License

This project is licensed under the GNU GPLv3 License

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
app		app
samples		samples
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
LICENSE		LICENSE
README.md		README.md
property-scraper.cabal		property-scraper.cabal

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project Title

Description

Getting Started

Dependencies

Executing program

Help

Authors

Version History

License

About

Releases

Packages

Languages

License

trevorsibanda/hs-scraper

Folders and files

Latest commit

History

Repository files navigation

Project Title

Description

Getting Started

Dependencies

Executing program

Help

Authors

Version History

License

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages