This project scrapes profiles from meetme.com using docker. Follow the quick start and it will create an API for accessing all of the created profiles to be hosted on netlify.
Scrapes profiles between ages 19 and 35.
- MM_EMAIL=
- MM_PASSWORD=
Note the locations in app.rb
.
docker-compose up --build
- ssh into docker host
- run
ruby app.rb
- wait for sidekiq jobs to finish (takes about 1-2 hours)
- run
ruby consolidate.rb
- creates/results/profiles.json
with all crawled profile info and relative photo paths
In the ~/Desktop/results
, run netlify deploy
. Make note of the domain.
nearby_crawler
- fetches the profiles nearby and stores them. Queues fetching photo jsonget_photo_jsons(member_id)
- returns json of photosget_photo(url)
- persists photos
Output folder structure (in Docker):
/results/
- {member_id}.json
- {member_id}_{photo_id}.jpg