Code challenge for Data Engineer
We'd like you to grab data from the files listed below and propose an interrelated data model and schemas. After that, load the data into a free-tier AWS Redshift cluster and provide your code and documentation.
We'd like you to propose a way (setup and tools) to load mobile-app-related metadata from app stores into AWS Redshift on a daily basis.
We'd also like to learn more about your coding skills.
The file activity_points.geojson contains crowdsourced locations in Dar es Salaam, Tanzania. The quality and the source of the data are unknown. Some attributes are missing values for some records. There is no additional metadata.
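Because the attributes are only partly populated and undocumented, one possible first step is to measure how often each attribute is actually filled. The sketch below is only an illustration under assumptions: it assumes the file is a standard GeoJSON FeatureCollection, and the property names in the sample ("name", "activity") are made up for the demo, not taken from the real file. The helper names load_geojson and profile_properties are hypothetical.

```python
import json
from collections import Counter


def load_geojson(path):
    """Read a GeoJSON file into a dict (defined for convenience, not called here)."""
    with open(path, encoding="utf-8") as fh:
        return json.load(fh)


def profile_properties(feature_collection):
    """Return {property name: share of all features with a non-empty value}."""
    features = feature_collection.get("features", [])
    seen = set()
    filled = Counter()
    for feature in features:
        props = feature.get("properties") or {}
        for key, value in props.items():
            seen.add(key)
            # Treat None and blank strings as missing.
            if value is not None and str(value).strip() != "":
                filled[key] += 1
    total = len(features)
    return {key: filled[key] / total for key in seen} if total else {}


# Tiny in-memory sample; the property names are assumptions for illustration only.
sample = {
    "type": "FeatureCollection",
    "features": [
        {"type": "Feature", "geometry": None,
         "properties": {"name": "A", "activity": None}},
        {"type": "Feature", "geometry": None,
         "properties": {"name": "", "activity": None}},
    ],
}

for name, share in sorted(profile_properties(sample).items()):
    print(f"{name}: {share:.0%} filled")
```

For the real data, profile_properties(load_geojson("activity_points.geojson")) would give the same kind of summary, which helps decide which attributes are reliable enough to use.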
Your task is to derive bus stop locations from the data.
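One simple way to start on this task, shown here only as a sketch and not as the expected solution, is to treat places where several crowdsourced points fall close together as bus stop candidates. The code below bins points into a fixed grid and returns the centroid of every cell with enough points. It assumes the features carry Point geometries with [longitude, latitude] coordinates, as GeoJSON specifies. The function name candidate_stops, the cell size (0.0005 degrees, roughly 55 m near Dar es Salaam) and the threshold of 3 points are arbitrary assumptions. It does not filter by attribute, because the brief does not say which attribute, if any, marks bus-related activity.

```python
import math
from collections import defaultdict


def candidate_stops(features, cell_deg=0.0005, min_points=3):
    """Group Point features into grid cells and return centroids of dense cells."""
    cells = defaultdict(list)
    for feature in features:
        geom = feature.get("geometry") or {}
        if geom.get("type") != "Point":
            continue  # skip missing or non-point geometries
        lon, lat = geom["coordinates"][:2]
        # Integer cell index for this coordinate pair.
        key = (math.floor(lon / cell_deg), math.floor(lat / cell_deg))
        cells[key].append((lon, lat))

    stops = []
    for points in cells.values():
        if len(points) >= min_points:
            stops.append({
                "lon": sum(p[0] for p in points) / len(points),
                "lat": sum(p[1] for p in points) / len(points),
                "n_points": len(points),
            })
    # Densest candidates first.
    return sorted(stops, key=lambda s: -s["n_points"])


def point(lon, lat):
    """Build a minimal GeoJSON Point feature for the demo."""
    return {"type": "Feature", "properties": {},
            "geometry": {"type": "Point", "coordinates": [lon, lat]}}


# Made-up coordinates: three points close together and one isolated point.
demo = [point(39.2801, -6.8001), point(39.2802, -6.8002),
        point(39.2803, -6.8003), point(39.3000, -6.8500)]

for stop in candidate_stops(demo):
    print(stop)
```

Grid binning is cheap but splits clusters that straddle a cell boundary. A density-based method such as DBSCAN, or a comparison against known road geometry, could be used instead if that matters for the result.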