Skip to content

ThatsAMorais/data_test_develop

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 

Repository files navigation

Assignment

Write a script to download and parse the given XML feed, manipulate some of the data, and deliver a CSV of the required fields. You may use any additional libraries that you wish, please include a requirements.txt if you do.

CSV Requirements:

  • Contains only properties listed from 2016 [DateListed]
  • Contains only properties that contain `and' in the Description field
  • CSV ordered by DateListed
  • Required fields:
    • MlsId
    • MlsName
    • DateListed
    • StreetAddress
    • Price
    • Bedrooms
    • Bathrooms
    • Appliances (all sub-nodes comma joined)
    • Rooms (all sub-nodes comma joined)
    • Description (the first 200 characters)

Technical Requirements

  • Interpreter version: python 2.7
  • Reasonable unit test coverage
  • All libraries used must be documented in requirements.txt
    • We will be using pip install -r requirements.txt prior to running your code
  • Raw information to parse / feed url

Submission Requirements

  • Work should be tracked with Git
  • Submit final product by pushing your branch to this repo
    • Branch being submitted cannot be called master

Purpose

A lot of the work in our department is parsing and manipulating data from a variety of sources. The given example is one of our XML files that we send to Zillow for property syndication. Our goal in this test is to see how you will approach the processing of this feed. Your solution should take into the account that there will be other XML feeds that need parsing as well, so how modular/reusable you make the code is very important.

Time Considerations

This assignment is expect to take a few hours. We ask that you do not spend too much time on this solution. If you are stuck or have questions, feel free to reach out and we will answer quickly.

About

Initial code test for data team candidates. Testing ability to parse data programatically.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published