Skip to content

Week 08 (W2 Jan11) London RE

Chuck Lee edited this page Jan 11, 2017 · 8 revisions

Summary:

Over these weeks, we have developed some more vital descriptive features for our dataset and are now looking into developing predictive features using all of the available data attributes. Of course, we will also make alterations to the existing descriptive attributes as/if needed as well as create new ones when required. Short summary of the work done over this time period is:

  • Extracted geocoded values and other vital information for all public transit stations in London.
  • For each listing address, calculated the distance to the nearest public transport point.
  • Took last feedback into account and revised the Area measurement attribute(s).

Details of work done:

Public Transit data:

Using OpenStreetMap, extracted vital information about the metro public transit system

Histogram of Nearest Public Station

Histo!

Predictive Tasks - Future Work

Use of current values as training data

We are planning to use our current data as training data.

More data-mining

We are going to grab more recent entries from the Zoopla api and we will begin to use that data for our predictions. We will factor in some correlations and previous data and try to predict prices for these newer listings.

Clone this wiki locally