Skip to content

Week 08 (W2 Jan11) London RE

Chuck Lee edited this page Jan 11, 2017 · 8 revisions

Summary:

Over these weeks, we have developed some more vital descriptive features for our dataset and are now looking into developing predictive features using all of the available data attributes. Of course, we will also make alterations to the existing descriptive attributes as/if needed as well as create new ones when required. Short summary of the work done over this time period is:

  • Extracted geocoded values and other vital information for all public transit stations in London.
  • For each listing address, calculated the distance to the nearest public transport point.
  • Took last feedback into account and revised the Area measurement attribute(s).

Details of work done:

Public Transit data:

Using OpenStreetMap, extracted vital information about the metro public transit system

Histogram of Nearest Public Station

Histo!

As one can see, most listings are within 1-2 kilometers to public transit.

Predictive Tasks - Future Work

Use of current values as training data

We are planning to use our current data as training data.

More data-mining

We are going to grab more recent entries from the Zoopla api and we will begin to use that data for our tests. The information gathered from the training data will be able to somewhat accurately predict information for the test data.

Learning

We are still exploring techniques and ways to accomplish this. As we have just gotten back from Holidays, we are planning on meeting this week to discuss our tasks. Please stay tuned.

Clone this wiki locally