Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Challenge: Tele2 invites you to try your hand at Location-Based Analysis! #78
Tele2 will supply hackathon participants with anonymized VLR (Visitor Location Register) data. This data describes cell switches within a 12-month period in Tele2’s 2G and 3G network – when a device connects to a Tele2 network station. By combining this data with a map of the locations of Tele2 stations, it is possible to put together approximate location-based analysis (LBA).
The data provided is obscured to protect the privacy of our customers, yet this is an exclusive chance to use a mobile operator’s data to build services on location-based data. Maybe you’d like to build an app that suggests kindergartens or schools to parents with a specific route, so they can save time on commuting? Or try your hand at traffic analysis to optimize road planning? Combining the VLR data with other data available at the event and open data available online is bound to yield interesting results.
The amount of data is large and will be accessible to participants through a Hadoop framework. More details will be revealed prior to the event.
küsimus - kas andmete hulgas on ka välismaa roaming andmed, st. andmed selle kohta, kas Tele2 kliendid on viibinud Eestist väljas ja seal mobiilimastidesse ühendunud?
Või kui selliseid andmeid seal hulgas pole, siis kas on Tele2'hel teoreetiliselt see andmestik olemas, või see kuulub välismaa operaatoritele ja ligipääs puudub?
About the cluster
You can run HBase command line client as: hbase shell
About the data and its format
There is VLR data in the cluster and geographic mobile cell coverage maps.
Data in the cluster
More details on the format (even more details are in the Java code):
Table vlr-by-imsi (access entries by IMSI and date)
Table vlr-by-cell (access entries by Cell ID and date)
Date/time information in row keys stored in such a way, that a string like “201609” is first converted into an integer like 201609 and then the bytes comprising that integer are saved. In column keys, each part is stored separately in its own place.
MSISDN, IMSI and IMEI are anonymised, but in IMEI the part describing the type of the phone was preserved unchanged, you can use it.
Mobile coverage maps
About the code examples
What you should prepare to build on top of the examples?
(Of course, you are free to use any language or tech if you want)