Skip to content

Exploring the Spanish long distance railway transportation pricing system.

Notifications You must be signed in to change notification settings

Salvinha-vlc/Renfe-project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

16 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Ironhack Logo

A closer look into the Spanish railway passenger transportation pricing

Salvador Rocher Espinosa

Data Analytics, August 2019, Barcelona

Content

Project Description

As someone who lives and works in a Spanish city 400km away from home, I have found that the most convenient way to travel back and forth is to resort to the train. As a frequent user I have grown baffled of the pricing pattern upon buying the tickets, moving sometimes along the same levels, while others out of the most common levels.

Hence, this stirred me to know more about the Spanish long distance railway transportation pricing system.

Hypotheses / Questions

“Do train ticket prices really change over the days”?

And if so,

“Is there an optimal moment to buy them?”

The initial hypothesis is that prices really change over days, in particular, they move up as departure day approaches.

Bonus question: "Are there relevant intraday price ticket differences?"

Dataset

In this project, only Renfe’s long distances routes were considered.

The dataset is sourced from a Renfe scrapping process carried over by thegurus.tech (link below), where prices for the sampled routes departing trains where checked several times on loop each day. In particular, the trains whose priced were checked range near 3 months, from April 12th, 2019 to July 7th, 2019.

Dataset

Workflow

This is the workflow I envisaged for this project:

  1. Question formulation
  2. Data fetch
  3. Getting to know the raw data
  4. Data wrangling
  5. Data analysis and visualizations
  6. Conclusions
  7. Presentation

And this is the correspondence with the ipynb files that can be found in "The-code" folder.

  1. Getting to know data 1 --> 3)
  2. Getting to know data 2 --> 3)
  3. Data wrangling 1 --> 4)
  4. Data wrangling 2 --> 4)
  5. Paper. Analysis + figures --> 5), 6)

Note. 1) and 2) do not have correspondence with ipynb files, they have been explained here in this README file. 7) Presentation can be found in the repo along this README file.

Links

Repository
Slides
Medium

About

Exploring the Spanish long distance railway transportation pricing system.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published