Skip to content

An open source book to learn data science, data analysis and machine learning, suitable for all ages!


Notifications You must be signed in to change notification settings


Repository files navigation

Data Science Live Book

Data Science Live Book

tl;dr: Hi there! I invite you to read the book online and/or download here. Thanks and have a nice day :)

Paperback & Kindle at Amazon

This book is now available at Amazon in [Kindle]( Link: Black & White and color 📗 🚀.

LiIt can be shipped to over 100 countries. 🌎

Also available in PPDF :)

!(tl;dr): An overview...

It's a book to learn data science, machine learning, data analysis with tons of examples and explanations around several topics like:

  • Exploratory data analysis
  • Data preparation
  • Selecting best variables
  • Model performance

Most of the written R code can be used in real scenarios! I worked on the funModeling R package at the same time, so it is used many times in the book.

How about some examples?

It's a playbook with full of data preparation receipts.

I.e. in the missing values chapter you'll find how to input and convert these values into something useful for both, analysis and predictive modeling.

Other example, in the outliers chapter you'll get to know to some methods that spot outliers based on different criteria; funModeling contains a function that can help to process all data at once...

Or more conceptually, we have a numeric variable and we need to convert it into categorical, or vice-versa, do we have to convert or just leave it as it comes?

And so on and so on...

Book's philosophy

The book has all of its chapters interrelated, so you can start by any of them. My apologies if the number of links distracts from the reading. I wanted it that way just to show how all the machine learning concepts are somehow related.

There is a lot of effort in justifying what the book states. Yet, this is not enough, the reader can replicate and improve the examples, and thus generate their own knowledge.

To develop a critical thinking, without taking any statement as the "truly truth", it?s really important in this sea of books, courses, videos and any kind of technical material to learn. This book is just another view in the data science perspective.

I put some random errors...

... both technical and grammatical, the problem is I don't know where! So if you want to raise your hand and shout: "That's not correct! I think the correct form is... {replace-with-your-detailed-answer-here}", I invite you to report on the github repository, or email me at -at-

Download the PDF, epub and Kindle version!

If you learn anything new with this book, or it helped you somehow to saving time at your work, you can support the project by acquiring the portable version. (name your price starting at US$ 5)

There is no difference between the portable and web versions :)

After the purchase you'll will receive an email to download it in the three formats.

Download here!

Keep in touch: @pabloc_ds.

~ Thanks for reading !.