TeX R CSS Other
Switch branches/tags
Nothing to show
Clone or download
Latest commit 66a4fe4 Jul 8, 2018
Permalink
Failed to load latest commit information.
appendix lots of changes: references fix (html + pdf) , new funModeling chapte… Jan 23, 2018
data_preparation fix colname file Jul 8, 2018
emojis new parameters (deleted the "str_") Jan 19, 2018
exploratory_data_analysis trying new World_Development_Indicators.csv Jul 8, 2018
introduction version on amazon Apr 5, 2018
model_performance migration to bookdown!!! FINALLY!!!! Oct 5, 2017
scoring - final plot names on select vars and model performance Mar 11, 2018
selecting_best_variables migration to bookdown!!! FINALLY!!!! Oct 5, 2017
.Rbuildignore migration to bookdown!!! FINALLY!!!! Oct 5, 2017
.gitignore new version in treating empty values Feb 26, 2018
01_exploratory_data_analysis.Rmd - rewriting and typo of MAS score (thanks Kevin Hammond!) Jul 7, 2018
02_data_preparation.Rmd fix colname file Jul 8, 2018
03_selecting_best_variables.Rmd typo boxplot Jul 8, 2018
04_assesing_model_performance.Rmd img cap on appendix Mar 13, 2018
30_appendix.Rmd no message Mar 13, 2018
40_download_pdf_ebook.Rmd amazon Apr 17, 2018
99_references.Rmd lots of changes: references fix (html + pdf) , new funModeling chapte… Jan 23, 2018
DESCRIPTION migration to bookdown!!! FINALLY!!!! Oct 5, 2017
LICENSE migration to bookdown!!! FINALLY!!!! Oct 5, 2017
_bookdown.yml lots of changes: references fix (html + pdf) , new funModeling chapte… Jan 23, 2018
_build.sh migration to bookdown!!! FINALLY!!!! Oct 5, 2017
_output.yml fig caption in progess, fig are display in the correct place Feb 22, 2018
book.bib migration to bookdown!!! FINALLY!!!! Oct 5, 2017
create_site.R disable pdf generation on site Dec 21, 2017
create_site_dbg.R lots of changes: references fix (html + pdf) , new funModeling chapte… Jan 23, 2018
data-science-live-book.Rproj migration to bookdown!!! FINALLY!!!! Oct 5, 2017
google_analytics.html added ga Oct 8, 2017
index.Rmd typo May 5, 2018
packages.bib lots of changes: references fix (html + pdf) , new funModeling chapte… Jan 23, 2018
preamble.tex coloremoji deletion Oct 6, 2017
readme.md readme Feb 17, 2018
style.css migration to bookdown!!! FINALLY!!!! Oct 5, 2017
template.tex fig caption in progess, fig are display in the correct place Feb 22, 2018
toc.css migration to bookdown!!! FINALLY!!!! Oct 5, 2017

readme.md

Data Science Live Book

Data Science Live Book

tl;dr: Hi there! I invite you to read the book online and/or download here. Thanks and have a nice day :)

!(tl;dr): An overview...

It's a book to learn data science, machine learning, data analysis with tons of examples and explanations around several topics like:

  • Exploratory data analysis
  • Data preparation
  • Selecting best variables
  • Model performance

Most of the written R code can be used in real scenarios! I worked on the funModeling R package at the same time, so it is used many times in the book.


How about some examples?

It's a playbook with full of data preparation receipts.

I.e. in the missing values chapter you'll find how to input and convert these values into something useful for both, analysis and predictive modeling.

Other example, in the outliers chapter you'll get to know to some methods that spot outliers based on different criteria; funModeling contains a function that can help to process all data at once...

Or more conceptually, we have a numeric variable and we need to convert it into categorical, or vice-versa, do we have to convert or just leave it as it comes?

And so on and so on...


Book's philosophy

The book has all of its chapters interrelated, so you can start by any of them. My apologies if the number of links distracts from the reading. I wanted it that way just to show how all the machine learning concepts are somehow related.

There is a lot of effort in justifying what the book states. Yet, this is not enough, the reader can replicate and improve the examples, and thus generate their own knowledge.

To develop a critical thinking, without taking any statement as the "truly truth", it?s really important in this sea of books, courses, videos and any kind of technical material to learn. This book is just another view in the data science perspective.


Hmmm... next releases?

It could vary, but I have some ideas like how to put more information on the predictive model creation and validation, validating clustering models, dimension reduction techniques, "how to become a data scientist", among others.


I put some random errors...

... both technical and grammatical, the problem is I don't know where! So if you want to raise your hand and shout: "That's not correct! I think the correct form is... {replace-with-your-detailed-answer-here}", I invite you to report on the github repository, or email me at pcasas.biz -at- gmail.com


Download the PDF, epub and Kindle version!

If you learn anything new with this book, or it helped you somehow to saving time at your work, you can support the project by acquiring the portable version. (name your price starting at US$ 5)

There is no difference between the portable and web versions :)

After the purchase you'll will receive an email to download it in the three formats.

Download here!



Keep in touch: @pabloc_ds.

~ Thanks for reading !.