Skip to content

A collection of resources to facilitate your journey into data science

Notifications You must be signed in to change notification settings

justinmccoy/datascience_resources

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

22 Commits
 
 
 
 

Repository files navigation

A collection of resources to facilitate your journey into data science

Before getting started with data science it's important to realize there are many different roles involved:

alt text

  • Data Engineer ingesting, exploring, transforming, cleaning and understand data
  • Data Scientist shaping and evaluating data, creating or building models, and communicating and sharing results
  • Business Analyst understand the problem domain, evaluating models, or communicating results
  • App Developer consumes data and models to create applications

Data science is considered a team sport due to these different roles and need for collaboration. Instead of working in siloed environments it's helpful to have a central platform for all the roles to collaborate, for this I recommend getting started with the Data Science Experience. Here you can import data from many different sources, clean and transform the data, use tools and libraries of your choice (RStudio, Jupyter Notebooks, SPSS, TensorFlow), train and test models, setup automation of tasks, easily communicate results (Pixie Dust) and share models for consumption as API endpoints. You get all of this built on top of Apache Spark and IBM’s Cloud Platform.

Great Podcast about DataScience: Partially Derivative

Guides

Videos

21 Short Videos Introducing Features of the Data Science Experience

21 Short Videos Introducing Features of the Data Science Experience

Examples

PowerAI

Data

Libraries

About

A collection of resources to facilitate your journey into data science

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages