Skip to content

isaacarnault/Data_Science

Repository files navigation

Data collection and statistics using Python and R

Project Status: Active – The project has reached a stable, usable state and is being actively developed.

Scripting in Python and R

The following gist offers a focus on Data Collection, one of the stages* of the Data Science methodology. We will also perform basic math operations on a single dataframe to see how they render using Python or R.

Versioning

I used no vesioning system for this gist, which repos status is flagged as concept because it is intended to be a demo or POC (proof-of-concept).

Author

Licence

All public gists https://gist.github.com/aiPhD
Copyright 2018, Isaac Arnault
MIT License, http://www.opensource.org/licenses/mit-license.php

Sources

Exercise

  • Perform a data collection in Python and R using Jupyter.
    ⇢ Use the following dataframe from Spatialkey.com.
  • How many observations and variables does the dataframe contain? Base your assessment on your scripting outputs.
  • Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.
    — (*) Ten stages are crucial regarding Data Science methodology, among which Data collection. See architecture.md.

About

Data collection using Python and R - Original gist ⇢

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages