Data collection using Python and R - Original gist ⇢


Data collection and statistics using Python and R

Project Status: Active – The project has reached a stable, usable state and is being actively developed.

Scripting in Python and R

This gist focuses on Data Collection, one of the stages* of the Data Science methodology. It also performs basic math operations on a single dataframe to compare how they are written and what they output in Python and in R.

Versioning

I used no versioning system for this gist. The repo status is flagged as concept because it is intended as a demo or proof of concept (POC).

Author

Licence

All public gists https://gist.github.com/aiPhD
Copyright 2018, Isaac Arnault
MIT License, http://www.opensource.org/licenses/mit-license.php

Sources

Exercise

  • Collect the data in Python and in R using Jupyter.
    ⇢ Use the following dataframe from Spatialkey.com.
  • How many observations and variables does the dataframe contain? Base your assessment on your scripting outputs.
  • Calculate Sum, Min, Max and Mean of variable "raisedAmt" using Python (and Pandas) and using R.
    (*) The Data Science methodology comprises ten stages, one of which is Data collection. See architecture.md.
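
The Python half of the exercise could be approached along the following lines. This is a minimal sketch, not the solution from exercise_solution.md: the inline CSV, its company names and its amounts are made up for illustration, and the helper name `summarize` is hypothetical. Only the column name "raisedAmt" comes from the exercise.

```python
import io

import pandas as pd

# Hypothetical stand-in for the downloaded CSV file.
# The rows below are invented for illustration; only the
# column name "raisedAmt" is taken from the exercise.
SAMPLE_CSV = """company,raisedAmt
Alpha,1000
Beta,2500
Gamma,4000
"""


def summarize(df, column="raisedAmt"):
    """Return row/column counts and basic statistics for one numeric column."""
    # DataFrame.shape is (number of rows, number of columns),
    # i.e. (observations, variables).
    n_obs, n_vars = df.shape
    series = df[column]
    return {
        "observations": n_obs,
        "variables": n_vars,
        "sum": series.sum(),
        "min": series.min(),
        "max": series.max(),
        "mean": series.mean(),
    }


# Data collection step: load the CSV into a dataframe.
df = pd.read_csv(io.StringIO(SAMPLE_CSV))

summary = summarize(df)
print(summary)
```

To run this against the real dataset, replace `io.StringIO(SAMPLE_CSV)` with the path of the downloaded CSV file. In R, the built-in functions `dim()`, `sum()`, `min()`, `max()` and `mean()` cover the same steps.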