Since joining the Silicon Valley Hands On Programming meetup, we have happily finished edX/BerkeleyX CS100.1x and CS190.1x. Now we are switching gears to Stanford CS246: Mining Massive Data Sets (Winter 2015).
I am still practicing Java on CodingBat, so most of the solutions posted here will be written in Python for Apache Spark, Python with mrjob on Hadoop, and Ruby-Spark. Shell scripts using Wget and cURL will be used to fetch some practice data.