-
Notifications
You must be signed in to change notification settings - Fork 12
Big Data
Edmundas Mišeikis edited this page Mar 24, 2017
·
34 revisions
- Big Data
- Julia
- R
- Jupyter Notebook
- Apache Spark
- Apache Kafka
- Apache Solr
- Sendence Wallaroo
- Visualization
- Awesome Big Data
- Awesome Data Science
- Awesome Hadoop
- Awesome ElasticSearch
- Awesome JSON Datasets
- Big Data University
- The Dirtiest Little Secret about Big Data Jobs
- Scalable Stream Processing - A Survey of Storm, Samza, Spark and Flink
- Google Data Studio
- 50+ Data Science, Machine Learning Cheat Sheets, updated
- How Big Data Is Going To Change Over The Next Three Years
- I hate Matlab: How an IDE, a language, and a mentality harm
- JuliaBox
- Julia: A Fast Language for Numerical Computing
- Julia 0.5 Highlights
- Introduction to Julia Programming Language and its Features
- The R Project for Statistical Computing
- Awesome R on GitHub
- sparklyr — R interface for Apache Spark
- Tutorial: Scalable R on Spark with SparkR, sparklyr and RevoScaleR
- 18 New Must Read Books for Data Scientists on R and Python
- R exercises
- Running R on Amazon Athena
- (Julia, Python, R)
- 27 Jupyter Notebook tips, tricks and shortcuts | Jupyter (IPython) notebooks features
- nteract
- The State of Jupyter
- Awesome Spark
- on gitHub
- https://www.washingtonpost.com/news/the-switch/wp/2016/06/09/this-is-where-the-real-action-in-artificial-intelligence-takes-place/
- https://www.edx.org/course/introduction-apache-spark-uc-berkeleyx-cs105x?gclid=Cj0KEQjwhZm7BRCUyfS6ho2VjOEBEiQAumpGMsxsSGZZuglLa7NEP-oJ6DV397GAeFBkfk4-OukLlTgaAsmS8P8HAQ
- https://www.infoq.com/articles/apache-spark-introduction
- http://www.i-programmer.info/news/197-data-mining/9805-apache-spark-20-technical-preview.html
- https://www.supergloo.com/spark-tutorial/
- https://databricks.com/blog/2016/07/28/continuous-applications-evolving-streaming-in-apache-spark-2-0.html
- https://app.pluralsight.com/library/courses/apache-spark-fundamentals
- 7 Steps to Mastering Apache Spark
- A Powerful Big Data Trio: Spark, Parquet and Avro
- Apache Kafka: Online Talk Series
- Building a distributed Runtime for Interactive Queries in Apache Kafka with Vert.x
- High-throughput, low-latency, resilient, event-by-event data processing framework
- @sendenceeng - Sendence Engineering
- Hello Wallaroo!
- Metrocosm by @galka_max
- Superset
- [Video] 23 Visualizations and When to Use Them
- The 10 Best Data Visualization Articles of 2016 (and Why They Were Awesome)
- Data is Beautiful on Reddit
- Visdom - A flexible tool for creating, organizing, and sharing visualizations of live, rich data. Supports Torch and Numpy
-
Loopy - A tool for thinking in systems
- on GitHub by NickyCase @ncasenmare
- Confluent Graphs