Skip to content
Juan Rodriguez Hortala edited this page Jul 10, 2014 · 9 revisions

Hadoop Platform

Cloudera CDH 4.4.0

Software components dependencies

Dependencies are hosted in https://s3.amazonaws.com/juanrh.bicingbcn/pub/sw_dependencies/ as files with read permissions for anyone. Static hosting in S3 is not configured

  • Apache Spark 1.0.0 for CDH4
  • Apache Kafka 0.8.1.1 on Scala 2.10
  • Saiku Server 2.5: uses Mondrian 3.5.7
  • Redis 2.4.10
  • Apache Phoenix 3.0.0-incubating

Libraries

Python

  • Requests sudo pip2.7 install requests
  • Boto sudo pip2.7 install boto
  • Twisted sudo pip2.7 install Twisted
  • Sphinx $ sudo pip2.7 install Sphinx