
Household appliances lifetime prediction

Running the code

For now we have a data production and consumption demo. It can be run using:

docker-compose up --build

sensor_heater produces updates for the simulated appliance. These are passed to Kafka and then consumed by the db_interface and the streaming_worker.
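
As an illustration, a producer along the lines of sensor_heater might look like the following sketch. It assumes the kafka-python client and a broker reachable at kafka:9092; the topic name and payload fields are guesses, not taken from this repository.

    # Hypothetical sensor producer sketch; the broker address, topic name,
    # and payload fields are assumptions, not taken from this repository.
    import json
    import random
    import time

    from kafka import KafkaProducer

    producer = KafkaProducer(
        bootstrap_servers="kafka:9092",
        value_serializer=lambda v: json.dumps(v).encode("utf-8"),
    )

    while True:
        # Emit one simulated heater reading per second.
        reading = {
            "appliance_id": "heater-1",
            "temperature": round(random.uniform(40.0, 90.0), 2),
            "timestamp": int(time.time()),
        }
        producer.send("heater", reading)
        time.sleep(1)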

Spark job submission

After the infrastructure is set up, you can access a container called spark, from which the Spark jobs are submitted. Copy the relevant files into the container using docker cp path/to/script spark:/opt/bitnami/spark.

Prediction streaming

  • Copy the file spark/streaming_worker/streaming_worker.py to the spark container.
  • Run docker exec -it spark bash to get a shell inside the container.
  • Run spark-submit --master spark://spark-master:7077 --packages org.apache.spark:spark-sql-kafka-0-10_2.11:2.4.5 --py-files streaming_worker.py streaming_worker.py to submit the job (a sketch of what such a job might contain follows this list).
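
For orientation, a streaming job in the spirit of streaming_worker.py might resemble the sketch below. It assumes Spark Structured Streaming with the Kafka source bundled via the --packages flag above; the topic name and message schema are illustrative, not taken from this repository.

    # Hypothetical sketch of a streaming job; topic name and schema
    # are assumptions, not taken from this repository.
    from pyspark.sql import SparkSession
    from pyspark.sql.functions import col, from_json
    from pyspark.sql.types import (DoubleType, LongType, StringType,
                                   StructField, StructType)

    spark = SparkSession.builder.appName("streaming_worker").getOrCreate()

    schema = StructType([
        StructField("appliance_id", StringType()),
        StructField("temperature", DoubleType()),
        StructField("timestamp", LongType()),
    ])

    # Read the sensor updates from Kafka and parse the JSON payload.
    readings = (
        spark.readStream
        .format("kafka")
        .option("kafka.bootstrap.servers", "kafka:9092")
        .option("subscribe", "heater")
        .load()
        .select(from_json(col("value").cast("string"), schema).alias("r"))
        .select("r.*")
    )

    # Print parsed readings to the console; a real job would apply the
    # trained model to each micro-batch instead.
    query = readings.writeStream.format("console").start()
    query.awaitTermination()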

Machine learning

  • Copy the file spark/ml/regression.py to the spark container.
  • Run docker exec -it spark bash to get a shell inside the container.
  • Run spark-submit --packages com.datastax.spark:spark-cassandra-connector_2.11:2.4.2 --py-files regression.py regression.py to submit the job to Spark (see the sketch after this list).
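
A batch regression job like regression.py could be structured as in the following sketch. It assumes the spark-cassandra-connector loaded via the --packages flag above; the keyspace, table, and column names are placeholders, not taken from this repository.

    # Hypothetical regression job sketch; keyspace, table, and column
    # names are assumptions, not taken from this repository.
    from pyspark.ml.feature import VectorAssembler
    from pyspark.ml.regression import LinearRegression
    from pyspark.sql import SparkSession

    spark = SparkSession.builder.appName("regression").getOrCreate()

    # Load historical sensor data from Cassandra.
    df = (
        spark.read
        .format("org.apache.spark.sql.cassandra")
        .options(keyspace="smart_home", table="readings")
        .load()
    )

    # Assemble a feature vector and fit a simple linear regression
    # predicting remaining appliance lifetime.
    assembler = VectorAssembler(
        inputCols=["temperature", "usage_hours"], outputCol="features"
    )
    train = assembler.transform(df)

    model = LinearRegression(featuresCol="features", labelCol="lifetime").fit(train)
    model.transform(train).select("appliance_id", "prediction").show()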

System Architecture

Please find the Google Slides deck describing our system architecture here.

TODO:

  • Interactive map with pins that simulate households. Clicking a pin presents two options: view data (historical query) or request the lifetime of the household's appliances (Leaflet for Vue).
  • View data as interactive/dynamic graphs.
  • If there is no time for Leaflet, a list of all households would suffice.

About

Spark-based application that runs ML model inference on real-time data streamed from multiple household appliances.
