vionwinnie/pyspark-horse-race-predict


Integrating a Data Pipeline with TensorFlow Model Deployment Using PySpark

  • Presentation for PyCon HK 2020 Fall Session (Cantonese Track)
  • Speaker: Winnie Yeung
  • Slide Deck

Problem Description

How can we predict the winning horse in each Jockey Club horse race?
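One way to read this problem is as a per-race selection: a model scores every horse entered in a race, and the highest-scoring horse is the predicted winner. The sketch below illustrates only that last step in plain Python. The function name `pick_winners`, the tuple layout, and the scores are hypothetical and are not taken from this repository; the actual pipeline may frame the prediction differently.

```python
def pick_winners(scores):
    """Return {race_id: horse_id} with the highest-scoring horse per race.

    `scores` is an iterable of (race_id, horse_id, win_score) tuples.
    On a tie, the horse seen first is kept.
    """
    best = {}
    for race_id, horse_id, win_score in scores:
        # Keep the running best (horse, score) pair for each race.
        if race_id not in best or win_score > best[race_id][1]:
            best[race_id] = (horse_id, win_score)
    return {race_id: horse for race_id, (horse, _) in best.items()}


# Hypothetical model outputs: (race_id, horse_id, win_score).
scores = [
    ("R1", "H1", 0.12), ("R1", "H2", 0.55), ("R1", "H3", 0.33),
    ("R2", "H4", 0.70), ("R2", "H5", 0.30),
]
winners = pick_winners(scores)
print(winners)  # {'R1': 'H2', 'R2': 'H4'}
```

On a cluster, the same grouping would typically be expressed as a window or group-by over the race identifier rather than a Python loop.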

Tech Stack

  • GCP Dataproc, PySpark 2.4.7, pandas, TensorFlow 2.0, Java

Running jobs

  • Run an individual script: pyspark < script.py
  • Submit a job on GCP Dataproc: in /shells/, run nohup ./submit_inference_job.sh &

Credits

Useful links:

About

Using PySpark for TensorFlow model inference on a GCP Dataproc cluster. Demo for the PyCon Hong Kong Fall 2020 presentation.
