Permalink
Cannot retrieve contributors at this time
Name already in use
A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
2019-web/data/talks/169.yaml
Go to fileThis commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
28 lines (16 sloc)
1.43 KB
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
| # Talk details are specified in YAML files | |
| # YAML was selected because we can use multi-line strings and add | |
| # comments in the file. | |
| speaker_name: "Jonathan Rioux" | |
| talk_title: "PySpark: avoiding common pitfalls and keeping your sanity" | |
| # At least 1 tag is necessary!! | |
| talk_tags: | |
| - "Machine Learning & Data Science" | |
| - "Tools, Testing, and Practices" | |
| talk_abstract: "For a Python developer, using PySpark can often feel foreign, like driving a race car in sandals. You see the power, yet it feels like you're fighting against the machine. This talk is about battle stories using PySpark from development to production, and how my many errors can lead to better code on your end. In no particular order, I'll discuss about speeding up your development, avoiding 'friendly enemies' and testing your code. You'll see how to avoid embarrassing mistakes by seeing me making them, and you'll leave a more insightful PySpark developer." | |
| about_author: "Jonathan is the data science practice lead for EPAM Canada, a global engineering consultancy. He worked in insurance, analytics and data science for a little over a decade. He is passionate about programming languages and how they allow to map more and more complex ideas. Jonathan is the author of _PySpark in Action_ (Manning, scheduled for 2020)." | |
| talk_metadata: | |
| - "**Date:** Sunday Nov. 17" | |
| - "**Location:** Round Room (PyData Track)" | |
| - "**Begin time:** 11:00" | |
| - "**Duration:** 25 minutes" |