AWS Glue tutorial

AWS Glue tutorial for data developers.

This is a companion repository for the AWS Glue tutorial with Spark and Python for data developers.

DynamicFrame vs DataFrame in AWS Glue

Note the difference between DynamicFrame and DataFrame. A DataFrame is Spark's native table-like structure with a fixed schema. The DynamicFrame class is AWS's attempt to address limitations of the DataFrame: its records are self-describing, so no schema is required up front.

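DynamicFrames are handy for reading and writing data. The processing itself is often more efficient with standard PySpark functions, so a common pattern is to convert a DynamicFrame to a DataFrame with toDF(), transform it, and convert it back with DynamicFrame.fromDF().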
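
The read, process, write split described above might look roughly like the sketch below. It is not taken from glue_script.py. The function name process_table, the column "year", and the Parquet output format are assumptions made for illustration. Only the Glue calls themselves (create_dynamic_frame.from_catalog, toDF, DynamicFrame.fromDF, write_dynamic_frame.from_options) are part of the AWS Glue Python library.

```python
def process_table(glue_context, database, table_name, output_path):
    """Read with a DynamicFrame, transform as a DataFrame, write as a DynamicFrame.

    Hypothetical helper: the name, the arguments, and the "year" column
    are illustrative assumptions, not part of this repository.
    """
    # Imported lazily so this module can also be loaded outside a Glue job.
    from awsglue.dynamicframe import DynamicFrame

    # 1. Read: the DynamicFrame reader resolves the table via the Glue Data Catalog.
    source = glue_context.create_dynamic_frame.from_catalog(
        database=database, table_name=table_name
    )

    # 2. Process: switch to a Spark DataFrame and use standard PySpark functions.
    df = source.toDF()
    counts = df.groupBy("year").count()  # assumes the table has a "year" column

    # 3. Write: convert back and use the DynamicFrame writer.
    result = DynamicFrame.fromDF(counts, glue_context, "result")
    glue_context.write_dynamic_frame.from_options(
        frame=result,
        connection_type="s3",
        connection_options={"path": output_path},
        format="parquet",  # assumed output format
    )
    return result
```

Inside a Glue job, glue_context would typically be created as GlueContext(SparkContext.getOrCreate()), and the database, table, and S3 path would come from your own Data Catalog and bucket.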
