Big Data & Cloud Computing - PySpark, Dask, GCP, ...
Updated Jul 8, 2019 - HTML
In this phase of the project we matched schemas using Jaccard similarity, and then used an instance-based matcher from the FlexMatcher package to compare schemas of COVID-19 data.
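As a rough illustration of the first step, the sketch below scores pairs of column names by the Jaccard similarity of their word tokens and keeps the best match above a threshold. It is a minimal sketch under stated assumptions, not the project's actual code: the helper names (`jaccard`, `tokens`, `match_columns`), the tokenization rule, the 0.5 threshold, and the sample column names are all hypothetical, and the FlexMatcher step is not shown.

```python
def jaccard(a, b):
    """Jaccard similarity |A ∩ B| / |A ∪ B| of two collections."""
    a, b = set(a), set(b)
    if not a and not b:
        return 1.0  # convention: two empty sets are treated as identical
    return len(a & b) / len(a | b)


def tokens(name):
    """Split a column name into lower-case word tokens (assumed rule)."""
    normalized = name.lower().replace("-", "_").replace(" ", "_")
    return {part for part in normalized.split("_") if part}


def match_columns(source_cols, target_cols, threshold=0.5):
    """Map each source column to its best-scoring target column.

    Columns whose best score falls below the (hypothetical) threshold
    are left unmatched.
    """
    matches = {}
    for s in source_cols:
        best = max(target_cols, key=lambda t: jaccard(tokens(s), tokens(t)))
        score = jaccard(tokens(s), tokens(best))
        if score >= threshold:
            matches[s] = (best, score)
    return matches


# Hypothetical column names from two COVID-19 tables.
source = ["confirmed_cases", "Country Region", "date_reported"]
target = ["cases_confirmed", "country_region", "deaths"]
result = match_columns(source, target)
```

Here `confirmed_cases` and `cases_confirmed` share the same token set, so they match regardless of word order, while `date_reported` shares no tokens with any target column and is left unmatched. A name-based score like this says nothing about the values in each column, which is the gap an instance-based matcher is meant to cover.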