This notebook removes duplicates from your Google Drive.
Updated Feb 1, 2023 - Jupyter Notebook
Data mining on Stack Overflow Q&A data to understand the landscape of languages and developers in computer science.
A simple tool that compares new data against historical records and tags each row as either duplicate or NULL. My team of interns built it with PySpark and Jupyter Notebook in Microsoft Fabric as a practice exercise within Lexmark Research and Development Corporation's Digital Transformation program.
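The tagging idea in that description can be illustrated with a minimal sketch. This is not the repository's code: it uses plain Python instead of PySpark, and the function name `tag_duplicates`, the `key_fields` parameter, and the `tag` column are all hypothetical. It also assumes that a row counts as a duplicate when its key fields exactly match a historical row, which the description does not specify.

```python
from typing import Any, Dict, List, Optional, Sequence

Row = Dict[str, Any]


def tag_duplicates(
    new_rows: Sequence[Row],
    historical_rows: Sequence[Row],
    key_fields: Sequence[str],
) -> List[Row]:
    """Return copies of new_rows with an added "tag" field.

    Assumption: a new row is a duplicate when its values for key_fields
    exactly match those of some historical row. Otherwise the tag is
    None, standing in for NULL.
    """
    # Build a set of key tuples from the historical records for fast lookup.
    seen = {tuple(row[f] for f in key_fields) for row in historical_rows}

    tagged: List[Row] = []
    for row in new_rows:
        key = tuple(row[f] for f in key_fields)
        tag: Optional[str] = "duplicate" if key in seen else None
        # Copy the row so the caller's input is left unmodified.
        tagged.append({**row, "tag": tag})
    return tagged


# Hypothetical sample data for illustration only.
historical = [{"id": 1, "name": "alpha"}]
incoming = [{"id": 1, "name": "alpha"}, {"id": 2, "name": "beta"}]
result = tag_duplicates(incoming, historical, ["id", "name"])
```

In a Spark setting the same effect would more likely come from a join between the new and historical tables, but the set lookup keeps the sketch self-contained.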