Skip to content
#

preprocessing

Here are 1,381 public repositories matching this topic...

The Frequent Dataset Mining project offers a comprehensive solution for mining frequent itemsets from the extensive Amazon dataset using Apache Kafka. Leveraging the power of distributed computing, this project employs two powerful algorithms, Apriori and PCY, to efficiently process and analyze large volumes of data.

  • Updated May 31, 2024
  • Python

This repository contains the PMAAD course project from the Artificial Intelligence Degree at Universitat Politècnica de Catalunya. It models and analyzes Spotify's top 40 weekly streamed songs (2017-2021) using R. Techniques include clustering, textual analysis, and geospatial analysis to uncover music trends and characteristics.

  • Updated May 30, 2024
  • HTML

Breast Cancer Data Analysis: Analyzes and classifies breast cancer data using a Naive Bayes classifier with preprocessing, label encoding, and k-fold cross-validation. Cars Dataset Analysis: Explores a cars dataset with data loading, statistics, and visualizations, including price distribution and correlation heatmap. Hayes-Roth Classification: C

  • Updated May 28, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the preprocessing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the preprocessing topic, visit your repo's landing page and select "manage topics."

Learn more