modin

Here are 24 public repositories matching this topic...

gandalf1819 / NYCOpenData-Profiling-Analysis

Open Data Profiling, Quality and Analysis on NYC OpenData dataset with semantic profiling using fuzzy ratio, Levenshtein distance and regex

big-data pandas pyspark levenshtein-distance hdfs dask regular-expressions fuzzywuzzy fuzzy-logic data-profiling nyc-opendata modin nyc-311-dataset dask-distributed

Updated Nov 10, 2020
Jupyter Notebook

mzjp2 / kedro-dataframe-dropin

A Kedro plugin that provides drop-in replacements (e.g. Modin and cuDF) for the pandas datasets

data gpu-acceleration modin rapidsai kedro-plugin kedro-catalog

Updated Feb 2, 2021
Python

bhattbhavesh91 / modin-example

Simple example of how Modin can speed up your Pandas workflows by changing a single line of code

python pandas-dataframe pandas modin

Updated May 31, 2021
Jupyter Notebook

adrianmarino / recommendation-system-approaches

Recommendation system approaches

spark deep-learning tensorflow keras recommender-system ray movielens modin

Updated Mar 1, 2022
Jupyter Notebook

jacobceles / Movie-Recommendation-Rating-Prediction

Using the MovieLens dataset with Surprise to compare different algorithms for rating prediction, and also create a movie recommendation system on top of it.

python machine-learning movie-recommendation rating-prediction surprise-python modin surprise-library

Updated May 6, 2022
Jupyter Notebook

unum-cloud / udsb

Unlimited Data-Science Benchmarks for Numeric, Tabular and Graph Workloads

arrow sqlite numpy pandas cublas networkx dask apache-arrow modin cudf cugraph

Updated Mar 31, 2023
Jupyter Notebook

comprakash / delta-transformation-pipeline

A transformation pipeline for Delta Lake using AWS SDK for Pandas

kubernetes etl s3 ray transformation modin delta-lake delta-rs aws-sdk-pandas kuberay

Updated Jul 12, 2023
Python

Helzheng123 / datasci_2_manipulation

Delve deeper into data manipulation using Python's prominent libraries. Explore the functionalities of Pandas and get a glimpse of alternatives like Polars, Dask, and Modin.

python data-transformation pandas data-cleaning modin polars

Updated Sep 12, 2023
Jupyter Notebook

c-susan / datasci_2_manipulation

HHA507 / Data Science / Assignment 2 / Data Manipulation

numpy jupyter-notebook data-transformation pandas data-cleaning modin polars

Updated Sep 25, 2023
Jupyter Notebook

murattkiran / File-ingestion-and-schema-validation

schema-validation dask file-ingestion modin

Updated Nov 9, 2023
Jupyter Notebook

hariprasath-v / Intel_oneAPI_Hackerearth_Predict-the-quality-of-freshwater

Build a machine learning model to predict whether freshwater is safe to drink, based on measures such as pH and TDS.

exploratory-data-analysis pandas python3 xgboost classification lightgbm catboost modin f1score onedal shapash

Updated Nov 28, 2023
HTML

PratikDavidson / intel-oneAPI-LLM

oneAPI Hackathon: The LLM Challenge

intel keras-tensorflow modin huggingface oneapi bert-fine-tuning huggingface-transformers streamlit-webapp itex

Updated Jan 27, 2024
Jupyter Notebook

oneapi-src / ai-structured-data-generation

AI Starter Kit to generate structured synthetic data using Intel® Distribution of Modin

machine-learning modin

Updated Feb 2, 2024
Python

drshahizan / Python-big-data

Python and Pandas are known to have issues around scalability and efficiency. You will learn how to use libraries such as Modin, Dask, Ray, and Vaex to overcome the problems faced by Pandas.

data-science pandas dask ray vaex modin

Updated Feb 20, 2024
Jupyter Notebook

jmcarpenter2 / swifter

A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

pandas-dataframe parallel-computing parallelization pandas dask modin

Updated Mar 20, 2024
Python

ivanbgd / bioinf_demo

A Bioinformatics demo in Python working with FASTQ files and using the Modin library

python bioinformatics computational-biology trie python3 biopython fastq modin larger-than-memory

Updated Mar 21, 2024
Python

AAWorks / options-pricing

Global Markets Options Pricing

openai-gym pandas dqn make monte-carlo-simulation deep-q-network black-scholes-merton modin streamlit tf-agents binomial-pricing polygon-api

Updated May 2, 2024
Python

intel / hdk

A low-level execution library for analytic data processing.

data-science machine-learning query cpu sql analytics gpu pandas query-builder query-engine heterogeneous-parallel-programming modin

Updated May 9, 2024
C++

ray-project / xgboost_ray

Distributed XGBoost on Ray

data-science machine-learning kaggle xgboost dask modin

Updated Jun 25, 2024
Python

prehensilecode / sge_accounting_stats

Simple stats on SGE accounting data

python hpc pandas sge data-analysis modin

Updated Jul 4, 2024
Jupyter Notebook
