Skip to content
View sinhrks's full-sized avatar

Organizations

@pydata @stan-ja @dask @pandas-ml @pandas-dev

Block or report sinhrks

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Reproducibility for Humans: A lightweight tool to perform reproducible machine learning experiment.

Python 24 5 Updated Apr 24, 2019

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 15,077 3,646 Updated Mar 8, 2025

N-D labeled arrays and datasets in Python

Python 3,727 1,115 Updated Mar 7, 2025

Design documents and code for the pandas 2.0 effort.

Python 303 39 Updated Nov 9, 2018

It'll detect your anomalies! Part of the Kale stack.

Python 2,134 334 Updated Feb 22, 2016

Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more

Python 44,786 18,311 Updated Mar 7, 2025

Apache Airflow - A platform to programmatically author, schedule, and monitor workflows

Python 39,079 14,774 Updated Mar 9, 2025

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 64,871 14,625 Updated Mar 8, 2025

Feather: fast, interoperable binary data frame storage for Python, R, and more powered by Apache Arrow

JavaScript 2,741 165 Updated Nov 19, 2021

Quickly and accurately render even the largest data.

Python 3,379 373 Updated Mar 1, 2025

Gaussian processes framework in python

Python 2,076 564 Updated Jan 15, 2025

pandas japanese extension

Python 82 10 Updated Jul 23, 2020

Stan models for state space time series

R 143 41 Updated Jul 3, 2017

pandas, scikit-learn, xgboost and seaborn integration

Python 319 78 Updated Aug 14, 2020
Python 10 4 Updated Mar 5, 2018

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 26,673 8,749 Updated Mar 8, 2025

Experimental multicore fork of Python 3

Python 584 25 Updated Dec 10, 2024

Airspeed Velocity: A simple Python benchmarking tool with web-based reporting

Python 890 185 Updated Feb 27, 2025

Parallel computing with task scheduling

Python 12,999 1,750 Updated Mar 7, 2025

the portable Python dataframe library

Python 5,581 617 Updated Mar 9, 2025

a web application framework for python

Python 833 156 Updated Mar 12, 2022

A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning al…

R 1,879 334 Updated Sep 16, 2022

A flexible framework of neural networks for deep learning

Python 5,910 1,366 Updated Aug 28, 2023

A Theano framework for building and training neural networks

Python 1,155 349 Updated Feb 19, 2019

R interface to Bokeh http://hafen.github.io/rbokeh/

R 311 64 Updated Nov 1, 2023

Data Migration for the Blaze Project

Python 1,004 136 Updated Jul 15, 2022

Recipes for using Python's pandas library

Jupyter Notebook 6,748 2,331 Updated Oct 24, 2024

NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.

Python 5,419 1,324 Updated Dec 22, 2020

IPython kernel for Torch with visualization and plotting

Jupyter Notebook 1,096 156 Updated Nov 10, 2017

Define fortify and autoplot functions to allow ggplot2 to handle some popular R packages.

R 528 65 Updated Jun 24, 2024
Next
Showing results