Skip to content
@capitalone

Capital One

We’re an open source-first organization — actively using, contributing to and managing open source software projects.

Pinned Loading

  1. DataProfiler Public

    What's in your data? Extract schema, statistics and entities from datasets

    Python 1.5k 169

  2. datacompy Public

    Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

    Python 539 139

  3. locopy Public

    locopy: Loading/Unloading to Redshift and Snowflake using Python.

    Python 107 48

  4. rubicon-ml Public

    Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

    Jupyter Notebook 132 36

  5. dataCompareR Public

    dataCompareR is an R package that allows users to compare two datasets and view a report on the similarities and differences.

    R 76 26

  6. edgetest Public

    edgetest is a tox-inspired python library that will loop through your project's dependencies, and check if your project is compatible with the latest version of each dependency

    Python 23 8

Repositories

Showing 10 of 47 repositories
  • rubicon-ml Public

    Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

    Jupyter Notebook 132 Apache-2.0 36 11 1 Updated Mar 25, 2025
  • datacompy Public

    Pandas, Polars, Spark, and Snowpark DataFrame comparison for humans and more!

    Python 539 Apache-2.0 139 6 (1 issue needs help) 1 Updated Mar 25, 2025
  • locopy Public

    locopy: Loading/Unloading to Redshift and Snowflake using Python.

    Python 107 Apache-2.0 48 6 (1 issue needs help) 0 Updated Mar 25, 2025
  • DataProfiler Public

    What's in your data? Extract schema, statistics and entities from datasets

    Python 1,471 Apache-2.0 169 65 (8 issues need help) 6 Updated Mar 20, 2025
  • edgetest Public

    edgetest is a tox-inspired python library that will loop through your project's dependencies, and check if your project is compatible with the latest version of each dependency

    Python 23 Apache-2.0 8 4 (1 issue needs help) 0 Updated Mar 7, 2025
  • Stratum-Observability Public

    A no-dependency library defining a framework for sending analytics and observability events in a standardized format. Stratum-Observability is a plugin-based framework that allows you to create your own custom plugins to define, validate, and publish events to your observability stack.

    TypeScript 22 Apache-2.0 6 2 0 Updated Mar 3, 2025
  • TestProject Public
    0 0 0 0 Updated Feb 26, 2025
  • federated-model-aggregation Public

    The Federated Model Aggregation (FMA) Service is a collection of installable python components that make up the generic workflow/infrastructure needed for federated learning.

    Python 31 Apache-2.0 12 17 (1 issue needs help) 1 Updated Jan 14, 2025
  • global-attribution-mapping Public

    GAM (Global Attribution Mapping) explains the landscape of neural network predictions across subpopulations

    Python 33 Apache-2.0 25 10 1 Updated Jan 13, 2025
  • ablation Public

    Evaluating XAI methods through ablation studies.

    Python 15 Apache-2.0 7 1 0 Updated Dec 28, 2024

Top languages

Loading…

Most used topics

Loading…