@ydataai

YData

Accelerating AI with improved data

YData.ai Medium LinkedIn Twitter Youtube Data-Centric AI Discord YData Profiling YData Synthetic YData Academy

Welcome to YData

Our mission is to help data science teams access and understand their data assets, and produce quality data to successfully deploy machine learning models.

We're the creators of YData Fabric, the first data-centric platform for data quality. We're also strong advocates of open source software and we're actively developing ydata-profiling, ydata-synthetic, and ydata-quality, three open source projects focused on producing high-quality data for machine learning applications.

You can stay up to date with the latest developments on our News or follow our Medium blog for hands-on tutorials on our open source packages.

We have a growing community of data scientists on our Discord Server, where we discuss emergent topics on Data Profiling, Data Labeling, and Synthetic Data. Join us to share feedback and discuss feature requests!

You can also find out all about our monthly events and data initiatives in our newsletter or reach us at developers@ydata.ai.

Pinned

  1. ydata-profiling Public

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    Python 12.8k 1.7k

  2. ydata-fabric-sdk Public

    Fabric SDK to interact with the Fabric platform

    Python 19 7

  3. ydata-synthetic Public

    Synthetic data generators for tabular and time-series data

    Jupyter Notebook 1.5k 249

  4. academy Public

    Tutorials for YData's Fabric platform

    Jupyter Notebook 31 7

  5. ydata-talkdatatome Public

    Make your dataset talk to you. The AI assistant for data preparation.

    Python 9 1

  6. sd-metrics Public

    A repository that collects different metrics to evaluate the quality of synthetic data under the scope of data democratization. The metrics evaluate the quality of the synthetic data under the following …

    2

Repositories

Showing 10 of 71 repositories
  • ydata-synthetic Public

    Synthetic data generators for tabular and time-series data

    Jupyter Notebook 1,511 MIT 249 44 8 Updated Mar 10, 2025
  • ydata-profiling Public

    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.

    Python 12,778 MIT 1,697 243 (39 issues need help) 21 Updated Mar 10, 2025
  • go-core Public

    Core and shared code for our go projects

    Go 4 MIT 0 1 1 Updated Mar 10, 2025
  • ydata-fabric-sdk Public

    Fabric SDK to interact with the Fabric platform

    Python 19 MIT 7 1 11 Updated Mar 9, 2025
  • aws-adapter Public

    AWS Adapter

    Go 0 0 1 4 Updated Mar 10, 2025
  • python-core Public

    Core functionality for all python packages at YData

    Python 0 MIT 1 1 8 Updated Mar 9, 2025
  • azure-adapter Public

    Azure Adapter

    Go 0 0 1 5 Updated Mar 7, 2025
  • authentication-service Public

    Handles authentication using OIDC flow

    Go 2 MIT 0 1 9 Updated Mar 7, 2025
  • aws-asg-tags-lambda Public

    A lambda that extracts the auto scaling groups from the k8s node pools provided by the user and adds the specified tags to those nodes

    Swift 5 MIT 0 1 7 Updated Mar 3, 2025
  • JavaScript 2 MIT 0 1 7 Updated Mar 3, 2025