Skip to content

The publication is a collection of sample code to show how data from SAP and non-SAP systems can be made available for training in ANY hyperscaler machine learning service via several layers of abstraction from data connection to training using our FedML Python libraries.

License

Notifications You must be signed in to change notification settings

AFK-Python/datasphere-fedml

 
 

Repository files navigation

REUSE status

FedML

Description


The SAP Federated ML Python libraries (FedML) applies the Data Federation architecture of SAP Datasphere for intelligently sourcing SAP as well as non-SAP data for Machine Learning experiments done at the Hyperscalers thereby removing the need for replicating or moving data. By abstracting the Data Connection, Data load, and Model training (with flexibility and provision for user provided training scripts), Model Deployment, and Inferencing for Hyperscaler Machine learning processes , the FedML library provides end to end integration with few lines of code .

What's New

Here are some high level major feature additions in Version 2.0 of FedML:

  • FedML is now pip installable from PyPi repo.
  • Support for deployment to hyperscaler environment.
  • Support for deployment to SAP Business Technology Platform Kyma environment.
  • Support for inferencing with hyperscaler deployed as well as Kyma deployed models.
  • Support for writing inferenced results back to SAP Datasphere.

Solution Architecture

ARD

Requirements

  • SAP Datasphere tenant instance, with connectivity established to the remote data sources, and views exposed, that can be consumed by FedML.

  • Access to corresponding Hyperscaler Machine learning environments with approriate configurations. See Configuration section.


Download and Installation

  1. Try out examples from the samples-notebooks directory of corresponding Hyperscaler library

  2. For setting up the remote models in SAP Datasphere to federate data from hyperscaler data stores for use with FedML , here are some Discovery Mission examples :


Configuration

  • For AWS FedML library specific pre-requisites, configuration and documentation, please refer here
  • For GCP FedML library specific pre-requisites, configuration and documentation, please refer here
  • For Azure FedML library specific pre-requisites, configuration and documentation, please refer here
  • For Databricks FedML library specific pre-requisites, configuration and documentation, please refer here

Limitations

None

How to obtain support

This project is provided "as-is" with no expectation for major changes or support.
Create an issue in this repository if you find a bug or have questions about the content.
For additional support, ask a question in SAP Community.

Licensing

Copyright (c) 2021 SAP SE or an SAP affiliate company. All rights reserved. This project is licensed under the Apache Software License, version 2.0 except as noted otherwise in the LICENSE file.

About

The publication is a collection of sample code to show how data from SAP and non-SAP systems can be made available for training in ANY hyperscaler machine learning service via several layers of abstraction from data connection to training using our FedML Python libraries.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 82.1%
  • Python 17.9%