Skip to content

Latest commit

 

History

History
49 lines (35 loc) · 1.41 KB

ROADMAP.md

File metadata and controls

49 lines (35 loc) · 1.41 KB

HybridBackend Roadmap

HybridBackend v0.6 (2022-05)

Objective: "Communication-efficient training and evaluation at scale"

  • Data-Parallel Training and Evaluation

    • Bucketized Gradients Aggregation using AllReduce
    • Global Metric Operations
    • Out-Of-Range Coordination
  • Hybrid-Parallel Embedding Learning

    • Bucketized Embedding Exchanging using AllToAllv
    • Fusion and Quantization of AllToAllv
    • Fusion of Partitioning and Stitching

Objective: "Easy to use with existing AI workflows"

  • Usability

    • Support of MonitoredSession and Estimator
    • Declarative API for Model Definition
  • Compatibility

    • Support of NVIDIA TensorFlow and DeepRec
  • Interoperability

    • Inference Pipeline Needs No Change
    • Support of SavedModel
    • Support of Variable, XDL HashTable and PAI Embedding Variable

HybridBackend v0.5 (2021-11)

Objective: "Memory-efficient loading of categorical data"

  • Parquet Dataset

    • Reading batch of tensors from numeric fields in zero-copy way
    • Reading batch of sparse tensors from numeric list fields in zero-copy way
    • Support of string fields
    • Support of local filesystem, HDFS, S3 and OSS
  • Data Pipeline Functions

    • Resizing batch of tensors and ragged tensors
    • Converting ragged tensors to sparse tensors

Objective: "Easy to use with existing AI workflows"

  • Compatibility
    • Support of TensorFlow 1.15 and Tensorflow 1.14
    • GitHub actions for uploading wheels to PyPI