AISDC/DNNTrainerFlow

DNNTrainerFlow

A demonstration of an automatic workflow for rapid DNN training using remote AI system resources.

The sample DNN used in the demo is available from the ddp branch of BraggNN.

The big picture

[Figure: The Big Picture]

The implementation

[Figure: the implementation]

Requirements

AI system side

  • funcx_endpoint=0.3.2
  • PyTorch=1.9.0
  • horovod=0.22.1
  • h5py=2.10.0
  • numpy=1.19.2

Client/User side

  • globus-automate-client=0.12.0
  • funcx=0.3.2
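
The pinned versions above can be checked against an existing environment before running the workflow. The following is a minimal sketch, not part of the repository: the helper names (`installed_version`, `check_requirements`) are hypothetical, and the dictionary keys are assumed PyPI distribution names (for example `torch` for PyTorch and `funcx-endpoint` for funcx_endpoint), which may differ if you install through conda.

```python
from importlib import metadata

# Pinned versions copied from the requirement lists above.
# Keys are ASSUMED PyPI distribution names; adjust them to your environment.
AI_SYSTEM_SIDE = {
    "funcx-endpoint": "0.3.2",
    "torch": "1.9.0",
    "horovod": "0.22.1",
    "h5py": "2.10.0",
    "numpy": "1.19.2",
}
CLIENT_SIDE = {
    "globus-automate-client": "0.12.0",
    "funcx": "0.3.2",
}


def installed_version(dist_name):
    """Return the installed version string, or None if the package is missing."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def check_requirements(required, lookup=installed_version):
    """Return {name: (wanted, found)} for every package that does not match."""
    mismatches = {}
    for name, wanted in required.items():
        found = lookup(name)
        # Drop local build tags such as "+cu111" before comparing.
        if found is None or found.split("+")[0] != wanted:
            mismatches[name] = (wanted, found)
    return mismatches


if __name__ == "__main__":
    for side, required in (("AI system", AI_SYSTEM_SIDE), ("client", CLIENT_SIDE)):
        for name, (wanted, found) in check_requirements(required).items():
            print(f"[{side}] {name}: want {wanted}, found {found}")
```

Run the script on the AI system and on the client machine separately; on each machine only the report for that side is relevant, and an empty report for that side means every listed package matches its pinned version.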

Contacts and Docs

More details can be found at https://arxiv.org/abs/2105.13967.

Please reach out to zhengchun.liu#@#anl.gov if you run into problems.
