Skip to content

Latest commit

 

History

History
106 lines (73 loc) · 3.54 KB

index.rst

File metadata and controls

106 lines (73 loc) · 3.54 KB

Welcome to AWS Neuron

AWS Neuron is the SDK used to run deep learning workloads on AWS Inferentia and AWS Trainium based instances. It supports customers in their end-to-end ML development lifecycle to build new models, train and optimize these models, and then deploy them for production. To learn about the model architectures currently supported on Inf1, Inf2, Trn1 and Trn1n instances, please see :ref:`model_samples_tutorials`. To learn about upcoming capabilities, please view the :ref:`neuron_roadmap`.

AWS Neuron includes a deep learning compiler, runtime, and tools that are natively integrated into TensorFlow, PyTorch and Apache MXNet. The EC2 Trn1/Trn1n instances are optimized for the highest performance and best price-performance training in AWS. The EC2 Inf2 instances are designed for high-performance deep learning inference applications. With Neuron, customers can quickly start using Inf/Trn instances through services like Amazon Sagemaker, Amazon Elastic Container Service (ECS), Amazon Elastic Kubernetes Service (EKS), AWS Batch, and AWS Parallel Cluster.

Check :ref:`announcements-main` and check :ref:`latest-neuron-release` for latest release.

.. grid:: 2


   .. card:: Get Started with Neuron
         :link: neuron-quickstart
         :link-type: ref


   .. card:: Neuron Quick Links
         :link: docs-quick-links
         :link-type: ref


.. toctree::
   :maxdepth: 1
   :caption: Overview

   Quick Links </general/quick-start/docs-quicklinks>
   Get Started with Neuron </general/quick-start/index>
   Samples and Tutorials </general/models/index>
   Performance </general/benchmarks/index>
   What’s New </release-notes/index>
   Announcements </general/announcements/index>

.. toctree::
   :maxdepth: 1
   :caption: ML Frameworks

   PyTorch Neuron </frameworks/torch/index>
   JAX Neuron </frameworks/jax/index>
   TensorFlow Neuron </frameworks/tensorflow/index>
   Apache MXNet </frameworks/mxnet-neuron/index>

.. toctree::
   :maxdepth: 1
   :caption: NeuronX Distributed (NxD)

   NxD Training (Beta) </libraries/nxd-training/index>
   NxD Core </libraries/neuronx-distributed/index>


.. toctree::
   :maxdepth: 1
   :caption: Additional Libraries

   Transformers Neuron </libraries/transformers-neuronx/index>
   AWS Neuron Reference for NeMo Megatron </libraries/nemo-megatron/index>

.. toctree::
   :maxdepth: 1
   :caption: Developer Flows

   Neuron DLAMI </dlami/index>
   Neuron Containers </containers/index>
   Workload Orchestration </general/devflows/index>
   Setup Guide </general/setup/index>

.. toctree::
   :maxdepth: 1
   :caption: Runtime & Tools

   Neuron Runtime </neuron-runtime/index>
   Neuron Tools </tools/index>
   Neuron Calculator </general/calculator/neuron-calculator>

.. toctree::
   :maxdepth: 1
   :caption: Compiler

   Neuron Compiler </compiler/index>
   Neuron Kernel Interface (Beta) <general/nki/index>
   Neuron C++ Custom Operators </neuron-customops/index>

.. toctree::
   :maxdepth: 1
   :caption: Learning Neuron

   Architecture </general/arch/index>
   Features </general/arch/neuron-features/index>
   Application Notes </general/appnotes/index>
   FAQ </general/faq>
   Troubleshooting </general/troubleshooting>


.. toctree::
   :maxdepth: 1
   :caption: About Neuron

   /release-notes/release
   Roadmap </general/roadmap-readme>
   Support </general/support>