The ASPLOS 2025 / EuroSys 2025 Contest on an Optimized Neuron Kernel Interface (NKI) Implementation of Llama 3.2 1B (Inference)

Teams from around the globe are invited to contribute submissions toward producing the fastest implementation of the Llama 3.2 1B model (inference only), written using the Neuron Kernel Interface (NKI) programming interface and running on Amazon ML hardware (Trainium/Inferentia). Prizes will be awarded to top-ranking teams who commit to open-sourcing their solutions prior to next year's conference.

Rank | Prize | Team | Links
First Place 🥇 | $25,000 | Shiwei Gao + Ruwen Fan + Shaoxun Zeng + Haodi Jiang + Huajun Bai + Yitian Yang + Hao Guo + Qing Wang + Jiwu Shu + Youyou Lu (Tsinghua University) | [code]
Second Place 🥈 | $10,000 | Dong Li + Dinghong Song + Jierui Xu (University of California Merced / University of Wisconsin Madison) |
Third Place 🥉 | $5,000 | Dongkwan Kim + Chan Lee + Seonyoung Cheon + Kunmo Jeong + Hoyun Youm + Sungwoo Yun + Yongwoo Lee (Yonsei University) |

Photos from the Award Ceremony

Important Dates

Date | Event
2024-12-01 | Contest Announced
2025-01-18 | Contest GitHub Repository & Benchmark Subset Released
2025-02-03 | Application Deadline for Student Travel Grants
2025-02-22 | Contest Registrations & Preliminary Submissions Due* (deadline extended!)
2025-03-03 | Early Registration Deadline for ASPLOS 2025 / EuroSys 2025
2025-03-10 | Contest Final Submissions Due* (EXTENDED)
2025-03-15 | Special Session Schedule Finalized
2025-03-30 | Contest Special Session during ASPLOS 2025 / EuroSys 2025 Workshops
2025-04-01 | Contest Winners Announced during ASPLOS 2025 / EuroSys 2025 Conference

*Submissions are due by 11:59 pm Anywhere on Earth (AoE).
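For teams working across time zones, the deadline rule above can be turned into a concrete UTC instant. The following is a minimal sketch, assuming the rule refers to the common "Anywhere on Earth" (AoE) convention, which is usually taken to mean UTC-12; the helper name `aoe_deadline_utc` is hypothetical and not part of any contest tooling.

```python
from datetime import datetime, timedelta, timezone

# Assumption: "Anywhere on Earth" (AoE) corresponds to the UTC-12 offset.
AOE = timezone(timedelta(hours=-12), name="AoE")


def aoe_deadline_utc(year, month, day):
    """Return the UTC instant of 11:59 pm AoE on the given calendar date."""
    local = datetime(year, month, day, 23, 59, tzinfo=AOE)
    return local.astimezone(timezone.utc)


# Final submission date taken from the Important Dates list.
final_due = aoe_deadline_utc(2025, 3, 10)
print(final_due.isoformat())  # 2025-03-11T11:59:00+00:00
```

Under this assumption, a submission is on time as long as it arrives before the printed UTC instant, regardless of the submitter's local time zone.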

Problem Description

Amazon Web Services (AWS) has two families of machine learning chips, Trainium and Inferentia. The AWS Neuron SDK provides a compiler and profiling tools for programming these devices through high-level libraries such as PyTorch. AWS recently released a new programming interface, the Neuron Kernel Interface (NKI), that gives programmers down-to-the-metal access to Trainium/Inferentia hardware features, potentially unlocking even greater performance opportunities.

For this contest, teams will submit code that leverages NKI to implement the Llama 3.2 1B model, targeting a single Trainium1 (trn1) chip.

Contest GitHub Repository and Benchmark Subset

Please consult the following GitHub repository for full contest details: https://github.com/aws-samples/nki-llama.

Contest Organizers

  • Emery Berger (Amazon Web Services), emerydb@amazon.com
  • Aninda Manocha (Amazon Web Services)
  • Wei Tang (Amazon Web Services)
  • Emily Webber (Amazon Web Services)
  • Ziyang Xu (Amazon Web Services)

Contest Sponsor

Amazon Web Services