
Organizations

@halide @llvm

asaadaldien/README.md

Hey there, it's Ahmed Taei! 👋

I'm a software engineer and applied mathematician, and the blend of both disciplines defines my work. I develop systems, algorithms, compilers, and languages for AI and numerical computing in general, so my work usually sits at the intersection of these areas.

About Me

  • Joined NVIDIA (2025–present)
  • 2023–2024 @ Modular: worked on Mojo 🔥 on GPUs. Part of this work was presented as an LLVM talk: Watch here.
  • 📚 Previously, I developed distributed ML training systems and algorithms, designed DSLs for ML kernels on custom silicon, and built compiler and runtime stacks from the ground up for ML accelerators. Part of this work involved contributions to open-source projects such as OpenXLA/IREE Compiler, PyTorch, TensorFlow, and Caffe2.

Connect with Me

LinkedIn
Twitter

Pinned

  1. iree-org/iree

    A retargetable MLIR-based machine learning compiler and runtime toolkit.

    C++ · 3k stars · 659 forks

  2. llvm/llvm-project

    The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

    LLVM · 31k stars · 12.7k forks

  3. llvm/torch-mlir

    The Torch-MLIR project aims to provide first class support from the PyTorch ecosystem to the MLIR ecosystem.

    C++ · 1.4k stars · 532 forks

  4. halide/Halide

    a language for fast, portable data-parallel computation

    C++ · 6k stars · 1.1k forks

  5. pytorch/pytorch

    Tensors and Dynamic neural networks in Python with strong GPU acceleration

    Python · 87k stars · 23.4k forks

  6. pytorch/glow

    Compiler for Neural Network hardware accelerators

    C++ · 3.3k stars · 699 forks