Skip to content
View mosharaf's full-sized avatar

Highlights

  • Pro

Organizations

@mesos @eecs489 @SymbioticLab

Block or report mosharaf

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results
Python 64 5 Updated Mar 30, 2025

Advanced Scalable Systems for X

31 1 Updated Dec 3, 2024

A time-cost tradeoff problem solver

Python 10 Updated Jan 8, 2025

Measure and optimize the energy consumption of your AI applications!

Python 243 33 Updated Mar 28, 2025

A resilient distributed training framework

Python 92 5 Updated Apr 11, 2024

Large Language Model (LLM) Systems Paper List

862 34 Updated Mar 28, 2025

How much energy do GenAI models consume?

Python 42 4 Updated Oct 16, 2024

A Cluster-Wide Model Manager to Accelerate DNN Training via Automated Training Warmup

Python 34 5 Updated Jan 9, 2023

FedScale is a scalable and extensible open-source federated learning (FL) platform.

Python 396 119 Updated Dec 18, 2023

Aequitas enables RPC-level QoS in datacenter networks.

C++ 16 2 Updated Jul 19, 2022

Federated Learning Systems Paper List

70 6 Updated Feb 7, 2024

Hydra adds resilience and high availability to remote memory solutions.

C 30 4 Updated Feb 22, 2022

Justitia provides RDMA isolation between applications with diverse requirements.

C 39 8 Updated May 25, 2022

Oort: Efficient Federated Learning via Guided Participant Selection

Python 126 26 Updated Oct 27, 2021

A Generic Resource-Aware Hyperparameter Tuning Execution Engine

Python 15 3 Updated Jan 8, 2022

Prefetching and efficient data path for memory disaggregation

C 67 23 Updated Jul 16, 2020

A Federated Execution Engine for Fast Distributed Computation Over Slow Networks

Scala 26 7 Updated Apr 26, 2021

Fine-grained GPU sharing primitives

Jupyter Notebook 141 19 Updated Mar 13, 2020

Tiresias is a GPU cluster manager for distributed deep learning training.

Python 152 50 Updated May 7, 2020

📚 👓 A collection of research papers, codes, tutorials and blogs on Federated Computing/Learning.

472 84 Updated Aug 1, 2023

Infiniswap enables unmodified applications to efficiently use disaggregated memory.

C 245 50 Updated Sep 26, 2020

Varys: Efficient Clairvoyant Coflow Scheduler

Scala 34 26 Updated Aug 6, 2015

Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append

Java 1 1 Updated Oct 6, 2013

Facebook's Realtime Distributed FS based on Apache Hadoop 0.20-append

Java 877 360 Updated Oct 10, 2014
Showing results