pytorch / FBGEMM Public

Notifications You must be signed in to change notification settings
Fork 432
Star 1.1k

Code
Issues 25
Pull requests 290
Discussions
Actions
Projects
Wiki
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Wiki
Security
Insights

Pull requests: pytorch/FBGEMM

Labels 19 Milestones 0

New pull request New

290 Open 2,267 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Reviews

Filter by reviews

No reviews Review required Approved review Changes requested

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Pull requests list

MX4 check smem and fixes cla signed fb-exported

#2703 opened Jun 8, 2024 by spcyppt

Loading…

[fbgemm_gpu] Adjust install and test steps based on installed variant cla signed

#2702 opened Jun 7, 2024 by q10

Loading…

Revert D57738223: Multisect successfully blamed "D57738223: [fp8 kv cache] wmma_gqa_attn_splitk" for one test failure cla signed fb-exported

#2695 opened Jun 6, 2024 by Aya-ZIbra

Loading…

[comm][ROCm] move memory copy into one_shot_all_reduce cla signed module: rocm

#2693 opened Jun 6, 2024 by wenkaidu

Loading…

Clean up commented code cla signed fb-exported

#2692 opened Jun 6, 2024 by amylittleyang

Loading…

Faster Dequantize cla signed fb-exported

#2683 opened Jun 5, 2024 by Aya-ZIbra

Loading…

Support MTIA in DenseTableBatchedEmbeddingBagsCodegen cla signed fb-exported

#2680 opened Jun 4, 2024 by gnahzg

Loading…

Support stochastic rounding for FP32/BF16 -> FP8 conversion cla signed fb-exported

#2677 opened Jun 4, 2024 by jianyuh

Loading…

Re-enable CAR in OSS build cla signed fb-exported

#2660 opened Jun 1, 2024 by jianyuh

Loading…

wip cla signed

#2656 opened May 31, 2024 by q10

Loading…

Improve Int4 Quantization and Fix Groupwise Scaling cla signed fb-exported

#2655 opened May 31, 2024 by jwfromm

Loading…

Fix ads publish error cla signed fb-exported

#2649 opened May 30, 2024 by snabelkabiya

Loading…

Add -lrt to asmjit bazel build cla signed

#2634 opened May 25, 2024 by cyyever

Loading…

[ROCm] enable experimental gen_ai build cla signed module: rocm

#2610 opened May 20, 2024 by jeffdaily

Loading…

FP32 Autovec Final Optimization cla signed

#2586 opened May 13, 2024 by crystalrchen

Loading…

all_to_one cuda support non-2d inputs cla signed fb-exported

#2575 opened May 9, 2024 by IvanKobzarev

Loading…

add max norm support to PARTIAL_ROWWISE_ADAM cla signed fb-exported

#2567 opened May 7, 2024 by zainhuda

Loading…

Pyre Configurationless migration for] [batch:9/28] cla signed fb-exported

#2557 opened May 3, 2024 by connernilsen

Loading…

Fp8 updated cla signed

#2550 opened Apr 30, 2024 by elopez0409

Loading…

Change the caller cla signed

#2549 opened Apr 30, 2024 by jianyuh

Loading…

Pyre Configurationless migration for] [batch:6/29] cla signed

#2548 opened Apr 29, 2024 by connernilsen

Loading…

Integrate triton row and blockwise fp8 gemm to llm inference. cla signed fb-exported

#2547 opened Apr 29, 2024 by choutim

Loading…

Add fp8 row/block-wise scaled GEMMs cla signed

#2546 opened Apr 29, 2024 by choutim

Loading…

Revert D56685840: Multisect successfully blamed "D56685840: [fbgemm] Change model transform fp8 linear op to fbgemm quantize ops" for one test failure cla signed

#2545 opened Apr 29, 2024 by jianyuh

Loading…

Refactor fbgemm / llama csrc code base cla signed

#2544 opened Apr 29, 2024 by jianyuh

Loading…

Previous 1 2 3 4 5 … 11 12 Next

Previous Next

ProTip! no:milestone will show everything without a milestone.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly