Commit

Add GPTQ quantization to executorch (not tested) (#2274)
Summary:
Pull Request resolved: #2274

Will test in OSS and fix any issues in a separate PR.

bypass-github-export-checks
bypass-github-pytorch-ci-checks
bypass-github-executorch-ci-checks

Reviewed By: digantdesai

Differential Revision: D54569842

fbshipit-source-id: 7c3e60833d8e144cb5842842b4f640138dfa9382
jerryzh168 authored and facebook-github-bot committed Mar 8, 2024
1 parent daaecb1 commit 0b6add8
Showing 5 changed files with 887 additions and 30 deletions.
2 changes: 0 additions & 2 deletions backends/vulkan/serialization/vulkan_graph_serialize.py
@@ -12,7 +12,6 @@
 from dataclasses import dataclass
 from typing import ClassVar, List

-# pyre-ignore[21]: Could not find module `executorch.exir._serialize._bindings`.
 import pkg_resources
 import torch

@@ -35,7 +34,6 @@ def convert_to_flatbuffer(vk_graph: VkGraph) -> bytes:
         json_path = os.path.join(d, "schema.json")
         with open(json_path, "wb") as json_file:
             json_file.write(vk_graph_json.encode("ascii"))
-        # pyre-ignore
         _flatc_compile(d, schema_path, json_path)
         output_path = os.path.join(d, "schema.bin")
         with open(output_path, "rb") as output_file: