Conversation
Pull Request resolved: #17229

Introduce LoraConfig to hold LoRA parameters such as:

- checkpoint
- rank
- target_modules (e.g. q_proj, k_proj, v_proj, up_proj, down_proj, gate_proj, o_proj)
- lora_alpha

LoraConfig validation is done post-init. A LoraConfig can also be created from a config.json file.

Update cases of export_llama_lib to use LoraConfig instead of adapter_checkpoint and adapter_config.

NOTE: we may need to extend this to support more customizable features, such as a LoRA config per layer. cc @hakanb

ghstack-source-id: 340930093
@exported-using-ghexport

Differential Revision: [D92304723](https://our.internmc.facebook.com/intern/diff/D92304723/)
Pull request overview
This PR introduces a LoraConfig dataclass to consolidate LoRA adapter configuration parameters (checkpoint path, rank, alpha, and target modules) that were previously managed as separate fields (adapter_checkpoint and adapter_config) in BaseConfig. The refactoring improves organization by encapsulating related LoRA parameters into a single configuration object.
Changes:

- Introduced `LoraConfig` dataclass with support for both JSON file-based and explicit parameter configuration
- Refactored `BaseConfig` to use `LoraConfig` instead of separate `adapter_checkpoint` and `adapter_config` fields
- Updated model loading logic to use `LoraConfig` for adapter parameter resolution
- Updated YAML configurations and test scripts to use the new `LoraConfig` structure
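For orientation, a minimal sketch of what such a dataclass could look like, based on the parameters listed in the PR description (field names follow the description, but types and defaults here are assumptions rather than the exact implementation):

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class LoraConfig:
    """Sketch of a LoRA adapter configuration; not the exact PR implementation."""

    # Path to the adapter weights (e.g. adapter_model.safetensors).
    adapter_checkpoint: Optional[str] = None
    # Path to an adapter_config.json exported by the fine-tuning framework;
    # when set, rank/alpha/target_modules can be read from it instead.
    adapter_config: Optional[str] = None
    # Explicit LoRA parameters, used when adapter_config is not provided.
    lora_rank: int = 0
    lora_alpha: int = 0
    target_modules: List[str] = field(default_factory=list)
```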
Reviewed changes
Copilot reviewed 4 out of 4 changed files in this pull request and generated 6 comments.
| File | Description |
|---|---|
| extension/llm/export/config/llm_config.py | Introduces LoraConfig dataclass; updates BaseConfig to use lora_config field; updates from_args to construct LoraConfig from CLI args |
| examples/models/qwen3/config/qwen3_xnnpack_lora.yaml | Adds lora_config nested structure with adapter_checkpoint and adapter_config fields using OmegaConf environment variable interpolation |
| examples/models/llama/model.py | Updates model initialization to use lora_config instead of separate adapter fields; adds logic to parse adapter_config JSON when needed; removes redundant validation code |
| .ci/scripts/test_lora.sh | Updates test script to use environment variables for LoRA config paths; switches to qwen3_xnnpack_lora.yaml config; adds override for lora_config=null in quantization test |
The LoraConfig dataclass lacks validation to ensure that either adapter_config is provided OR all explicit values (lora_rank, lora_alpha, target_modules) are provided. According to the documentation, these are two mutually exclusive ways to configure LoRA, but there is no `__post_init__` method to enforce this constraint. This could lead to invalid configurations where adapter_checkpoint is provided but neither adapter_config nor the explicit parameters are set, resulting in runtime errors when the config is used in model.py (lines 72-77).
Suggested implementation:

```python
def __post_init__(self) -> None:
    """
    Validate that LoRA configuration is provided in exactly one of the two
    supported ways:
    1. Via adapter_config JSON (adapter_config is not None), in which case
       explicit LoRA parameters must not be set.
    2. Via explicit values (adapter_config is None), in which case all of
       lora_rank, lora_alpha, and target_modules must be provided.
    """
    has_adapter_config = self.adapter_config is not None
    has_explicit_params = (
        self.lora_rank != 0
        or self.lora_alpha != 0
        or bool(self.target_modules)
    )
    # Enforce mutual exclusivity between adapter_config and explicit params.
    if has_adapter_config and has_explicit_params:
        raise ValueError(
            "LoraConfig must be configured either with 'adapter_config' or "
            "with explicit parameters ('lora_rank', 'lora_alpha', "
            "'target_modules'), but not both."
        )
    # If no adapter_config is provided, require all explicit parameters.
    if not has_adapter_config:
        if self.lora_rank <= 0 or self.lora_alpha <= 0 or not self.target_modules:
            raise ValueError(
                "Invalid LoraConfig: when 'adapter_config' is not provided, "
                "'lora_rank' and 'lora_alpha' must be positive and "
                "'target_modules' must be a non-empty list."
            )
```
```python
if lora_config.adapter_config and lora_config.lora_rank == 0:
    with open(lora_config.adapter_config, "r") as f:
        cfg = json.load(f)
        lora_config.lora_rank = cfg["r"]
        lora_config.lora_alpha = cfg["lora_alpha"]
        lora_config.target_modules = cfg["target_modules"]
```
Directly mutating dataclass fields can lead to unexpected behavior, especially when using OmegaConf which creates structured configs. Instead of mutating lora_config in-place, the config values should be validated and populated during initialization or in a post_init hook. This mutation breaks the principle that configs should be immutable after creation and can cause issues if the same config object is reused or if OmegaConf features like interpolation or merging are used.
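One way to address this, sketched below as a hypothetical alternative (the class and method names are illustrative, and the JSON keys `r`, `lora_alpha`, and `target_modules` are assumed from the snippet above), is to resolve the JSON once into a separate, fully populated object instead of patching the user-supplied config in place:

```python
import json
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass(frozen=True)
class ResolvedLoraConfig:
    """Hypothetical fully-resolved LoRA config; immutable after construction."""

    adapter_checkpoint: Optional[str]
    lora_rank: int
    lora_alpha: int
    target_modules: List[str] = field(default_factory=list)

    @classmethod
    def from_lora_config(cls, lora_config) -> "ResolvedLoraConfig":
        # Prefer values from the adapter_config JSON when it is given;
        # otherwise fall back to the explicit fields on lora_config.
        if lora_config.adapter_config:
            with open(lora_config.adapter_config, "r") as f:
                cfg = json.load(f)
            return cls(
                adapter_checkpoint=lora_config.adapter_checkpoint,
                lora_rank=cfg["r"],
                lora_alpha=cfg["lora_alpha"],
                target_modules=cfg["target_modules"],
            )
        return cls(
            adapter_checkpoint=lora_config.adapter_checkpoint,
            lora_rank=lora_config.lora_rank,
            lora_alpha=lora_config.lora_alpha,
            target_modules=lora_config.target_modules,
        )
```

With this shape the user-facing config (and any OmegaConf structured config built from it) is never mutated, so interpolation and merging keep working on the original values.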
```
Note: user is responsible for parsing the config and
ensure it doesn't conflict with any explicit values.
```
The documentation note on lines 76-77 states "user is responsible for parsing the config and ensure it doesn't conflict with any explicit values", but this is misleading. Looking at the implementation in model.py (lines 72-77), the config JSON is only parsed when lora_rank is 0, and the values are directly assigned without any conflict checking. If a user provides both adapter_config and explicit values, the explicit values will be silently overwritten if lora_rank is 0. This behavior should either be prevented through validation, or the documentation should be updated to accurately reflect what happens.
Suggested change:

```diff
-Note: user is responsible for parsing the config and
-ensure it doesn't conflict with any explicit values.
+Note: when adapter_config is provided and lora_rank is left at its
+default value (0), values loaded from the JSON (such as lora_rank,
+lora_alpha, and target_modules) will overwrite any explicit values
+passed to this dataclass; no conflict checking is performed.
```
```python
# Resolve LoRA params from adapter_config JSON if not already set.
if lora_config.adapter_config and lora_config.lora_rank == 0:
    with open(lora_config.adapter_config, "r") as f:
        cfg = json.load(f)
        lora_config.lora_rank = cfg["r"]
        lora_config.lora_alpha = cfg["lora_alpha"]
        lora_config.target_modules = cfg["target_modules"]
```
The condition 'lora_config.lora_rank == 0' is used to decide whether to parse adapter_config JSON. However, 0 is the default value for lora_rank, which creates an ambiguous situation: it's unclear whether lora_rank is 0 because it wasn't set (and should be loaded from JSON) or because the user explicitly wants a rank of 0. A rank of 0 would be invalid for LoRA (since it means no low-rank adaptation). Consider using None as the default value for lora_rank instead of 0, or checking if adapter_config is provided and any of the LoRA parameters are still at their defaults.
Suggested change:

```diff
-# Resolve LoRA params from adapter_config JSON if not already set.
-if lora_config.adapter_config and lora_config.lora_rank == 0:
-    with open(lora_config.adapter_config, "r") as f:
-        cfg = json.load(f)
-        lora_config.lora_rank = cfg["r"]
-        lora_config.lora_alpha = cfg["lora_alpha"]
-        lora_config.target_modules = cfg["target_modules"]
+# Resolve LoRA params from adapter_config JSON if provided, without
+# relying on a specific sentinel value for lora_rank.
+if lora_config.adapter_config:
+    with open(lora_config.adapter_config, "r") as f:
+        cfg = json.load(f)
+        if not lora_config.lora_rank:
+            lora_config.lora_rank = cfg["r"]
+        if not lora_config.lora_alpha:
+            lora_config.lora_alpha = cfg["lora_alpha"]
+        if not lora_config.target_modules:
+            lora_config.target_modules = cfg["target_modules"]
```
```python
cfg = json.load(f)
lora_config.lora_rank = cfg["r"]
lora_config.lora_alpha = cfg["lora_alpha"]
lora_config.target_modules = cfg["target_modules"]
```
Missing error handling when loading adapter_config JSON. If the JSON file is malformed or missing required keys like 'r', 'lora_alpha', or 'target_modules', this will raise a KeyError without a clear error message. Consider wrapping this in a try-except block with a more informative error message, or add validation after loading to check that all required fields are present.
Suggested change:

```diff
-cfg = json.load(f)
-lora_config.lora_rank = cfg["r"]
-lora_config.lora_alpha = cfg["lora_alpha"]
-lora_config.target_modules = cfg["target_modules"]
+try:
+    cfg = json.load(f)
+except json.JSONDecodeError as e:
+    raise ValueError(
+        f"Failed to parse LoRA adapter config JSON file "
+        f"'{lora_config.adapter_config}': {e}"
+    ) from e
+try:
+    lora_config.lora_rank = cfg["r"]
+    lora_config.lora_alpha = cfg["lora_alpha"]
+    lora_config.target_modules = cfg["target_modules"]
+except KeyError as e:
+    missing_key = e.args[0] if e.args else "unknown"
+    raise ValueError(
+        "Missing required key "
+        f"'{missing_key}' in LoRA adapter config JSON file "
+        f"'{lora_config.adapter_config}'. Expected keys: "
+        "'r', 'lora_alpha', 'target_modules'."
+    ) from e
```
```python
lora_config.lora_alpha = cfg["lora_alpha"]
lora_config.target_modules = cfg["target_modules"]

adapter_checkpoint_path = lora_config.adapter_checkpoint
```
Missing validation for adapter_checkpoint path. When lora_config is provided, adapter_checkpoint_path could be invalid (file doesn't exist, path is empty, etc.), but there's no explicit check before attempting to load it. The code will only fail with an unclear error when torch.load or the safetensors loader is called. Consider adding early validation to check that the file exists and is readable, with a clear error message.
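A minimal sketch of such an early check (the helper name and error messages are illustrative, not from the PR):

```python
import os


def validate_adapter_checkpoint(adapter_checkpoint_path: str) -> None:
    """Fail fast with a clear message before torch.load / safetensors runs."""
    if not adapter_checkpoint_path:
        raise ValueError("lora_config.adapter_checkpoint is empty.")
    if not os.path.isfile(adapter_checkpoint_path):
        raise FileNotFoundError(
            f"LoRA adapter checkpoint not found: '{adapter_checkpoint_path}'. "
            "Check the 'adapter_checkpoint' field of lora_config."
        )
    if not os.access(adapter_checkpoint_path, os.R_OK):
        raise PermissionError(
            f"LoRA adapter checkpoint is not readable: '{adapter_checkpoint_path}'."
        )
```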