ECMFA 2026

This repository provides supplemental materials for paper 17 ("Leveraging LLMs for Grammar Adaptation: A Study on Metamodel-Grammar Co-Evolution") of the ECMFA 2026 conference. In this paper, we propose an LLM-based approach for metamodel-grammar co-evolution and compare it with a rule-based approach. The six case languages are divided into two groups: the training set and the test set.

1 Directory: training_set

In the training set, the metamodels and grammars of four case languages (BibTeX, EAST-ADL, SML, and Xenia) were used to develop and validate prompting strategies. For each of them, we created a dedicated subdirectory.

Note that "training" in the training set is not training in the LLM sense: the LLM itself learns nothing from these four DSLs (each DSL is processed in an independent session, with no information exchange between sessions). The term "training set" refers to the researchers using these DSLs to develop and debug prompting strategies; it corresponds to researcher-level learning, not model-level learning. This point is also stated in Section 3.2 of the paper.

1.1 BibTeX

Iterative development of prompts in BibTeX.

1.2 EAST-ADL

Iterative development of prompts in EAST-ADL.

1.2.1 Claude_Iterative_Development

This subdirectory contains a record of the prompts we used for iterative development on EAST-ADL.

1.2.2 Claude_Test

The finalized prompts, developed on the three small-to-medium-sized case languages (BibTeX, SML, and Xenia), were tested on EAST-ADL. This subdirectory contains the test output files. Note that these prompts differ from the prompting strategy used during iterative development on EAST-ADL.

1.2.3 ChatGPT_Test

We also used ChatGPT to perform LLM-based co-evolution tests on EAST-ADL.

1.2.4 Gemini_Test

We also used Gemini to perform LLM-based co-evolution tests on EAST-ADL.

1.3 SML

Iterative development of prompts in SML.

1.4 Xenia

Iterative development of prompts in Xenia.

2 Directory: test_set

In the test set, two DSLs (DOT and Xcore) are used to verify the generalization ability of finalized prompts.

2.1 Dot

The LLM-based co-evolution test on the case language Dot.

2.2 Xcore

The LLM-based co-evolution test on the case language Xcore.

3 Directory: longitudinal_study

We also conducted a longitudinal study of the Claude-based co-evolution approach on four versions of QVTo.

4 Experiment Execution Method

All experiments were executed manually through the web-based interfaces of the respective LLMs (Claude Sonnet 4.5, ChatGPT 5.1, and Gemini 3), with no additional automation.

The experiment was executed independently for each DSL, in two steps.

In the first step, the generated grammar G1, the manually adapted grammar G1', and Prompt 1 are provided to the LLM for analysis and comparison.
In the second step, the generated grammar G2 and Prompt 2 are provided to the LLM, whose output yields the adapted grammar G2'.

Taking BibTeX as an example:

in the first step, "MyBibTex_generated_grammar.txt" (as G1) and "MyBibTex_target_grammar.txt" (as G1') are provided together with Prompt 1;
in the second step, "MyBibTex_generated_grammar.txt" is provided again (this time as G2) together with Prompt 2. Note that for these six DSLs, G1 and G2 refer to the same generated grammar file; this is inherited from the experimental setup of prior work and is acknowledged as a threat to external validity in the paper (see Section 5.2).

Finalized Prompts:

Prompt 1: The attachment contains two Xtext grammars for the same language: the grammar generated from the metamodel and the target grammar. Please identify the adaptations required to transform the generated grammar into the target grammar.
Prompt 2: Now, I’m sending you the grammar generated from the evolved metamodel. Please adapt it using the adaptations you learned previously and output the adapted grammar to me.
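The two-step procedure and the finalized prompts above can be summarized as a small checklist. The experiments were run manually in the web interfaces, so the following Python sketch is only an illustration of which files accompany which prompt; `Step` and `plan_session` are hypothetical names that do not exist in this repository, and nothing here calls an LLM.

```python
from dataclasses import dataclass

# Finalized prompts, copied from this README.
PROMPT_1 = (
    "The attachment contains two Xtext grammars for the same language: "
    "the grammar generated from the metamodel and the target grammar. "
    "Please identify the adaptations required to transform the generated "
    "grammar into the target grammar."
)
PROMPT_2 = (
    "Now, I’m sending you the grammar generated from the evolved metamodel. "
    "Please adapt it using the adaptations you learned previously and output "
    "the adapted grammar to me."
)


@dataclass
class Step:
    """One message of a session: a prompt plus the files attached to it."""
    prompt: str
    attachments: list[str]


def plan_session(g1: str, g1_target: str, g2: str) -> list[Step]:
    """Return the two steps of one independent session for a single DSL."""
    return [
        Step(PROMPT_1, [g1, g1_target]),  # step 1: compare G1 with G1'
        Step(PROMPT_2, [g2]),             # step 2: adapt G2 to obtain G2'
    ]


# BibTeX example: G1 and G2 are the same generated grammar file.
steps = plan_session(
    g1="MyBibTex_generated_grammar.txt",
    g1_target="MyBibTex_target_grammar.txt",
    g2="MyBibTex_generated_grammar.txt",
)
for number, step in enumerate(steps, start=1):
    print(f"Step {number}: attach {step.attachments}")
```

Both steps belong to the same chat session, because Prompt 2 relies on the adaptations identified in response to Prompt 1.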

We stored the grammars and instances in .txt files so that readers without Xtext installed can still open them. A .txt file whose name contains "target" holds G1', the grammar manually adapted from G1 (G1 being the grammar generated from the metamodel before the evolution).

5 File format explanation

All grammar content, including grammar content generated by LLMs, is stored in txt files; all analyses generated by LLMs, such as analyses comparing the adaptations that occurred between G1 and G1', are stored in md files; all descriptions organized by humans, such as descriptions of the experimental process, are stored in docx files.
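As a reading aid, the file conventions described in this README can be expressed as a lookup by file extension. This is a minimal sketch under the assumption that the extension and the word "target" are the only signals needed; `describe` and `ROLE_BY_SUFFIX` are hypothetical names, and files that do not follow the conventions (such as this README itself) would be misclassified.

```python
from pathlib import PurePath

# Roles by extension, as described in this README.
ROLE_BY_SUFFIX = {
    ".txt": "grammar or instance content",
    ".md": "LLM-generated analysis",
    ".docx": "human-written description",
}


def describe(file_name: str) -> str:
    """Return the role of a repository file based on its name."""
    path = PurePath(file_name)
    suffix = path.suffix.lower()
    # A .txt file with "target" in its name holds the adapted grammar G1'.
    if suffix == ".txt" and "target" in path.stem:
        return "manually adapted grammar G1'"
    return ROLE_BY_SUFFIX.get(suffix, "unknown")


for name in (
    "MyBibTex_generated_grammar.txt",
    "MyBibTex_target_grammar.txt",
    "Iterative_Development_Process_of_Prompts.docx",
):
    print(f"{name}: {describe(name)}")
```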

