
Conversation

Contributor

@tuliocoppola tuliocoppola commented Aug 28, 2025

This PR was discussed with and shown to @nrfulton.

It introduces the concept of prompt_modules under the helpers: completely isolated modules, with a common interface, that can be composed into any kind of pipeline.
These modules are the building blocks supporting the Decomposition Pipeline implemented under the CLI tools.

It's a lot of code, but all of the introduced files are isolated under the new directory mellea/helpers/prompt_modules, and none of them interfere with any other part of the codebase.
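To make the "common interface" idea concrete, here is a minimal, purely hypothetical sketch (stdlib only, NOT mellea's actual classes): each module exposes a `generate(...)` that returns a result object whose `.parse()` yields a typed value, which is what lets modules compose into arbitrary pipelines.

```python
from dataclasses import dataclass
from typing import Callable, Generic, TypeVar

T = TypeVar("T")

# Hypothetical sketch of the shared prompt-module interface described above.
# Names and structure are illustrative assumptions, not mellea's actual code.

@dataclass
class ModuleResult(Generic[T]):
    raw_text: str                    # raw LLM output
    _parser: Callable[[str], T]      # turns raw text into a typed value

    def parse(self) -> T:
        return self._parser(self.raw_text)

class ConstraintExtractorStub:
    """Stands in for a prompt module; a real one would call the LLM session."""

    def generate(self, session: object, task_prompt: str) -> ModuleResult[list[str]]:
        raw = "constraint A\nconstraint B"  # stubbed model output
        return ModuleResult(raw, lambda text: text.splitlines())

constraints = ConstraintExtractorStub().generate(None, "some task prompt").parse()
print(constraints)  # ['constraint A', 'constraint B']
```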

There are also modifications to cli/decompose.

Usage example for one of the prompt_modules:

```
from mellea import MelleaSession
from mellea.backends.ollama import OllamaModelBackend
from mellea.backends.types import ModelOption  # needed for ModelOption.CONTEXT_WINDOW below

from mellea.helpers.prompt_modules import constraint_extractor

m_session = MelleaSession(
    OllamaModelBackend(
        model_id="mistral-small3.2:24b",
        model_options={
            ModelOption.CONTEXT_WINDOW: 32768,
            "timeout": 3600,
        },
    )
)

task_prompt: str = <task prompt example below this code block>

task_prompt_constraints: list[str] = constraint_extractor.generate(
    m_session, task_prompt
).parse()

print(task_prompt_constraints)

# e.g.
# [
#     "The entire text can't exceed 600 characters",
#     "The summary must be a maximum of 3 paragraphs",
#     "A specific celebrity name must be incorporated into the summary",
#     "The output must be in a JSON format with \"title\" and \"summary\" keys",
#     "The JSON output must not have any Markdown formatting"
# ]
```
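As a side note, constraints like the JSON-format ones above can be checked mechanically downstream. Here is a small hypothetical validator (not part of this PR) for the format-related constraints: plain JSON with "title" and "summary" keys, no Markdown fencing, at most 600 characters.

```python
import json

# Hypothetical downstream check (not part of this PR): verify a model
# response against the format constraints extracted above.

def satisfies_constraints(text: str) -> bool:
    if len(text) > 600 or text.strip().startswith("```"):
        return False  # too long, or wrapped in forbidden Markdown fencing
    try:
        data = json.loads(text)
    except json.JSONDecodeError:
        return False  # not valid JSON
    return set(data) == {"title", "summary"}

good = '{"title": "A Title", "summary": "A short summary."}'
print(satisfies_constraints(good))                           # True
print(satisfies_constraints("```json\n" + good + "\n```"))   # False
```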

Now an example of the Decomposition CLI Tool usage:

The example task prompt below is a short one; this tool was meant to break up larger prompts, but it also works sufficiently well with short ones.

For this prompt text file:

my-task-prompt.txt

Summarize a news article into a text with a maximum of 3 paragraphs. The entire text can't exceed 600 characters.

You must also incorporate a specific celebrity name into your summary, the name will be provided per task.

I need you to output it in a JSON like this, but without any Markdown formatting, just the JSON (not enclosed in back quotes):
```
{
  "title": <come up with a good title>,
  "summary": <your summary>
}
```

The Decomposition CLI Tool supports task prompts that don't need any user input data, as well as ones that do.

Note that the example task prompt above needs 2 user input variables to execute its task, but it does not explicitly dictate the input variables' names.

On the CLI command you must pass a descriptive name for each necessary user input variable. In this example we'll call our variables NEWS_ARTICLE and CELEBRITY_NAME, but we could just as well have called them INPUT_ARTICLE and CELEBRITY_DATA; the idea is to pass descriptive names so the LLM can correctly infer and reference the input data when generating the subtask templates.
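Purely as an illustration of why descriptive names matter (the real subtask-template format is not shown here, and the `{NAME}` placeholder syntax below is an assumption, not the pipeline's actual format), a generated subtask might reference the inputs by the names given on the CLI:

```python
# Hypothetical subtask template: descriptive variable names like
# NEWS_ARTICLE and CELEBRITY_NAME let the generated subtask reference
# the right input unambiguously. Placeholder syntax is illustrative only.

subtask_template = (
    "Summarize the news article below, mentioning {CELEBRITY_NAME}.\n\n"
    "Article:\n{NEWS_ARTICLE}"
)

filled = subtask_template.format(
    NEWS_ARTICLE="Example article text.",
    CELEBRITY_NAME="Jane Doe",
)
print(filled)
```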

If your task prompt doesn't need user input data, then you can just omit the --input-var options.

All of this information is also contained in the CLI command's help string.

Usage example of the new Decomposition Pipeline CLI Tool:

```
# Generates an "output_files_name.json" and an "output_files_name.py".
m decompose run \
  --out-dir . \
  --prompt-file path/to/text/file/my-task-prompt.txt \
  --out-name output_files_name \
  --input-var NEWS_ARTICLE \
  --input-var CELEBRITY_NAME
```

@nrfulton
Contributor

@HendrikStrobelt This is Tulio's new prompt decomposition pipeline.

You do not need to review the full PR -- it's a lot of code, and Tulio gave me a high-level overview. But do please chime in on where this code should live. A few options:

  1. mellea.helpers.prompt_modules <- current
  2. mellea.prompt_modules <- probably not, right?
  3. cli.decompose.prompt_modules <- also reasonable

@nrfulton
Contributor

The failed test seems like a false negative. @tuliocoppola can you run the python 3.12 tests locally and confirm they pass there?

@tuliocoppola
Contributor Author

> The failed test seems like a false negative. @tuliocoppola can you run the python 3.12 tests locally and confirm they pass there?

@nrfulton
Does look like a false negative; on my fork all tests passed -> https://github.com/tuliocoppola/mellea/actions/runs/17297553591/job/49099508067

@nrfulton
Contributor

@tuliocoppola Overall LGTM. Can you move the code to mellea.helpers.prompt_modules.decompose so that folks aren't confused about what prompt_modules means / understand that this is all related to the decomposition pipeline?

Once that's done we can squash and merge.

@tuliocoppola
Contributor Author

@nrfulton
Done, moved the modules.

@jakelorocco
Contributor

@tuliocoppola, this looks good; can you please move mellea/helpers/prompt_modules/ directly into the cli decompose folder? My reasoning is that most of these helpers are specific to decompose and there are examples, etc... in there.

@mergify

mergify bot commented Sep 3, 2025

Merge Protections

Your pull request matches the following merge protections and will not be merged until they are valid.

🟢 Enforce conventional commit

Wonderful, this rule succeeded.

Make sure that we follow https://www.conventionalcommits.org/en/v1.0.0/

  • title ~= ^(fix|feat|docs|style|refactor|perf|test|build|ci|chore|revert)(?:\(.+\))?:

@nrfulton
Contributor

nrfulton commented Sep 4, 2025

> @tuliocoppola, this looks good; can you please move mellea/helpers/prompt_modules/ directly into the cli decompose folder? My reasoning is that most of these helpers are specific to decompose and there are examples, etc... in there.

And sorry for the back-and-forth here... my fault, and this is really the last move request.

@nrfulton nrfulton changed the title Full refactor of the Decompose CLI Tool & introduction of prompt_modules refactor: Full refactor of the Decompose CLI Tool & introduction of prompt_modules Sep 4, 2025
@tuliocoppola
Contributor Author

@jakelorocco & @nrfulton
Done. Should be ready for merging now.

Contributor

@jakelorocco jakelorocco left a comment

lgtm (except for the one thing noted below @tuliocoppola); we chatted and agreed that, long-term, we will work on porting this code to be more melleaic; for now, keeping it in the cli directory should be good

I was able to run the command, so it looks good

```
model_id=model_id,
model_options={
    ModelOption.CONTEXT_WINDOW: 32768,
    "timeout": backend_req_timeout,
```
Contributor

I think it's fine to leave this in for now, but I believe ollama actually sets this as an environment variable. Error from my ollama logs:

time=2025-09-04T15:16:14.510-04:00 level=WARN source=types.go:654 msg="invalid option provided" option=timeout

Contributor Author

@tuliocoppola tuliocoppola Sep 4, 2025

@jakelorocco
There's an argument when spinning up the client locally with ollama where you can define the request timeout for that client:
https://github.com/ollama/ollama-python/blob/main/ollama/_client.py#L81

So this option would need to be mapped on these lines I guess:
https://github.com/generative-computing/mellea/blob/main/mellea/backends/ollama.py#L67-L68

We can open an issue for this to be implemented later.
I guess that, for now, I can comment out that timeout and add a TODO to re-enable it once it's supported by the backend class, since this will probably have to be made available through the special ModelOption mechanism.
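A minimal sketch of that interim change (the names below are stand-ins for illustration, not the actual mellea code):

```python
# Sketch of the interim fix discussed above: drop the unsupported "timeout"
# entry from model_options and leave a TODO until the Ollama backend maps a
# request timeout through ModelOption. CONTEXT_WINDOW here is a stand-in
# for ModelOption.CONTEXT_WINDOW.

CONTEXT_WINDOW = "context_window"
backend_req_timeout = 3600

model_options = {
    CONTEXT_WINDOW: 32768,
    # TODO: re-enable once the backend exposes a request timeout via ModelOption:
    # "timeout": backend_req_timeout,
}

print("timeout" in model_options)  # False: no invalid option reaches ollama
```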

Comment on lines +82 to +98
```
case DecompBackend.rits:
    assert backend_endpoint is not None, (
        'Required to provide "backend_endpoint" for this configuration'
    )
    assert backend_api_key is not None, (
        'Required to provide "backend_api_key" for this configuration'
    )

    from mellea_ibm.rits import RITSBackend, RITSModelIdentifier  # type: ignore

    m_session = MelleaSession(
        RITSBackend(
            RITSModelIdentifier(endpoint=backend_endpoint, model_name=model_id),
            api_key=backend_api_key,
            model_options={"timeout": backend_req_timeout},
        )
    )
```
Contributor

We should also remove this and somehow add it through our internal library. We don't want to expose this.

Contributor

@tuliocoppola, sorry should've spotted this earlier; last change request!

Contributor Author

@jakelorocco
I actually went through this specific matter with @nrfulton; we basically agreed that there's no harm: the service name is not a trade secret or anything, and there's no implementation code exposed, just 2 class names.

I'm not sure how we could make this work easily if not like that, and we do need to support this backend asap for the decomposition pipeline.

One thing that might be good is to omit the "rits" option from the CLI's "--backend", and trigger it by providing an in-line environment variable like this:
BACKEND_OVERRIDE=rits m decompose run ...

Just so people don't mistakenly try to use it when looking at the possible values in the CLI's help strings.

Contributor Author

If you know an easy way for making this work through the internal library, then we can do it.

Contributor

Oh okay; if it's already been addressed, I'm fine with it as is. I agree that it's not exposing implementation details here.

@jakelorocco jakelorocco merged commit 5116b21 into generative-computing:main Sep 4, 2025
2 of 4 checks passed
yelkurdi pushed a commit to yelkurdi/mellea_tbf that referenced this pull request Sep 7, 2025
…rompt_modules (generative-computing#105)

* Implements "prompt_modules" and complete refactor of the "decompose" feature

* typo: missing period

* minor fix: changed the "NotRequired" import

* fix: minor fixes

* moves prompt_modules to utils

* moves decompose modules to appropriate path

* refactor: moves prompt_modules to cli scope

Signed-off-by: Tulio Coppola <tulio.cppl@icloud.com>

* adds README.md to write later

Signed-off-by: Tulio Coppola <tulio.cppl@icloud.com>

---------

Signed-off-by: Tulio Coppola <tulio.cppl@icloud.com>
Co-authored-by: Tulio Coppola <tuliocoppola@ibm.com>
Co-authored-by: Nathan Fulton <nathan@ibm.com>