Added module extension for fetching CPAN dependencies #86

UebelAndre · 2025-04-23T14:44:39Z

This change introduces the perl_cpan_compiler rule and perl_cpan extension module.

The perl_cpan_compiler rule is backed by Carton and generates a lock file from a given cpanfile that can then be passed to perl_cpan to generate dependencies.

Known limitations:

XS modules are not supported.
Compilation is not hermetic and some host tools may be required (e.g. make) to compile requirements.

This change does not impact any existing rules but does add a new entry to MODULE.bazel so should be considered a minor change.

closes #83

UebelAndre · 2025-04-23T15:24:44Z

@lalten I would love it if you could take a look at the changes here since you have some experience in this domain with rules_cpan!

UebelAndre · 2025-04-23T15:48:50Z

While developing this I had to use a bootstrap script to generate the initial lock file. I'm posting it here for future use in case the compiler tool breaks and lockfiles can no longer be generated.

"""bootstrap"""

import json
import sys
import urllib.error
import urllib.request
from pathlib import Path


def deserialize_cpanfile_snapshot(content):
    """Deserialize the contents of a `cpanfile.snapshot` file.

    Args:
        content (str): The text from a `cpanfile.snapshot`

    Returns:
        dict: A mapping of the snapshot data.
    """
    results = {}

    current = ""
    container_name = ""
    for line in content.splitlines():
        text = line.strip()

        if not text or text.startswith("#"):
            continue

        if container_name and line.startswith("      "):
            key, _, value = text.partition(" ")
            results[current][container_name][key] = value
            continue

        if line.startswith("    "):
            if text.startswith("pathname:"):
                _, _, pathname = text.partition(" ")
                results[current]["pathname"] = pathname
                continue
            if text.startswith("provides:"):
                container_name = "provides"
                continue

            if text.startswith("requirements:"):
                container_name = "requirements"
                continue

        if line.startswith("  "):
            current = text
            results[current] = {
                "provides": {},
                "requirements": {},
            }
            continue

    return results


METACPAN_API_ENDPOINT = "https://fastapi.metacpan.org/release"


def _get_release(author: str, distribution: str) -> dict[str, str]:
    url = f"{METACPAN_API_ENDPOINT}/{author}/{distribution}"
    try:
        resp = urllib.request.urlopen(url).read().decode()
    except urllib.error.HTTPError as ex:
        raise RuntimeError(f"Failed to fetch {url}: {ex}") from ex
    try:
        return json.loads(resp)["release"]
    except json.JSONDecodeError as ex:
        raise RuntimeError(f"Failed to parse JSON from {url}: {ex}") from ex
    except KeyError as ex:
        raise RuntimeError(
            f"Failed to find 'release' key in JSON from {url}: {ex}. Json:\n{resp}"
        ) from ex


def sanitize_name(module):
    name, _, _ = module.rpartition("-")
    return name


def main() -> None:
    snapshot_path = Path(sys.argv[1])
    snapshot = deserialize_cpanfile_snapshot(snapshot_path.read_text())

    lockfile = {}
    for module, data in snapshot.items():
        dependencies = set()
        for req in data["requirements"]:
            for mod, mod_data in snapshot.items():
                if req in mod_data["provides"]:
                    dependencies.add(sanitize_name(mod))
                    break
        author = data["pathname"].split("/")[-2]
        release = _get_release(author, module)

        if "Path-Tiny" in module or "String-ShellQuote" in module:
            from pprint import pprint

            pprint(release)

        lockfile[sanitize_name(release["name"])] = {
            "dependencies": sorted(dependencies),
            "sha256": release["checksum_sha256"],
            "strip_prefix": module,
            "url": release["download_url"],
        }

    # Use a separate variable for the output path so the lockfile dict is not overwritten.
    lockfile_path = snapshot_path.parent / (snapshot_path.name + ".lock.json")
    lockfile_path.write_text(json.dumps(lockfile, indent=2, sort_keys=True) + "\n")


if __name__ == "__main__":
    main()
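
The requirement-to-distribution resolution in the script above can be tried without network access. This is a minimal sketch, assuming a hand-written snapshot already parsed into the dict shape that `deserialize_cpanfile_snapshot` returns. The distribution names and versions are made up, and `resolve_dependencies` is a hypothetical helper that is not part of this PR.

```python
def sanitize_name(module):
    """Drop the trailing "-<version>" from a distribution name."""
    name, _, _ = module.rpartition("-")
    return name


def resolve_dependencies(snapshot):
    """Map each distribution to the distributions providing its requirements."""
    resolved = {}
    for dist, data in snapshot.items():
        deps = set()
        for req in data["requirements"]:
            # A requirement is a module name; find the distribution that provides it.
            for other, other_data in snapshot.items():
                if req in other_data["provides"]:
                    deps.add(sanitize_name(other))
                    break
        resolved[sanitize_name(dist)] = sorted(deps)
    return resolved


# Illustrative data only; not taken from a real cpanfile.snapshot.
EXAMPLE_SNAPSHOT = {
    "Foo-Bar-1.02": {
        "provides": {"Foo::Bar": "1.02"},
        "requirements": {"Baz::Qux": "0", "strict": "0"},
    },
    "Baz-Qux-0.5": {
        "provides": {"Baz::Qux": "0.5"},
        "requirements": {},
    },
}

print(resolve_dependencies(EXAMPLE_SNAPSHOT))
```

Requirements that no distribution in the snapshot provides (such as `strict` here) are simply skipped, which matches the loop in `main()`.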

lalten · 2025-04-23T16:11:29Z

Could you summarize how the implementation is different from rules_cpan?
rules_cpan's "bootstrap" step is bazel run @rules_cpan//lock

I think it would be preferable to have just one way that works rather than two different implementations. I agree with #83 (comment) that we should move the repos closer together to increase discoverability and improve maintainability

UebelAndre · 2025-04-23T16:47:20Z

> Could you summarize how the implementation is different from rules_cpan? rules_cpan's "bootstrap" step is bazel run @rules_cpan//lock
>
> I think it would be preferable to have just one way that works rather than two different implementations. I agree with #83 (comment) that we should move the repos closer together to increase discoverability and improve maintainability

The main difference is that users do not need to separately go run carton install to generate the original cpanfile.snapshot. Once a perl_cpan_compiler target is defined, folks would simply run this target to generate the snapshot file and the Bazel lockfile.

Both implementations currently share a limitation: the dependencies need to be installable on the host, which requires some host tools. Given the interface in this PR, though, I think there's a path forward where the tool just hits the CPAN API and only operates on metadata.

Additionally, the repository rules in this PR generate a DAG of dependencies vs a flat target which can be useful when trying to debug issues in external libraries. Down the road this could also be useful for allowing mods/annotations to the packages to inject user defined alterations into the generated module (similar to @rules_rust//crate_universe:defs.bzl%crate.annotation)
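
To illustrate why a DAG is easier to inspect than a flat target, here is a rough sketch that orders packages so each one appears after its dependencies. It assumes a lockfile shaped like the output of the bootstrap script (a `dependencies` list per package). The package names are hypothetical, and `build_order` is not something the repository rules expose.

```python
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical lockfile fragment; only the "dependencies" key matters here.
LOCKFILE = {
    "App-Example": {"dependencies": ["Foo-Bar", "Baz-Qux"]},
    "Foo-Bar": {"dependencies": ["Baz-Qux"]},
    "Baz-Qux": {"dependencies": []},
}


def build_order(lockfile):
    """Return package names ordered so dependencies come before dependents."""
    graph = {name: set(entry["dependencies"]) for name, entry in lockfile.items()}
    return list(TopologicalSorter(graph).static_order())


order = build_order(LOCKFILE)
print(order)
```

With per-package nodes like this, a failure in one external library can be traced to a single node and its direct edges instead of one opaque target.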

UebelAndre · 2025-04-25T15:28:58Z

@lalten would you still be willing to do a full review if you think the direction is good?

lalten

Great work!

UebelAndre · 2025-04-28T14:32:40Z

@lalten back to you!

lalten · 2025-04-28T18:57:34Z

lgtm! I think this is better than what's currently at rules_cpan so once this lands it would be cool if you could migrate the current BCR users (which is only Lcov I believe)? Then I'd archive rules_cpan and point users at this implementation.

UebelAndre · 2025-04-28T19:15:02Z

@skeletonkey are you also able to take a look?

UebelAndre force-pushed the cpan branch 2 times, most recently from 293c30e to a12c410 Compare April 23, 2025 15:19

UebelAndre marked this pull request as ready for review April 23, 2025 15:24

UebelAndre requested a review from skeletonkey as a code owner April 23, 2025 15:24

UebelAndre mentioned this pull request Apr 23, 2025

Feature Request: Add tooling for fetching CPAN dependencies #83

Closed

Added module extension for fetching CPAN dependencies

0cdbf92

UebelAndre force-pushed the cpan branch from a12c410 to 0cdbf92 Compare April 23, 2025 16:37

UebelAndre added 3 commits April 23, 2025 10:24

Load rules_perl deps in a separate module

c31e20f

Fix module implementation

5d32168

Delete accidental commit

85c5f55

lalten reviewed Apr 28, 2025

UebelAndre and others added 10 commits April 28, 2025 07:01

expose lock json

8edd18e

update name

eb618a6

Updated extension name

779a61a

Updated README

4bbfb02

Added autogen header

4a8faf3

common visibility

8f16a49

Update perl/cpan/private/carton.bzl

52fce83

Co-authored-by: Laurenz <lalten@users.noreply.github.com>

update names

664ed19

rename

729ae1b

Update perl/cpan/private/carton.bzl

0b28d2f

Co-authored-by: Laurenz <lalten@users.noreply.github.com>

UebelAndre requested a review from lalten April 28, 2025 14:32

lalten approved these changes Apr 28, 2025

UebelAndre added 2 commits April 28, 2025 12:11

comments

bdc56ab

xs note

2e510f2

xs docs

c3472cd

skeletonkey merged commit d4e5cdb into bazel-contrib:main Apr 29, 2025
1 check passed

UebelAndre deleted the cpan branch April 29, 2025 13:01

UebelAndre mentioned this pull request Apr 29, 2025

Feature Request: Updated cpan rules to support xs modules #88

Open
