Add support for mining cpan packageURLs #731

Merged
AyanSinhaMahapatra merged 9 commits into main from minecode-pipeline-cpan on Dec 9, 2025

@AyanSinhaMahapatra
Member

Reference: #685

@AyanSinhaMahapatra AyanSinhaMahapatra force-pushed the minecode-pipeline-cpan branch 3 times, most recently from 8783ec5 to afd274c Compare September 30, 2025 22:46
Reference: #685
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
Comment thread minecode_pipelines/miners/cpan.py
cpan_packages_url = cpan_repo + "modules/02packages.details.txt.gz"
local_filename = "cpan_packages.gz"

response = requests.get(cpan_packages_url, stream=True)
Member

I would use fetch_http (from scanpipe.pipes.fetch import fetch_http) to handle downloads.

You can do:

cpan_packages_gz_download = fetch_http(cpan_packages_url)
with gzip.open(cpan_packages_gz_download.path, "rb") as f_in:
    ...
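
For illustration, here is a minimal sketch of what the elided body could do once the index is on disk. It is not the code from this PR: the function names iter_cpan_package_entries and unique_distributions are hypothetical, and the download step is left out, so the sketch starts from a local path such as the one fetch_http returns. It assumes the usual layout of 02packages.details.txt: a header block ended by a blank line, then one line per module with three whitespace-separated columns (module name, version, distribution path).

```python
import gzip


def iter_cpan_package_entries(gz_path):
    """Yield (module, version, dist_path) tuples from a gzipped CPAN packages index.

    Hypothetical helper: assumes a header block terminated by the first
    blank line, followed by three whitespace-separated columns per line.
    """
    with gzip.open(gz_path, "rt", encoding="utf-8", errors="replace") as f_in:
        in_header = True
        for line in f_in:
            line = line.strip()
            if in_header:
                # The header ends at the first blank line; skip everything before it.
                if not line:
                    in_header = False
                continue
            parts = line.split()
            if len(parts) != 3:
                # Skip blank or malformed lines rather than failing the whole run.
                continue
            yield tuple(parts)


def unique_distributions(gz_path):
    """Return the sorted, de-duplicated distribution paths listed in the index."""
    return sorted({dist for _, _, dist in iter_cpan_package_entries(gz_path)})
```

Many modules map to the same distribution archive, so de-duplicating on the third column before building packageURLs avoids emitting the same package repeatedly.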

Comment thread minecode_pipelines/miners/cpan.py Outdated
Comment thread minecode_pipelines/miners/cpan.py
Signed-off-by: Ayan Sinha Mahapatra <ayansmahapatra@gmail.com>
@keshav-space
Member

Let's use MineCodeBasePipeline to make sure these mined purls are deployed in git repos.

Signed-off-by: Ayan Sinha Mahapatra <asmahapatra@aboutcode.org>
Reference: #775
Signed-off-by: Ayan Sinha Mahapatra <asmahapatra@aboutcode.org>
@pombredanne
Member

Bump... anything left to do here?

@keshav-space keshav-space force-pushed the minecode-pipeline-cpan branch from 4c36e9d to 42659e2 Compare December 9, 2025 08:47
Signed-off-by: Ayan Sinha Mahapatra <asmahapatra@aboutcode.org>
Signed-off-by: Ayan Sinha Mahapatra <asmahapatra@aboutcode.org>
Signed-off-by: Ayan Sinha Mahapatra <asmahapatra@aboutcode.org>
@AyanSinhaMahapatra
Member Author

@AyanSinhaMahapatra AyanSinhaMahapatra merged commit 9445f0d into main Dec 9, 2025
2 checks passed
@AyanSinhaMahapatra AyanSinhaMahapatra deleted the minecode-pipeline-cpan branch December 9, 2025 11:50
4 participants