Add support for collecting fix commits and (PRs and issues) by ziadhany · Pull Request #1 · aboutcode-data/vulnerablecode-vcs-collector

ziadhany · 2026-03-13T19:00:56Z

Issue:

fedcode-next: Extract fix commits from pull requests and issues body or comments in search for CVE-related messages aboutcode-org/vulnerablecode#2002

Related PRs:

Add support for collecting GitHub/GitLab vulnerability-related issues and pull requests aboutcode-org/vulnerablecode#2008

Signed-off-by: ziad hany <ziadhany2016@gmail.com>

Make sure the pipeline throws an error if no token is provided. Update the pipeline to use repo secrets and avoid env secrets for GitHub Actions.
Signed-off-by: ziad hany <ziadhany2016@gmail.com>

ziadhany · 2026-03-17T00:06:34Z

@keshav-space, please have a look when you have time. I've run the pipelines and generated the data; see:
https://github.com/ziadhany/vulnerablecode-vcs-collector

TG1999

Please add tests

Add more target repo for fix commits collection Signed-off-by: ziad hany <ziadhany2016@gmail.com>

ziadhany · 2026-04-10T22:01:29Z

@TG1999 I just added a test, please have a look once you have some time.

Signed-off-by: ziad hany <ziadhany2016@gmail.com>

pombredanne

Can you elaborate on your approach and design?

Why would this code not be part of VulnerableCode? I am not sure I understand the working logic here. My understanding was that we have these potential improvers:

- input with advisories in VCIO, and then we need to extract commits and patches from the reference URLs
- input with fixed PURLs for an advisory in VCIO, and then we need to determine if we can get a commit out of it
- input with fixed PURLs for an advisory in VCIO, and then we need to find in the commit logs (or PRs, or issues) if we have a good fix commit

... and some improvers/importers that scout the VCS commit logs and PRs of a specific package, possibly between two versions, to find whether one is a fix commit or patch, and then either improve an existing advisory (when we are focused on a specific advisory) or create a new advisory from the commit data.

In that sense, the list of targets is NOT fixed, but instead something that is dynamically computed from the actual data in VCIO?
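To make the first kind of improver above concrete, here is a minimal sketch of deriving repositories and commits from advisory reference URLs, which is also one way a target list could be computed from data rather than fixed. It is only an illustration under stated assumptions: the URL shapes it recognizes (GitHub `/commit/<sha>` and GitLab `/-/commit/<sha>`) are assumed, and `FixCommit`, `parse_commit_url` and `collect_targets` are hypothetical names, not code from this PR or from VulnerableCode.

```python
import re
from typing import NamedTuple, Optional

# Assumption: only these two hosts and URL shapes are handled.
COMMIT_URL = re.compile(
    r"^https?://(?P<host>github\.com|gitlab\.com)/"
    r"(?P<path>[^?#]+?)/(?:-/)?commit/(?P<sha>[0-9a-fA-F]{7,40})(?:[/?#.].*)?$"
)


class FixCommit(NamedTuple):
    """Hypothetical record: the repository URL and the commit found in it."""
    vcs_url: str
    commit_id: str


def parse_commit_url(url: str) -> Optional[FixCommit]:
    """Return a FixCommit if `url` looks like a commit URL, else None."""
    match = COMMIT_URL.match(url.strip())
    if not match:
        return None
    vcs_url = f"https://{match['host']}/{match['path']}"
    return FixCommit(vcs_url, match["sha"].lower())


def collect_targets(reference_urls):
    """Return (sorted distinct repo URLs, commits) found in reference URLs."""
    commits = [c for c in map(parse_commit_url, reference_urls) if c]
    repos = sorted({c.vcs_url for c in commits})
    return repos, commits
```

References that are not commit URLs (issues, advisories, exploit write-ups) are simply skipped here; deciding what to do with them is a separate design question.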

ziadhany · 2026-04-30T19:03:26Z

@pombredanne The main idea behind this design is that cloning repositories, parsing commit messages, and querying the GitHub/GitLab APIs (which involves handling rate limits) take time. Running these tasks directly in VCIO would overwhelm the pipeline workers.

Because of this, I thought it would be better to create a mirror and import this data using a single pipeline in VulnerableCode:

Add the VCS Collector importer aboutcode-org/vulnerablecode#2254
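As an illustration of the rate-limit handling mentioned above, here is a small sketch of deciding how long to pause before the next request, based on the `retry-after`, `x-ratelimit-remaining` and `x-ratelimit-reset` response headers that the GitHub REST API returns. The function name `seconds_to_wait` and the lower-case header dict are assumptions made for this example; this is not the collector's actual code, and GitLab uses differently named headers.

```python
import time


def seconds_to_wait(headers, now=None):
    """Return how many seconds to pause before the next API request.

    Assumption: `headers` is a dict of response headers with lower-case keys.
    """
    now = time.time() if now is None else now

    # An explicit retry-after value takes priority when present.
    retry_after = headers.get("retry-after")
    if retry_after is not None:
        return max(0.0, float(retry_after))

    # Otherwise wait until the reset time once the quota is exhausted.
    if headers.get("x-ratelimit-remaining") == "0":
        reset = headers.get("x-ratelimit-reset")  # UTC epoch seconds
        if reset is not None:
            return max(0.0, float(reset) - now)

    return 0.0
```

A collector loop would call `time.sleep(seconds_to_wait(response_headers))` between requests, which is the kind of waiting that is cheap in a scheduled mirror job but expensive inside a shared pipeline worker.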

There is also a problem with computing Git repo targets dynamically: some Git repos are just vulnerability data sources or exploit repos, not actual source code. It is not easy to differentiate between a Git repo that contains source code and one that merely contains vulnerability data.

We also have other PRs addressing related parts of this workflow:

Extracting commits and patches from reference URLs:

Add support for Reference Fix Commits improver aboutcode-org/vulnerablecode#2163

An API to query using commit_id, purl, or vcs_url:

Add API/ UI support for Patch/PackageCommitPatch aboutcode-org/vulnerablecode#2179

However, we do not currently have a way to scout VCS commit logs and PRs for a specific package (e.g., between two versions) to determine if one is a valid fix commit or patch.
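A very rough sketch of what such scouting could start from is shown below: scan commit messages for CVE identifiers and group the matching commits per CVE. This only yields candidates, since a message mentioning a CVE does not prove the commit is a valid fix. The helper names and the `(commit_id, message)` input shape are assumptions for illustration, and the sample data in the usage note is made up.

```python
import re

# CVE identifiers have the form CVE-<4-digit year>-<4 or more digits>.
CVE_ID = re.compile(r"\bCVE-\d{4}-\d{4,}\b", re.IGNORECASE)


def find_cve_ids(text):
    """Return distinct CVE ids in `text`, upper-cased, in order of appearance."""
    seen = []
    for match in CVE_ID.findall(text or ""):
        cve = match.upper()
        if cve not in seen:
            seen.append(cve)
    return seen


def candidate_fix_commits(commits):
    """Map each CVE id to the commit ids whose message mentions it.

    Assumption: `commits` is an iterable of (commit_id, message) pairs,
    for example the log between two release tags.
    """
    found = {}
    for commit_id, message in commits:
        for cve in find_cve_ids(message):
            found.setdefault(cve, []).append(commit_id)
    return found
```

For example, with the made-up log `[("aaa111", "Fix overflow (CVE-2099-12345)"), ("bbb222", "Update docs")]`, only `aaa111` is reported, under `CVE-2099-12345`. The same `find_cve_ids` helper could be applied to PR and issue bodies or comments.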

I think I should think about this more.

ziadhany added 4 commits March 13, 2026 21:00

Add initial support for collecting fix commits and (PRs and issues)

32707d8

Signed-off-by: ziad hany <ziadhany2016@gmail.com>

Update issues_prs_collector to use packageurl and fix a typo

d341543

Signed-off-by: ziad hany <ziadhany2016@gmail.com>

Update docs, Simplify fix commits and Fix CI

7df4ae4

Signed-off-by: ziad hany <ziadhany2016@gmail.com>

Rename the variable name for env secrets pipeline

093f159

Signed-off-by: ziad hany <ziadhany2016@gmail.com>

ziadhany requested review from keshav-space March 16, 2026 18:11

ziadhany added 2 commits March 16, 2026 21:28

Update the pipeline to use python-dotenv

327395f

Signed-off-by: ziad hany <ziadhany2016@gmail.com>

Update the docs

a5511f2

Make sure the pipeline throws an error if no token is provided. Update the pipeline to use repo secrets and avoid env secrets for GitHub Actions.
Signed-off-by: ziad hany <ziadhany2016@gmail.com>

ziadhany mentioned this pull request Apr 9, 2026

fedcode-next: Extract fix commits from pull requests and issues body or comments in search for CVE-related messages aboutcode-org/vulnerablecode#2002

Open

TG1999 requested changes Apr 10, 2026

Add a test for collect fix commits and issue prs

a0e821f

Add more target repo for fix commits collection Signed-off-by: ziad hany <ziadhany2016@gmail.com>

ziadhany requested a review from TG1999 April 10, 2026 21:59

This was referenced Apr 14, 2026

Add support for collecting GitHub/GitLab vulnerability-related issues and pull requests aboutcode-org/vulnerablecode#2008

Closed

fedcode-next: Code pipeline and models to continuously automatically collect fix commits aboutcode-org/vulnerablecode#1721

Open

Add a License file

32ce455

Signed-off-by: ziad hany <ziadhany2016@gmail.com>

pombredanne requested changes Apr 30, 2026

ziadhany wants to merge 8 commits into aboutcode-data:main from ziadhany:vcs-collector
