Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add stats output #110

Merged
merged 1 commit into from
Nov 14, 2022
Merged

feat: add stats output #110

merged 1 commit into from
Nov 14, 2022

Conversation

cfabianski
Copy link
Collaborator

@cfabianski cfabianski commented Nov 10, 2022

Description

Introduces a new stats report format

It looks like this for GitLab

{
  "number_of_lines": 2813694,
  "number_of_data_types": 25,
  "data_types": [
    { "name": "Age Range", "occurrences": 4 },
    { "name": "Browser Fingerprint", "occurrences": 12 },
    { "name": "Country", "occurrences": 2 },
    { "name": "Credit", "occurrences": 9 },
    { "name": "Email Address", "occurrences": 347 },
    { "name": "Emails", "occurrences": 46 },
    { "name": "Firstname", "occurrences": 14 },
    { "name": "Fullname", "occurrences": 294 },
    { "name": "Geographic", "occurrences": 14 },
    { "name": "Group Membership", "occurrences": 41 },
    { "name": "Health Records", "occurrences": 1 },
    { "name": "IP address", "occurrences": 4 },
    { "name": "Image", "occurrences": 54 },
    { "name": "Interactions", "occurrences": 45 },
    { "name": "Interests", "occurrences": 4 },
    { "name": "Job Titles", "occurrences": 5 },
    { "name": "Lastname", "occurrences": 13 },
    { "name": "Likes", "occurrences": 2 },
    { "name": "Links clicked", "occurrences": 2 },
    { "name": "Passwords", "occurrences": 136 },
    { "name": "Physical Address", "occurrences": 14 },
    { "name": "Spoken Languages", "occurrences": 17 },
    { "name": "Telephone Number", "occurrences": 24 },
    { "name": "Unique Identifier", "occurrences": 223 },
    { "name": "Username", "occurrences": 415 }
  ]
}

ToDo

  • Update LoC to use only the file we really scanned (will be done in another PR)

Checklist

  • I've added test coverage that shows my fix or feature works as expected.
  • I've updated or added documentation if required.
  • I've included usage information in the description if CLI behavior was updated or added.
  • PR title follows Conventional Commits format

@swarmia
Copy link

swarmia bot commented Nov 10, 2022

✅  Linked to AMA-3165 · Benchmark Output
➡️  Part of AMA-2947 · OSS - Curio CLI

@cfabianski cfabianski changed the title chore: restructure output feat: add status output Nov 10, 2022
@cfabianski cfabianski marked this pull request as ready for review November 14, 2022 17:17
@cfabianski cfabianski merged commit e4c1a31 into main Nov 14, 2022
@cfabianski cfabianski deleted the AMA-3165/add-benchmark-option branch November 14, 2022 17:17
@cfabianski cfabianski mentioned this pull request Nov 14, 2022
4 tasks
@cfabianski cfabianski changed the title feat: add status output feat: add stats output Nov 18, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant