From 23a0d6710f3b73378576ed6a92f43527af1e8d76 Mon Sep 17 00:00:00 2001 From: "Sean P. Goggins" Date: Thu, 22 Sep 2022 16:50:04 -0500 Subject: [PATCH] schema image update --- docs/source/schema/regularly_used_data.rst | 90 +++++++++++----------- docs/source/schema/unused_data.rst | 26 +++---- docs/source/schema/working_tables.rst | 16 ++-- 3 files changed, 66 insertions(+), 66 deletions(-) diff --git a/docs/source/schema/regularly_used_data.rst b/docs/source/schema/regularly_used_data.rst index 89eb838bcf..1b21835b85 100644 --- a/docs/source/schema/regularly_used_data.rst +++ b/docs/source/schema/regularly_used_data.rst @@ -7,7 +7,7 @@ List of Regularly Used Data Tables In Augur * Worker: Facade worker collects, and also stores platform user information in the commits table. - .. image:: commits.png + .. image:: images/commits.png :width: 200 * Contributor_affiliations: A list of emails and domains, with start and end dates for individuals to have an organizational affiliation. @@ -16,7 +16,7 @@ List of Regularly Used Data Tables In Augur * Can be edited so that an Augur instance can resolve a larger list of affiliations. * These mappings are summarized in the dm_ tables. - .. image:: contributor_affiliations.png + .. image:: images/contributor_affiliations.png :width: 200 * Contributor_repo - Storage of a snowball sample of all the repositories anyone in your schema has accessed on GitHub. So, for example, if you wanted to know all the repositories that people on your project contributed to, this would be the table. @@ -24,7 +24,7 @@ List of Regularly Used Data Tables In Augur * Contributor_breadth_worker populates this table * Population of this table happens last, and can take a long time. - .. image:: contributor_repo.png + .. image:: images/contributor_repo.png :width: 200 * Contributors - These are all the contributors to a project/repo. In Augur, all types of contributions create a contributor record. This includes issue comments, pull request comments, label addition, etc. This is different than how GitHub counts contributors; they only include committers. @@ -37,7 +37,7 @@ List of Regularly Used Data Tables In Augur * GitLab Merge Request Worker * Facade Worker - .. image:: contributors.png + .. image:: images/contributors.png :width: 200 * Contributors_aliases - These are all the alternate emails that the same contributor might use. These records arise almost entirely from the commit log. For example, if I have two different emails on two different computers that I use when I make a commit, then an alias is created for whatever the 2nd to nth email Augur runs across. If a user’s email cannot be resolved, it is placed in the unresolved_commit_emails table. Coverage is greater than 98% since Augur 1.2.4. @@ -46,7 +46,7 @@ List of Regularly Used Data Tables In Augur * Facade Worker - .. image:: contributors_aliases.png + .. image:: images/contributors_aliases.png :width: 200 * Discourse_insights - There are nine specific discourse act types identified by the computational linguistic algorithm that underlies the discourse insights worker. This worker analyzes each comment on each issue or pull request sequentially so that context is applied when determining the discourse act type. These types are: @@ -65,7 +65,7 @@ List of Regularly Used Data Tables In Augur * Discourse Insights Worker - .. image:: discourse_insights.png + .. image:: images/discourse_insights.png :width: 200 * issue_assignees || issue_events || issue_labels @@ -74,7 +74,7 @@ List of Regularly Used Data Tables In Augur * Github or Gitlab Issues Worker - .. image:: issue_assignees.png + .. image:: images/issue_assignees.png :width: 200 * issue_message_ref - A link between the issue and each message stored in the message table. @@ -83,7 +83,7 @@ List of Regularly Used Data Tables In Augur * Github or Gitlab Issues Worker - .. image:: issue_message_ref.png + .. image:: images/issue_message_ref.png :width: 200 * issues - Is all the data related to a GitHub Issue. @@ -92,12 +92,12 @@ List of Regularly Used Data Tables In Augur * Github or Gitlab Issues Worker - .. image:: issues.png + .. image:: images/issues.png :width: 200 * Message - every pull request or issue related message. These are then mapped back to either pull requests, or issues, using the __msg_ref tables - .. image:: message.png + .. image:: images/message.png :width: 200 * Message_analysis: Two factors evaluated for every pull request on issues message: What is the sentiment of the message (positive or negative), and what is the novelty of the message in the context of other messages in that repository. @@ -106,7 +106,7 @@ List of Regularly Used Data Tables In Augur * Message Insights Worker - .. image:: message_analysis.png + .. image:: images/message_analysis.png :width: 200 * Message_analysis_summary: A summary level representation of the granular data in message_analysis. @@ -115,7 +115,7 @@ List of Regularly Used Data Tables In Augur * Message Insights Worker - .. image:: message_analysis_summary.png + .. image:: images/message_analysis_summary.png :width: 200 * Platform: Reference data with two rows: one for GitHub, one for GitLab. @@ -124,7 +124,7 @@ List of Regularly Used Data Tables In Augur * Platform_worker - .. image:: platform.png + .. image:: images/platform.png :width: 200 * Pull_request_analysis: A representation of the probability of a pull request being merged into a repository, based on analysis of the properties of previously merged pull requests in a repository. (Machine learning worker) @@ -133,50 +133,50 @@ List of Regularly Used Data Tables In Augur * Pull request analysis worker - .. image:: pull_request_analysis.png + .. image:: images/pull_request_analysis.png :width: 200 * pull_request_assignees || pull_request_commits || pull_request_events || pull_request_files || pull_request_labels || pull_request_message_ref - All the data related to pull requests. Every pull request will be in the pull_requests data. - .. image:: pull_request_assignees.png + .. image:: images/pull_request_assignees.png :width: 200 - .. image:: pull_request_commits.png + .. image:: images/pull_request_commits.png :width: 200 - .. image:: pull_request_events.png + .. image:: images/pull_request_events.png :width: 200 - .. image:: pull_request_files.png + .. image:: images/pull_request_files.png :width: 200 - .. image:: pull_request_labels.png + .. image:: images/pull_request_labels.png :width: 200 - .. image:: pull_request_ref.png + .. image:: images/pull_request_ref.png :width: 200 * pull_request_meta || pull_request_repo || pull_request_review_message_ref || pull_request_reviewers || pull_request_reviews || pull_request_teams || pull_requests - All the data related to pull requests. Every pull request will be in the pull_requests data. - .. image:: pull_request.png + .. image:: images/pull_request.png :width: 200 - .. image:: pull_request_meta.png + .. image:: images/pull_request_meta.png :width: 200 - .. image:: pull_request_repo.png + .. image:: images/pull_request_repo.png :width: 200 - .. image:: pull_request_review_message_ref.png + .. image:: images/pull_request_review_message_ref.png :width: 200 - .. image:: pull_request_reviewers.png + .. image:: images/pull_request_reviewers.png :width: 200 - .. image:: pull_request_reviews.png + .. image:: images/pull_request_reviews.png :width: 200 - .. image:: pull_request_teams.png + .. image:: images/pull_request_teams.png :width: 200 * Releases: Github declared software releases or release tags. For example: https://github.com/chaoss/augur/releases @@ -185,12 +185,12 @@ List of Regularly Used Data Tables In Augur * Release Worker. - .. image:: releases.png + .. image:: images/releases.png :width: 200 * Repo: A list of all the repositories. - .. image:: repo.png + .. image:: images/repo.png :width: 200 * Repo_badging: A list of CNCF badging information for a project. Reads this api endpoint: https://bestpractices.coreinfrastructure.org/projects.json @@ -199,7 +199,7 @@ List of Regularly Used Data Tables In Augur * linux_badge_worker - .. image:: repo_badging.png + .. image:: images/repo_badging.png :width: 200 * Repo_cluster_messages: Identifying which messages and repositories are clustered together. Identifies project similarity based on communication patterns. @@ -208,7 +208,7 @@ List of Regularly Used Data Tables In Augur * Clustering Worker - .. image:: repo_cluster_messages.png + .. image:: images/repo_cluster_messages.png :width: 200 * Repo_dependencies: enumerates every dependency, including dependencies that are not package managed. @@ -217,7 +217,7 @@ List of Regularly Used Data Tables In Augur * deps_worker - .. image:: repo_dependencies.png + .. image:: images/repo_dependencies.png :width: 200 * Repo_deps_libyear: (enumerates every package managed dependency) Looks up the latest release of any library that is imported into a project. Then it compares that release date, the release version of the library version in your project (and its release date), and calculates how old your version is, compared to the latest version. The resulting statistic is “libyear”. This worker runs at least once a month, so over time, you will see if your libraries are being kept up to date, or not. @@ -230,7 +230,7 @@ List of Regularly Used Data Tables In Augur * deps_libyear_worker - .. image:: repo_deps_libyear.png + .. image:: images/repo_deps_libyear.png :width: 200 * Repo_deps_scorecard: Runs the OSSF Scorecard over every repository ( https://github.com/ossf/scorecard ) : There are 16 factors that are explained at that repository location. @@ -239,12 +239,12 @@ List of Regularly Used Data Tables In Augur * deps_worker - .. image:: repo_deps_scorecard.png + .. image:: images/repo_deps_scorecard.png :width: 200 * Repo_groups: reference data. The repo groups in an augur instance. - .. image:: repo_groups.png + .. image:: images/repo_groups.png :width: 200 * Repo_info: this worker gathers metadata from the platform API that includes things like “number of stars”, “number of forks”, etc. AND it also gives us : Number of issues, number of pull requests, etc. .. THAT information we use to determine if we have collected all of the PRs and Issues associated with a repository. @@ -253,7 +253,7 @@ List of Regularly Used Data Tables In Augur * repo info worker - .. image:: repo_info.png + .. image:: images/repo_info.png :width: 200 * Repo_insights: @@ -262,7 +262,7 @@ List of Regularly Used Data Tables In Augur * Insight worker - .. image:: repo_insights.png + .. image:: images/repo_insights.png :width: 200 * Repo_insights_records: @@ -271,7 +271,7 @@ List of Regularly Used Data Tables In Augur * Insight worker - .. image:: repo_insights_records.png + .. image:: images/repo_insights_records.png :width: 200 * Repo_labor @@ -280,22 +280,22 @@ List of Regularly Used Data Tables In Augur * Value worker - .. image:: repo_labor.png + .. image:: images/repo_labor.png :width: 200 * Repo_meta: Exists to capture repo data that may be useful in the future. Not currently populated. - .. image:: repo_meta.png + .. image:: images/repo_meta.png :width: 200 * Repo_sbom_scans: This table links the augur_data schema to the augur_spdx schema to keep a list of repositories that need licenses scanned. (These are for file level license declarations, which are common in Linux Foundation projects, but otherwise not in wide use). - .. image:: repo_sbom_scans.png + .. image:: images/repo_sbom_scans.png :width: 200 * Repo_stats: Exists to capture repo data that may be useful in the future. Not currently populated. - .. image:: repo_stats.png + .. image:: images/repo_stats.png :width: 200 * Repo_topic: Identifies probable topics of conversation in discussion threads around issues and pull requests. @@ -304,7 +304,7 @@ List of Regularly Used Data Tables In Augur * Clustering Worker - .. image:: repo_topic.png + .. image:: images/repo_topic.png :width: 200 * Topic_words: Unigrams, bigrams, and trigrams associated with topics in the repo_topic table. @@ -313,7 +313,7 @@ List of Regularly Used Data Tables In Augur * Clustering Worker - .. image:: topic_words.png + .. image:: images/topic_words.png :width: 200 * Unresolved_commit_emails - emails from commits that were not initially able to be resolved using automated mechanisms. @@ -322,5 +322,5 @@ List of Regularly Used Data Tables In Augur * Facade Worker. - .. image:: unresolved_commit_emails.png + .. image:: images/unresolved_commit_emails.png :width: 200 diff --git a/docs/source/schema/unused_data.rst b/docs/source/schema/unused_data.rst index b2373529af..fbb82ebe93 100644 --- a/docs/source/schema/unused_data.rst +++ b/docs/source/schema/unused_data.rst @@ -5,65 +5,65 @@ List of Unused Data Tables In Augur * chaoss_metric_status - .. image:: chaoss_metric_status.png + .. image:: images/chaoss_metric_status.png :width: 200 * chaoss_user - .. image:: chaoss_user.png + .. image:: images/chaoss_user.png :width: 200 * commit_comment_ref - .. image:: commit_comment_ref.png + .. image:: images/commit_comment_ref.png :width: 200 * libraries - .. image:: libraries.png + .. image:: images/libraries.png :width: 200 * library_dependencies - .. image:: library_dependencies.png + .. image:: images/library_dependencies.png :width: 200 * library_version - .. image:: library_version.png + .. image:: images/library_version.png :width: 200 * lstm_anomaly_models - .. image:: lstm_anomaly_models.png + .. image:: images/lstm_anomaly_models.png :width: 200 * lstm_anomaly_results - .. image:: lstm_anomaly_results.png + .. image:: images/lstm_anomaly_results.png :width: 200 * message_sentiment - .. image:: message_sentiment.png + .. image:: images/message_sentiment.png :width: 200 * message_sentiment_summary - .. image:: message_sentiment_summary.png + .. image:: images/message_sentiment_summary.png :width: 200 * Repo_group_insights - .. image:: repo_group_insights.png + .. image:: images/repo_group_insights.png :width: 200 * repo_groups_list_serve - .. image:: repo_groups_list_serve.png + .. image:: images/repo_groups_list_serve.png :width: 200 * repo_test_coverage - .. image:: repo_test_coverage.png + .. image:: images/repo_test_coverage.png :width: 200 \ No newline at end of file diff --git a/docs/source/schema/working_tables.rst b/docs/source/schema/working_tables.rst index bc5d93b1c6..d26bca88c0 100644 --- a/docs/source/schema/working_tables.rst +++ b/docs/source/schema/working_tables.rst @@ -11,41 +11,41 @@ List of Working Data Tables In Augur * Facade Worker - .. image:: analysis_log.png + .. image:: images/analysis_log.png :width: 200 * commit_parents - this table keeps a record of parent commits that are squashed during Facade Worker execution. - .. image:: commit_parents.png + .. image:: images/commit_parents.png :width: 200 Other working tables are: * exclude - .. image:: exclude.png + .. image:: images/exclude.png :width: 200 * repos_fetch_log - .. image:: repos_fetch_log.png + .. image:: images/repos_fetch_log.png :width: 200 * settings - .. image:: settings.png + .. image:: images/settings.png :width: 200 * unknown_cache - .. image:: unknown_cache.png + .. image:: images/unknown_cache.png :width: 200 * utility_log - .. image:: utility_log.png + .. image:: images/utility_log.png :width: 200 * working_commits - .. image:: working_commits.png + .. image:: images/working_commits.png :width: 200