Additional EDA on TestGrid Data set #17

MichaelClifford · 2020-09-22T13:30:14Z

To close issue #15 and build upon the initial EDA work done in #16 there are a number of additional questions that we would like answered about the TestGrid dataset. Specifically:

How comparable are the testgrids?
How do we analyze them in aggregate to learn from their combined behavior?
How many/ which tests do they all have in common?
Are their time series dates comparable?
Are there sub-groups that should only be compared with one another?
Is looking at the grid matrices independent of test names a valid approach for issue identification?
What is the expected behavior of a test over time across multiple jobs.
How does the entire test platform/specific tests perform on a given day?
How does the entire test platform behavior evolve over time.
Is there sufficient data here for useful ML approaches?
Can we develop some meaningful alerting/ problem identification with the results of the above questions?

Acceptance Criteria:

Notebook that address the questions above.

Add AICoE CI configuration example

MichaelClifford mentioned this issue Oct 27, 2020

In-depth TestGrid EDA Notebook #27

Merged

tumido added a commit to tumido/ocp-ci-analysis that referenced this issue Nov 3, 2020

Merge pull request aicoe-aiops#17 from 4n4nd/master

8ec0b43

Add AICoE CI configuration example

aakankshaduggal closed this as completed Nov 5, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Additional EDA on TestGrid Data set #17

Additional EDA on TestGrid Data set #17

MichaelClifford commented Sep 22, 2020 •

edited

Loading

Additional EDA on TestGrid Data set #17

Additional EDA on TestGrid Data set #17

Comments

MichaelClifford commented Sep 22, 2020 • edited Loading

MichaelClifford commented Sep 22, 2020 •

edited

Loading