Assess test target execution time & define test schedule #2037

smlambert · 2020-11-03T22:10:50Z

We have nightly and weekly targets defined in the build pipelines. We eventually want to enable the entire 'grid' of test levels x groups at the project for all platforms that we release. We will embrace some notion of graduated testing to conserve machines, as we can not run all testing every night.

This issue will gather test duration times for all top-level test targets on all platforms for all versions to attempt to develop a schedule that works with the current machine capacity (and/or advice where we are needing to augment capacity), ideally creating a script where this can be rerun/revisited on a quarterly cadence.

Average Duration for Nightly test targets:

version	target	xlinux avgDuration (mins)	mac avgDuration (mins)	aix avgDuration (mins)	aarch64	win64	s390x
jdk8	sanity.openjdk	22.78	41.04	101	21	85	120
	sanity.system	135.49	158.79	135	76	66	210
	extended.system	91.29	128.88	75	14	95	150
	sanity.perf	47.02	33.08	8	14	55	127
	sanity.functional	81.5	83	120	223	83	128
	extended.functional	221	112	270	120	141	174
		~599 min (~10hrs)	~557 min (~9.2hrs)	~694 (~11.5hrs)	~468 (~7.8hrs)	~525 (~8.75hrs)	~909 (~15hrs)

Average Duration for Weekly test targets: (TBD)

version	target	xlinux avgDuration (TRSS query)
jdk8	extended.openjdk	TBD query
	special.functional	TBD query
	extended.perf	TBD query
	sanity.external

andrew-m-leonard · 2020-11-11T09:56:14Z

fyi: #2050

andrew-m-leonard · 2020-11-11T10:18:16Z

#2051

smlambert · 2020-11-24T18:56:30Z

Updated average nightly test execution times for xlinux and mac in the table in an earlier comment. Currently if all top-level targets were run serially on a machine... should complete at around 10hrs for jdk8. Note: the execution time varies across platforms (for example, based on information shared earlier in this issue, on mac shows a 9.2hr average execution time). This exercise needs to eventually take that into account, but for a spitball initial estimate, we will assume ~10hr execution time .

We do not run test targets serially, but rather try to divide and queue the top-level targets up across the machines we have. (and multiplied by 4 impls, hotspot, dragonwell, openj9, openj9XL and for jdk11 x 5 impls with addition of corretto on xlinux, we additionally run sanity.openjdk and sanity.system against upstream builds for jdk8/jdk11 on xlinux/aarch64/win64). 40/50/30 hrs of test execution time nightly per each version respectively.

Now to look at machine resources, execution time has to be shared across 19 xlinux machines and across 7 mac machines and 3 aix machines. If all test targets were granularly divided in 1hr segments, and if no other jobs utilize test machines and all machines are online, the shortest completion time for a nightly build is execution time % number of machines. In reality, the queuing/scheduling across Jenkins resources is never entirely optimal, nonetheless, if the execution time % num of machines is too large we will need to consider a different schedule or more machines or not using the default set of tests on that platform, but rather a reduced set of tests.

Note: since only a small percentage of functional tests are tagged for hotspot impl (more are applicable, TODO: review and tag the set), it reduces the execution time for sanity.functional and extended.functional for that impl (by hrs).

Version	Platform/Spec	Nightly Execution Time (impls x versions x avgDuration in hrs)	Num of Test Machines	Execution Time / Machines (shortest completion time possible)
jdk8	xlinux	42
jdk11	xlinux	52
jdk15	xlinux	32
all	xlinux	126 hrs	19	6.63 hrs
jdk8	mac	30
jdk11	mac	30
jdk15	mac	30
all	mac	90 hrs	7	12.86 hrs
jdk8	aix	14
jdk11	aix	14
jdk15	aix	14
all	aix	42 hrs	3	14 hrs
jdk8	aarch64	16
jdk11	aarch64	24
jdk15	aarch64	24
all	aarch64	64 hrs	10	6.4 hrs
jdk8	win64 + win32	54
jdk11	win64 + win32	45
jdk15	win64 + win32	45
all	win64	144 hrs	9 online/3 offline	16 hrs (reduces to 12hrs if all machines online)
jdk8	s390x	45
jdk11	s390x	45
jdk15	s390x	45
all	s390x	135 hrs	4	33.75 hrs
jdk8	ppc64le	24
jdk11	ppc64le	24
jdk15	ppc64le	24
all	ppc64le	72 hrs	9	8 hrs

smlambert · 2020-11-26T04:03:56Z

In the middle of this assessment, I have also managed to locate a similar assessment done in April 2019, adding it for completeness.

smlambert · 2022-10-04T17:21:20Z

Closing this as stale and no longer relevant.

The queries linked in the table above (which call an API in TRSS) are still valid and could be used to tabulate data in the future should it be needed, example:

https://trss.adoptopenjdk.net/api/getTestAvgDuration?level=sanity&jdkVersion=8&group=openjdk&platform=x86-64_linux

smlambert added the question label Nov 3, 2020

smlambert added this to the November 2020 milestone Nov 3, 2020

smlambert added this to TODO in aqa-tests via automation Nov 3, 2020

smlambert self-assigned this Nov 3, 2020

This was referenced Nov 3, 2020

enable Aarch64 jdk8 sanity and extended.functional nightly builds adoptium/temurin-build#2207

Closed

Revisit weekly test list definitions for all platforms adoptium/temurin-build#2189

Closed

smlambert mentioned this issue Nov 13, 2020

jdk11 Hotspot zLinux tests take too long to run... #2051

Closed

smlambert added this to To do in Top Priorities via automation Nov 17, 2020

adamfarley mentioned this issue Dec 1, 2020

Retrospective for October 2020 releases AdoptOpenJDK/TSC#181

Closed

smlambert modified the milestones: November 2020, December 2020 Dec 8, 2020

smlambert modified the milestones: December 2020, January 2021 Jan 6, 2021

adamfarley mentioned this issue Jan 20, 2021

Retrospective for January 2021 Releases AdoptOpenJDK/TSC#195

Closed

smlambert modified the milestones: January 2021, February 2021 Feb 5, 2021

smlambert mentioned this issue Feb 22, 2021

Add a 'test target execution time' widget adoptium/aqa-test-tools#364

Open

smlambert added the canosp_w21 label Feb 23, 2021

smlambert added this to To do in CanOSP W21 Feb 23, 2021

smlambert removed this from To do in Top Priorities Mar 1, 2021

smlambert removed this from the February 2021 milestone Mar 1, 2021

smlambert added help wanted and removed canosp_w21 labels Apr 14, 2021

adamfarley mentioned this issue Sep 16, 2021

Retrospective for September 2021 Releases adoptium/adoptium#77

Closed

12 tasks

smlambert closed this as completed Oct 4, 2022

smlambert removed this from To do in CanOSP W21 Mar 4, 2024

smlambert mentioned this issue Mar 4, 2024

Provide AQA Test metrics per release #5121

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Assess test target execution time & define test schedule #2037

Assess test target execution time & define test schedule #2037

smlambert commented Nov 3, 2020 •

edited

Loading

andrew-m-leonard commented Nov 11, 2020

andrew-m-leonard commented Nov 11, 2020

smlambert commented Nov 24, 2020 •

edited

Loading

smlambert commented Nov 26, 2020

smlambert commented Oct 4, 2022

Assess test target execution time & define test schedule #2037

Assess test target execution time & define test schedule #2037

Comments

smlambert commented Nov 3, 2020 • edited Loading

andrew-m-leonard commented Nov 11, 2020

andrew-m-leonard commented Nov 11, 2020

smlambert commented Nov 24, 2020 • edited Loading

smlambert commented Nov 26, 2020

smlambert commented Oct 4, 2022

smlambert commented Nov 3, 2020 •

edited

Loading

smlambert commented Nov 24, 2020 •

edited

Loading