This repository was archived by the owner on Oct 29, 2024. It is now read-only.

Conversation

Contributor

@artms artms commented Nov 3, 2017

When the Phabricator Jenkins plugin parses a coverage file, it checks each covered file on the slave to find which source directory contains it. This can be expensive if you have a large code base, a large number of covered source directories, or high latency between master and slave (e.g. on another coast); this actually happened with one project, where such a "check" was taking 1.5h to complete.
Instead of checking individual files, we collect all the needed files and make a bulk request. PathResolverChooseMultiCallable is serialized by Jenkins and sent to the slave, which does the check efficiently and returns the result. This might cause extreme memory usage for an extremely large code base; if that happens, the bulk request will have to be chunked.
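A minimal sketch of the bulk-resolution idea, assuming Jenkins' MasterToSlaveFileCallable / FilePath.act remoting API. The class name BulkChooseCallable and its exact shape are illustrative; the PR's actual implementation is PathResolverChooseMultiCallable:

    // Illustrative bulk resolver: one remoting round trip instead of one per file.
    // (BulkChooseCallable is a hypothetical name; the PR's class is PathResolverChooseMultiCallable.)
    import hudson.FilePath;
    import hudson.remoting.VirtualChannel;
    import jenkins.MasterToSlaveFileCallable;

    import java.io.File;
    import java.io.IOException;
    import java.util.HashMap;
    import java.util.List;
    import java.util.Map;

    class BulkChooseCallable extends MasterToSlaveFileCallable<Map<String, String>> {
        private final List<String> sourceDirs; // candidate source roots, relative to the workspace
        private final List<String> fileNames;  // every file name mentioned in the coverage report

        BulkChooseCallable(List<String> sourceDirs, List<String> fileNames) {
            this.sourceDirs = sourceDirs;
            this.fileNames = fileNames;
        }

        @Override
        public Map<String, String> invoke(File workspace, VirtualChannel channel) throws IOException {
            // Runs on the slave: check every file against every candidate root locally.
            Map<String, String> resolved = new HashMap<String, String>();
            for (String fileName : fileNames) {
                for (String dir : sourceDirs) {
                    if (new File(new File(workspace, dir), fileName).isFile()) {
                        resolved.put(fileName, dir);
                        break;
                    }
                }
            }
            return resolved;
        }
    }

    // Master side: a single act() call resolves every file at once.
    // Map<String, String> roots = workspace.act(new BulkChooseCallable(sourceDirs, fileNames));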

@coveralls

Coverage Status

Coverage increased (+0.2%) to 90.799% when pulling ad43cd2 on artms:bulk_file_check into 6cfd526 on uber:master.

NodeList classes = entry.getKey();
List<String> sourceDirs = entry.getValue();

// Collect all filenames in coverage report
Contributor


We can actually make this a lot more efficient.

Phab already knows the files affected as part of the diff, in the same spirit as #151.

We can just collect file names from what's modified, check state only on those files, and ignore the rest of the files in the coverage report. That will mitigate any large memory concerns as well.
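Roughly what that suggestion looks like in code (a hypothetical sketch; how the set of changed files is obtained from the Phabricator differential is assumed here, not taken from the plugin):

    import java.util.ArrayList;
    import java.util.List;
    import java.util.Set;

    import org.apache.commons.io.FilenameUtils;
    import org.w3c.dom.NodeList;

    class CoverageFileFilter {
        private static final String NODE_FILENAME = "filename"; // Cobertura class attribute

        // Keep only coverage entries whose file name is touched by the diff.
        static List<String> fileNamesTouchedByDiff(NodeList classes, Set<String> changedFiles) {
            List<String> fileNames = new ArrayList<String>();
            for (int i = 0; i < classes.getLength(); i++) {
                String fileName = classes.item(i).getAttributes()
                        .getNamedItem(NODE_FILENAME).getTextContent();
                if (changedFiles.contains(FilenameUtils.getName(fileName))) {
                    fileNames.add(fileName);
                }
            }
            return fileNames;
        }
    }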

Contributor Author


@kageiit - if we check only modified files, won't we miss files which are covered by new tests added to the code base? E.g. if I simply add a new unit test which tests some real code without changing that code itself, we will not even calculate its line coverage?

Contributor

@kageiit kageiit Nov 3, 2017


We don't need to calculate the coverage for the main file. We just need aggregates to report back, which we can still do without checking whether the file exists. Processing line coverage for every line will still be useless, as we cannot show it in the UI.

List<String> sourceDirs = entry.getValue();

// Collect all filenames in coverage report
List<String> fileNames = new LinkedList<String>();


Drive-by comment: from what I've read recently, either ArrayList or ArrayDeque is almost always better than LinkedList; this is now checked by ErrorProne
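For reference, the change being asked for (illustrative): an ArrayList avoids LinkedList's per-element node allocations and has better locality when the list is iterated later.

    // ArrayList instead of LinkedList for the collected file names.
    List<String> fileNames = new ArrayList<String>();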

Contributor Author


Thanks, fixed

@coveralls

Coverage Status

Coverage increased (+0.2%) to 90.799% when pulling 8fc0ff7 on artms:bulk_file_check into 6cfd526 on uber:master.

Map<String, String> detectedSourceRoots = new PathResolver(workspace, sourceDirs).choose(fileNames);

// Loop over all files in the coverage report
for (int i = 0; i < classes.getLength(); i++) {
Contributor


can replace this for loop with for(fileName: fileNames)

Contributor Author

@artms artms Nov 7, 2017


We need the XML nodes:

            // Loop over all files in the coverage report
            for (int i = 0; i < classes.getLength(); i++) {
                Node classNode = classes.item(i);
                String fileName = classNode.getAttributes().getNamedItem(NODE_FILENAME).getTextContent();

                if (includeFileNames != null && !includeFileNames.contains(FilenameUtils.getName(fileName))) {
                    continue;
                }

                fileName = join(detectedSourceRoots.get(fileName), fileName);

                SortedMap<Integer, Integer> hitCounts = internalCounts.get(fileName);
                if (hitCounts == null) {
                    hitCounts = new TreeMap<Integer, Integer>();
                }

                NodeList children = classNode.getChildNodes();

Can you be more specific?

Contributor


I'm saying we are looping twice over the same things. There is an opportunity to DRY this up.

Since we already know the filenames we care about from the previous loop, we don't have to repeat the same logic again.

String detectedSourceRoot = new PathResolver(workspace, sourceDirs).choose(fileName);
fileName = join(detectedSourceRoot, fileName);
// Loop over all files which are needed for coverage report
for (int i = 0; i < fileNames.size(); i++) {
Contributor Author


@kageiit - now I'm not looping through the same nodes again, although it requires us to collect the childNodes during the first pass for this loop to work.
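A rough sketch of the resulting single-pass shape, reusing the names from the diff above; the parallel childNodes list that carries the cached nodes between the two phases is illustrative:

    // First pass over the XML: record both the file name and its child nodes,
    // so nothing has to be re-read from the DOM later.
    List<String> fileNames = new ArrayList<String>();
    List<NodeList> childNodes = new ArrayList<NodeList>();
    for (int i = 0; i < classes.getLength(); i++) {
        Node classNode = classes.item(i);
        String fileName = classNode.getAttributes().getNamedItem(NODE_FILENAME).getTextContent();
        if (includeFileNames != null && !includeFileNames.contains(FilenameUtils.getName(fileName))) {
            continue;
        }
        fileNames.add(fileName);
        childNodes.add(classNode.getChildNodes());
    }

    // One bulk remoting round trip resolves every source root at once.
    Map<String, String> detectedSourceRoots = new PathResolver(workspace, sourceDirs).choose(fileNames);

    // Second loop only touches the files we kept, using the cached child nodes.
    for (int i = 0; i < fileNames.size(); i++) {
        String fileName = join(detectedSourceRoots.get(fileNames.get(i)), fileNames.get(i));
        NodeList children = childNodes.get(i);
        // ... accumulate per-line hit counts from children for fileName ...
    }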

@coveralls

Coverage Status

Coverage increased (+0.2%) to 90.793% when pulling 8a2362c on artms:bulk_file_check into 6cfd526 on uber:master.

@kageiit kageiit merged commit 5f39fa8 into uber-archive:master Nov 10, 2017