Paper review: lessons learned from building static analysis tools at google #233

bzz · 2018-07-27T08:20:05Z

First "paper review" blog post. Cross-post from medium

Closes #213

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

Review of "Lessons from building static analysis tools at Google" Signed-off-by: Alexander Bezzubov <bzz@apache.org>

campoy

Hey, could you break the long lines into shorter easier to review chunks?

Otherwise I end up having too many comments per line to understand anything.

Same for the other PR, please.

vmarkovtsev · 2018-08-02T12:45:53Z

content/post/review-building-static-analysis-tools.md

+
+{{% center %}} … {{% /center %}}
+
+[“Lessons from Building Static Analysis Tools at Google](https://cacm.acm.org/magazines/2018/4/226371-lessons-from-building-static-analysis-tools-at-google/fulltext)” by Caitlin Sadowski, Edward Aftandilian, [Alex Eagle](undefined), Liam Miller-Cushon, Ciera Jaspan presents 2 stories: history of failed attempts of integrating [FindBugs](https://github.com/findbugsproject/findbugs), a static analysis tool for Java at [Google](http://twitter.com/Google), and lessons learned from success story of incorporating extensible analysis framework, a [Tricorder project](https://research.google.com/pubs/pub43322.html), to development workflow at Google.


learned from the success story

~~, a~~ Tricorder ~~project~~

vmarkovtsev · 2018-08-02T12:47:09Z

content/post/review-building-static-analysis-tools.md

+
+## Source code analysis recap
+
+Before digging deeper into the paper, a bit of the context on what kind of analysis in general is applicable to programs. There are two main type of program analysis:


two main types

vmarkovtsev · 2018-08-02T12:47:40Z

content/post/review-building-static-analysis-tools.md

+
+Before digging deeper into the paper, a bit of the context on what kind of analysis in general is applicable to programs. There are two main type of program analysis:
+
+* *Static code analysis*, looks only at the source code, without running the program


~~looks only at~~ considers only

vmarkovtsev · 2018-08-02T12:50:46Z

content/post/review-building-static-analysis-tools.md

+Before digging deeper into the paper, a bit of the context on what kind of analysis in general is applicable to programs. There are two main type of program analysis:
+
+* *Static code analysis*, looks only at the source code, without running the program
+* *Dynamic code analysis*, when (potentially customized) program is executed and results are analyzed


Dynamic code analysis studies how a program executes over time, without paying much attention to the source code

"Results" here can be confused with analyzing stdout and written files.

vmarkovtsev · 2018-08-02T12:52:24Z

content/post/review-building-static-analysis-tools.md

+
+* *Static code analysis*, looks only at the source code, without running the program
+* *Dynamic code analysis*, when (potentially customized) program is executed and results are analyzed
+


How about mentioning a few examples of both types?

That's a great suggestion!

But as this is a cross-post (and not a completely new draft), I would at first try keep the factual content intact as much as possible. But will take another look after all other concerns are addressed

vmarkovtsev · 2018-08-02T13:24:49Z

content/post/review-building-static-analysis-tools.md

+
+It was a bunch of static HTML files, that later became a database for a web application, produced as a part of nightly build using Maven that were copied and served from a well known URL inside the company. HTMLs would contain tables of potential software defects, obtained by running multiple existing static analysis tools — [FindBugs](https://github.com/findbugsproject/findbugs), [PMD](https://pmd.github.io/) and [CheckStyle](https://github.com/checkstyle/checkstyle). Each defect was attributed to the latest change in the codebase using “git blame” and “assigned” to a particular engineer who introduced the change.
+
+It’s internal adoption numbers, although driven top-down by the management decision, were *very low* and aligned with the paper — needless to say that not many engineers were motivated enough to go to a separate [http://code-quality.company.com](http://code-quality.company.com) every day, only to find that they are #N by the amount of bugs introduced to the codebase.


It’s internal adoption numbers?..

Finally understood that you mean the Personalized Quality Dashboard. I suggest to avoid "it" here - very confusing.

go to the separate

PAIN!!! I feel sorry for your first job Alex.

vmarkovtsev · 2018-08-02T13:26:15Z

content/post/review-building-static-analysis-tools.md

+
+It’s internal adoption numbers, although driven top-down by the management decision, were *very low* and aligned with the paper — needless to say that not many engineers were motivated enough to go to a separate [http://code-quality.company.com](http://code-quality.company.com) every day, only to find that they are #N by the amount of bugs introduced to the codebase.
+
+Curiously enough, one can see some open source projects like i.e Git going though the similar stage right now i.e [https://larsxschneider.github.io/git-scan](https://larsxschneider.github.io/git-scan/) with contributors introducing language-specific analysis tools to the build profiles and publishing a dashboards with the results.


"like" or "i.e." - but not both :)

second "i.e." ...

vmarkovtsev · 2018-08-02T13:27:05Z

content/post/review-building-static-analysis-tools.md

+
+Curiously enough, one can see some open source projects like i.e Git going though the similar stage right now i.e [https://larsxschneider.github.io/git-scan](https://larsxschneider.github.io/git-scan/) with contributors introducing language-specific analysis tools to the build profiles and publishing a dashboards with the results.
+
+Despite challenges in adopting such solutions, one can also see companies, i.e [https://scan.coverity.com](https://scan.coverity.com) — a closed-sourced static analysis 


Despite the challenges

Wait. Is it "i.e." or "e.g."? https://www.grammarly.com/blog/know-your-latin-i-e-vs-e-g/

so actually you should change all "i.e."-s to "e.g."-s.

vmarkovtsev · 2018-08-02T13:29:22Z

content/post/review-building-static-analysis-tools.md

+Despite challenges in adopting such solutions, one can also see companies, i.e [https://scan.coverity.com](https://scan.coverity.com) — a closed-sourced static analysis 
+ solution for Java, C/C++, C#, JavaScript, Ruby and Python [founded in 2006](https://scan.coverity.com/about) jointly with U.S. Department of Homeland Security, being gradually adopted by some OSS projects.
+
+Companies building rule-based analysis platforms like [https://lgtm.com](https://lgtm.com), offspring of University of Oxford-based [https://semmle.com](https://semmle.com/) founded in 2007, are following this adoption path. Theri success, in my opinion, can be attributed to the fact that both support “hard” native languages like [C++](https://lgtm.com/blog/how_lgtm_builds_cplusplus).


like lgtm - an offspring of the University of Oxford-based semmle

Theri -> Their

vmarkovtsev · 2018-08-02T13:30:51Z

content/post/review-building-static-analysis-tools.md

+Companies building rule-based analysis platforms like [https://lgtm.com](https://lgtm.com), offspring of University of Oxford-based [https://semmle.com](https://semmle.com/) founded in 2007, are following this adoption path. Theri success, in my opinion, can be attributed to the fact that both support “hard” native languages like [C++](https://lgtm.com/blog/how_lgtm_builds_cplusplus).
+
+### 2. 2009 Filing bugs/Fixit
+


At this point I ran out of my time, will continue reviewing soon.

vmarkovtsev · 2018-08-07T12:07:23Z

content/post/review-building-static-analysis-tools.md

+
+### 2. 2009 Filing bugs/Fixit
+
+Next attempt introducing static analysis tools for Java, documented in the paper was filing the results of analysis as bugs in the project bug-tracking system. Then, a companywide dedicated effort was made though a “Fixit” week for all engineers to have a time to clean up those issues.


The next attempt to integrate static analysis tools for Java which was documented in the paper was filing ...

Then , the company-wide ...

though -> through

... was made in the format of a "Fix-it" week so that engineers had preallocated time to clean up ...

vmarkovtsev · 2018-08-07T12:14:59Z

content/post/review-building-static-analysis-tools.md

+
+This approach has some advantages:
+
+* it is valid scientific approach as it allows to quantify the results very well: how many of reported issues were actually fixed by developers


it is a ~~valid~~ proven

My concern is that "Proven" in this context does not seem to be used very often and also may rise unnecessary questions like "proven by whom? where is the proof?", etc

The problem with "valid" is that it assumes that there is an invalid scientific approach, which is ugly too. How about "good" or "reliable"?

vmarkovtsev · 2018-08-07T12:16:05Z

content/post/review-building-static-analysis-tools.md

+* other researchers use similar approach i.e in early “[Learning Natural Coding Conventions](https://arxiv.org/abs/1402.4182)” paper by [Miltos Allamanis](undefined) and [https://ml4code.github.io](https://ml4code.github.io/) group
+> We demonstrate that coding conventions are important to software teams, by showing that 1) empirically, programmers enforce conventions heavily through code review feedback and corrective commits, and 2) **patches that were based on NATURALIZE suggestions have been incorporated** into 5 of the most popular open source Java projects on GitHub — of the 18 patches that we submitted, 14 were accepted
+
+For organization it has a huge disadvantage though — it’s laborious and hard to scale. If conducted without a proper care, results will not only be ignored by developers, but can also contribute to overall issue-tracker value depreciation for the project.


It has a huge con/disadvantage for the organization though

vmarkovtsev · 2018-08-07T12:16:48Z

content/post/review-building-static-analysis-tools.md

+* other researchers use similar approach i.e in early “[Learning Natural Coding Conventions](https://arxiv.org/abs/1402.4182)” paper by [Miltos Allamanis](undefined) and [https://ml4code.github.io](https://ml4code.github.io/) group
+> We demonstrate that coding conventions are important to software teams, by showing that 1) empirically, programmers enforce conventions heavily through code review feedback and corrective commits, and 2) **patches that were based on NATURALIZE suggestions have been incorporated** into 5 of the most popular open source Java projects on GitHub — of the 18 patches that we submitted, 14 were accepted
+
+For organization it has a huge disadvantage though — it’s laborious and hard to scale. If conducted without a proper care, results will not only be ignored by developers, but can also contribute to overall issue-tracker value depreciation for the project.


without a proper care

contribute to the overall

vmarkovtsev · 2018-08-07T12:18:17Z

content/post/review-building-static-analysis-tools.md

+
+For organization it has a huge disadvantage though — it’s laborious and hard to scale. If conducted without a proper care, results will not only be ignored by developers, but can also contribute to overall issue-tracker value depreciation for the project.
+
+Despite that, one can see this approach been used by companies in this field i.e [American Software Safety Reliability Company](http://www.assrc.us), Atlanta-based enterprise that seems to have deep roots in software verifications and somehow [supported by DARPA](http://www.qbitlogic.com/darpa-bigcode/), to do archive the same — test some of their products like [https://www.mycode.ai](https://www.mycode.ai/) solution, that is planned to deploy across all of the U.S. Department of Defense software development divisions, i.e on [Git, popular OSS project](https://public-inbox.org/git/CAGm8dMApDdLEzeKU-h16G0NSpnuk9LMTWA29t4MxO1qcNpUvhA@mail.gmail.com/).


"to do archive the same"? did you mean achieve?

divisions -> departments?

second one taken from http://www.qbitlogic.com/darpa-bigcode/

vmarkovtsev · 2018-08-07T12:24:45Z

content/post/review-building-static-analysis-tools.md

+* presence of false-positives in FindBugs results made developers to lose confidence in the tool as a whole
+* customization of results view per-developer lead to an inconsistent view of analysis outcome
+
+## What worked & Lessons learned


Lessons -> lessons ? or do you refer to the paper's headers

vmarkovtsev · 2018-08-07T12:25:15Z

content/post/review-building-static-analysis-tools.md

+
+## What worked & Lessons learned
+
+As opposed to integrating each particular analysis tool in a different way, an internal “platform” — easily extensible and with support for many different kinds of program-analysis tools, including static and dynamic analyses, was built, known as [Tricorder project](https://research.google.com/pubs/pub43322.html).


an internal -> the internal

known as -> named as

I belive it's still correct to keep it as "an internal platform ... known as Ticoder project"

BTW how about rephrasing:

"platform" was built which is easily extensible and supports many different kinds of program-analysis tools including static and dynamic analyses. It is known as Tricorder

vmarkovtsev · 2018-08-07T12:25:56Z

content/post/review-building-static-analysis-tools.md

+
+As opposed to integrating each particular analysis tool in a different way, an internal “platform” — easily extensible and with support for many different kinds of program-analysis tools, including static and dynamic analyses, was built, known as [Tricorder project](https://research.google.com/pubs/pub43322.html).
+
+As it was taking into account all the lessons learned from the history above, it managed to re-gain the trust of users and proved to be a success inside Google.


Since it took into account

history -> stories

re-gain -> recover?

the users

vmarkovtsev · 2018-08-07T12:26:48Z

content/post/review-building-static-analysis-tools.md

+
+As it was taking into account all the lessons learned from the history above, it managed to re-gain the trust of users and proved to be a success inside Google.
+
+Paper contains few lessons like *Developer happiness is key* and *Crowdsource analysis development* that are nice, but I would rather to highlight instead a few key takeaways that seems to drive rest of the technical decisions, responsible for success of new analysis platform.


a few lessons

I would rather highlight

seond time a few, consider a synonym

seems -> seem

drive the rest

decisions which are responsible for success of the new

vmarkovtsev · 2018-08-07T12:29:00Z

content/post/review-building-static-analysis-tools.md

+Paper contains few lessons like *Developer happiness is key* and *Crowdsource analysis development* that are nice, but I would rather to highlight instead a few key takeaways that seems to drive rest of the technical decisions, responsible for success of new analysis platform.
+
+There are two main takeaways that drove the overall tooling design:
+


This time I stopped here.

vmarkovtsev · 2018-08-07T13:58:53Z

content/post/review-building-static-analysis-tools.md

+### 1. Best way to **measure a success of analysis**
+> by number of bugs fixed (or prevented), not the number of issues identified
+
+This way of measuring success have few notable implications


have -> has a few

a few -> several?

vmarkovtsev · 2018-08-07T13:59:06Z

content/post/review-building-static-analysis-tools.md

+
+This way of measuring success have few notable implications
+
+* If a tool that finds a bug, also suggests a fix - it will be much more successful using this metrics. This, by necessity, constraints the scope of possible analysis and a tooling required


the tool

these metrics or this metric

This of course/consequently constraints

scope of the possible

the tooling

vmarkovtsev · 2018-08-07T14:01:26Z

content/post/review-building-static-analysis-tools.md

+
+* If a tool that finds a bug, also suggests a fix - it will be much more successful using this metrics. This, by necessity, constraints the scope of possible analysis and a tooling required
+
+* It also means that repairing programs is important. For a discussion of tooling available for code transformation see **Technical Details** section below. Learning such modification from examples, instead of manual coding by engineers is also a bleeding edge research in academia [https://github.com/KTH/learning4repair](https://github.com/KTH/learning4repair)


See Technical Details section below for the discussion of tooling

modification -> modifications

is a bleeding edge research topic

vmarkovtsev · 2018-08-07T14:04:14Z

content/post/review-building-static-analysis-tools.md

+
+This immediately implies that **reporting issues sooner is better.**
+
+That leads to conclusion that the best bet is to integrate checks either ***directly into compilers*** - familiar tools on who’s feedback as errors and warnings developers are already relaying day to day. Or, if that is not possible, _**code review** is a good time_ for new changes — before they are commited to the version control system.


To conclude, ...

"on who’s feedback"?..

commited -> committed

👍 https://www.grammarly.com/blog/whos-whose/ fixed

vmarkovtsev · 2018-08-07T14:05:54Z

content/post/review-building-static-analysis-tools.md

+
+That leads to conclusion that the best bet is to integrate checks either ***directly into compilers*** - familiar tools on who’s feedback as errors and warnings developers are already relaying day to day. Or, if that is not possible, _**code review** is a good time_ for new changes — before they are commited to the version control system.
+
+Criterias that must hold for a ***compile time*** checks:


vmarkovtsev · 2018-08-07T14:19:28Z

content/post/review-building-static-analysis-tools.md

+
+Although it is not being disclosed, but an attentive reader might have noticed that **Compilation Index** part of the pipeline is very similar to something called [Compilation Database](https://kythe.io/docs/kythe-compilation-database.html) in open source Kythe project.
+
+It might be interesting to take a close look at example of API for AST query and of transformation for C++.


close -> closer

the example
the API

vmarkovtsev · 2018-08-07T14:20:55Z

content/post/review-building-static-analysis-tools.md

+{{% grid-cell %}}
+This callback will generate a code transformation: for the matched nodes it will replace the matching text of the function name with the “Baz”.
+
+For code transformations in Java, **Error-Prone** has a similar low-level [patching API](http://errorprone.info/docs/patching), that is very close to native AST manipulation API. Same as for the Clang, it was found to have step learning curve and thus pose a high entry barrier — even an experience engineer would need few weeks, before one can be productive creating a fix suggestions or refactorings.


~~For~~ Regarding code transformations

, that is very close

It was found to ... similar to Clang, and thus

a fix suggestions

vmarkovtsev · 2018-08-07T14:22:30Z

content/post/review-building-static-analysis-tools.md

+{{% /grid-cell %}}
+{{% /grid %}}
+
+That is why a higher-level API was built for Java first as [Refaster](https://research.google.com/pubs/pub41876.html) project, that was [integrated in Error-Prone](http://errorprone.info/docs/refaster) later.


~~the~~ higher level

~~the~~ Refaster project

~~, that~~ which

vmarkovtsev · 2018-08-07T14:23:27Z

content/post/review-building-static-analysis-tools.md

+
+That is why a higher-level API was built for Java first as [Refaster](https://research.google.com/pubs/pub41876.html) project, that was [integrated in Error-Prone](http://errorprone.info/docs/refaster) later.
+
+So a usual workflow would look like — after running all the checks and emitting a collection of suggested fixes, shard diffs to a smaller patches, run all the tests over the changes and if passed, submit for code review.


a the usual workflow is: after running ...

a smaller patches

if they are passed

vmarkovtsev · 2018-08-07T14:24:00Z

content/post/review-building-static-analysis-tools.md

+{{% center %}} … {{% /center %}}
+
+{{% center %}}
+##### Thank you for reading, stay tuned and keep you codebase healthy!


vmarkovtsev · 2018-08-07T14:24:47Z

, means removing a comma

bzz · 2018-08-08T14:46:38Z

😮 Thank you so much for a thoughtful and thorough review!
Will address asap 🚀

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

bzz · 2018-08-08T18:39:03Z

Thank you again, @vmarkovtsev feedback addressed in e9127ee

@campoy

Hey, could you break the long lines into shorter easier to review chunks?

Done, although I understand the rationale, for the plain text this requirement seems quite un-friendly to different screen/font sizes, modern editors, semantics of the text and tools like Grammarly, no? 😕

Ready for another pass.

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

campoy · 2018-08-13T17:41:18Z

Sorry about the inconvenience, @bzz.
I agree it's a pain to limit the line length but otherwise reviews become basically impossible.

Any other improvements to the current flow are welcome.

campoy · 2018-08-13T17:43:02Z

content/post/review-building-static-analysis-tools.md

+draft: true
+---
+
+A recent paper with empirical research on the application of static code analysis tool caught my attention:


s/tool/tools/ ?

campoy · 2018-08-13T17:52:22Z

content/post/review-building-static-analysis-tools.md

+and [The Morning Paper](https://blog.acolyer.org/) — I always wanted to experiment with publishing notes.
+This will be the first attempt.
+
+{{% center %}} … {{% /center %}}


consider using <hr> instead

Using of HTML tags in blog posts is strictly forbidden, suggestion on adding a shortcode for <hr> is already tracked under #230

Thanks @bzz for keeping yourself following the rules ;)
I'll try find some time next week to develop a {{% separator %}} shortcode, replacing old {{% center %}} … {{% /center %}} with the new proposed feature :D

campoy · 2018-08-13T17:52:49Z

content/post/review-building-static-analysis-tools.md

+{{% center %}} … {{% /center %}}
+
+[“Lessons from Building Static Analysis Tools at Google](https://cacm.acm.org/magazines/2018/4/226371-lessons-from-building-static-analysis-tools-at-google/fulltext)”
+by Caitlin Sadowski, Edward Aftandilian, [Alex Eagle](undefined), Liam Miller-Cushon, Ciera Jaspan


campoy · 2018-08-13T17:55:58Z

content/post/review-building-static-analysis-tools.md

+According to the paper, Google only runs simpler, *intra-procedural* type of analysis — the only one feasible
+to run at the scale of 2 billion lines of code.
+
+Interesting enough, this is somehow different from the approach taken by Facebook’s project


Interestingly

campoy · 2018-08-13T20:38:00Z

content/post/review-building-static-analysis-tools.md

+the paper:
+
+* *Large investment.* Although theoretically better and more complex analysis exists, it will require
+  the non-trivial engineering effort to scale


finish every sentence with a .

campoy · 2018-08-13T21:03:26Z

content/post/review-building-static-analysis-tools.md

+
+And those are ClangMR and JavacFlume — projects that are only briefly mentioned in this insightful paper.
+
+*That is it, thank you for reading. We will post more on papers in this field soon.*


Is this the end of the blog post?

Now it is :)

campoy · 2018-08-13T21:04:07Z

content/post/review-building-static-analysis-tools.md

+
+{{% center %}} … {{% /center %}}
+
+Now I will take a liberty and cover a few technical details that were not in the scope of the original paper


Would you be against splitting this into a second blog post?
I think the topic is by itself interesting.

Moved to #241

campoy · 2018-08-13T21:04:51Z

content/post/review-building-static-analysis-tools.md

+
+Project [Error-Prone](https://github.com/google/error-prone) is a compiler extension that is able to perform
+arbitrary analysis on the *fully typed AST*. One thing to notice is that one can not get such input by using
+only a parser even as advanced as [https://doc.bblf.sh](https://doc.bblf.sh/). Running a full build would be


use the name of the project, not the link itself [babelfish](https://doc.bblf.sh)

Same for all other links

campoy · 2018-08-13T21:05:42Z

content/post/review-building-static-analysis-tools.md

+application of those fixes to the whole codebase, called JavacFlume — which I would guess looks something like
+an Apache Spark job that applies patches in some generic format.
+
+Here is an example of how a full pipeline looks for C++


: at the end

campoy · 2018-08-13T21:06:54Z

data/authors.yml

+alex:
+  name: Alexander Bezzubov
+  thumbnail: https://avatars1.githubusercontent.com/u/5582506?s=460&v=4
+  bio: "Data engineer"


Please explain a bit more in here 😄

I like the simplicity of the description

campoy · 2018-08-13T21:07:37Z

Looking good, @bzz!

bzz · 2018-08-17T08:32:31Z

Thank you very much @campoy, appreciate your feedback! All suggestions make sense to me, will start applying.

As soon as it's done will ping you and push it to staging for preview.

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

bzz · 2018-08-29T07:27:05Z

All feedback has been addressed, deployed on staging to https://blog-staging.srcd.run/post/review-building-static-analysis-tools/

@campoy ready for another pass

bzz · 2018-09-03T10:08:44Z

@vcoisne @eiso Since Francesc went on vacations, maybe you could review this instead please?

all previous comments were addressed

bzz · 2018-09-05T11:32:11Z

@campoy friendly ping

vcoisne

@bzz Really good post ! I think in general we should try to summarize as much as possible and keep these posts on the shorter side for people who are interested but don't really have that much time to read. I would also add images and add some call to actions at the end i.e Sign up for our weekly newsletter. check out awesome mloncode list, etc

vcoisne · 2018-09-05T20:38:30Z

content/post/review-building-static-analysis-tools.md

+a static analysis tool for Java at [Google](http://twitter.com/Google), and lessons learned from the success
+story of incorporating extensible analysis framework, [Tricorder](https://research.google.com/pubs/pub43322.html),
+to development workflow at Google.
+


Do you have an image to illustrate one or both of these stories ?

no, not really

vcoisne · 2018-09-05T20:39:25Z

content/post/review-building-static-analysis-tools.md

+the paper:
+
+* *Large investment.* Although theoretically better and more complex analysis exists, it will require
+  the non-trivial engineering effort to scale


vcoisne · 2018-09-05T20:41:49Z

content/post/review-building-static-analysis-tools.md

+
+## History of integrating FindBugs at Google
+
+### 0. IDE/Editor


I think we should explicit that with an introduction sentence.

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

bzz · 2018-09-17T15:51:40Z

@campoy @vcoisne appreciate you feedback, it's been addressed, re-deployed on staging, ready for another round.

As it's been almost 2 months we are working on this, it would be nice to try to wrap it up soon.

vcoisne · 2018-09-19T03:20:46Z

LGTM I think it's ready to be published early next week.

vcoisne · 2018-09-20T17:27:58Z

@bzz scheduled for next Thursday as you can see in the "Blog & Press" google calendar I created

dpordomingo

from a technical pov, it LGTM 👍

campoy

LGTM!

eiso · 2018-09-27T10:11:46Z

Excited to see this published today.

bzz · 2018-09-27T15:46:16Z

Me to!

When it's done will also submit a medium post to the source{d} publication.

Add Alex to blog authors

6be3091

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

bzz requested review from campoy and dpordomingo July 27, 2018 08:20

First paper review post

10fe848

Review of "Lessons from building static analysis tools at Google" Signed-off-by: Alexander Bezzubov <bzz@apache.org>

bzz force-pushed the review-static-analysis branch from 762d65d to 10fe848 Compare July 27, 2018 14:18

campoy suggested changes Jul 30, 2018

View reviewed changes

vmarkovtsev suggested changes Aug 2, 2018

View reviewed changes

vmarkovtsev suggested changes Aug 7, 2018

View reviewed changes

bzz added 2 commits August 8, 2018 20:05

Address review by Vadim

e9127ee

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

Add hard wrap for long lines

ee7ec33

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

vmarkovtsev approved these changes Aug 9, 2018

View reviewed changes

Address review feedback from Grammarly

69fc30a

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

campoy suggested changes Aug 13, 2018

View reviewed changes

bzz mentioned this pull request Aug 28, 2018

[PROPOSAL] Source code transformations #241

Open

bzz added 2 commits August 28, 2018 10:16

Address @campoy review

604f3d3

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

Fix links + small re-structuring

195c2fe

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

vcoisne reviewed Sep 5, 2018

View reviewed changes

bzz mentioned this pull request Sep 17, 2018

Paper review: learning to represent programs with graphs #232

Merged

bzz added 2 commits September 17, 2018 17:44

Address @vcoisne review

5cb2b94

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

Appling confusing punctuation to lists

a5c40aa

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

Add call to actions at the end

1df97da

Signed-off-by: Alexander Bezzubov <bzz@apache.org>

dpordomingo approved these changes Sep 21, 2018

View reviewed changes

campoy approved these changes Sep 26, 2018

View reviewed changes

bzz merged commit c23a0e2 into src-d:master Sep 27, 2018

bzz deleted the review-static-analysis branch September 27, 2018 15:46


		{{% center %}} … {{% /center %}}

		[“Lessons from Building Static Analysis Tools at Google](https://cacm.acm.org/magazines/2018/4/226371-lessons-from-building-static-analysis-tools-at-google/fulltext)” by Caitlin Sadowski, Edward Aftandilian, [Alex Eagle](undefined), Liam Miller-Cushon, Ciera Jaspan presents 2 stories: history of failed attempts of integrating [FindBugs](https://github.com/findbugsproject/findbugs), a static analysis tool for Java at [Google](http://twitter.com/Google), and lessons learned from success story of incorporating extensible analysis framework, a [Tricorder project](https://research.google.com/pubs/pub43322.html), to development workflow at Google.


		## Source code analysis recap

		Before digging deeper into the paper, a bit of the context on what kind of analysis in general is applicable to programs. There are two main type of program analysis:


		Before digging deeper into the paper, a bit of the context on what kind of analysis in general is applicable to programs. There are two main type of program analysis:

		* Static code analysis, looks only at the source code, without running the program


		* Static code analysis, looks only at the source code, without running the program
		* Dynamic code analysis, when (potentially customized) program is executed and results are analyzed


		It was a bunch of static HTML files, that later became a database for a web application, produced as a part of nightly build using Maven that were copied and served from a well known URL inside the company. HTMLs would contain tables of potential software defects, obtained by running multiple existing static analysis tools — [FindBugs](https://github.com/findbugsproject/findbugs), [PMD](https://pmd.github.io/) and [CheckStyle](https://github.com/checkstyle/checkstyle). Each defect was attributed to the latest change in the codebase using “git blame” and “assigned” to a particular engineer who introduced the change.

		It’s internal adoption numbers, although driven top-down by the management decision, were very low and aligned with the paper — needless to say that not many engineers were motivated enough to go to a separate [http://code-quality.company.com](http://code-quality.company.com) every day, only to find that they are #N by the amount of bugs introduced to the codebase.


		It’s internal adoption numbers, although driven top-down by the management decision, were very low and aligned with the paper — needless to say that not many engineers were motivated enough to go to a separate [http://code-quality.company.com](http://code-quality.company.com) every day, only to find that they are #N by the amount of bugs introduced to the codebase.

		Curiously enough, one can see some open source projects like i.e Git going though the similar stage right now i.e [https://larsxschneider.github.io/git-scan](https://larsxschneider.github.io/git-scan/) with contributors introducing language-specific analysis tools to the build profiles and publishing a dashboards with the results.


		Curiously enough, one can see some open source projects like i.e Git going though the similar stage right now i.e [https://larsxschneider.github.io/git-scan](https://larsxschneider.github.io/git-scan/) with contributors introducing language-specific analysis tools to the build profiles and publishing a dashboards with the results.

		Despite challenges in adopting such solutions, one can also see companies, i.e [https://scan.coverity.com](https://scan.coverity.com) — a closed-sourced static analysis

		Companies building rule-based analysis platforms like [https://lgtm.com](https://lgtm.com), offspring of University of Oxford-based [https://semmle.com](https://semmle.com/) founded in 2007, are following this adoption path. Theri success, in my opinion, can be attributed to the fact that both support “hard” native languages like [C++](https://lgtm.com/blog/how_lgtm_builds_cplusplus).

		### 2. 2009 Filing bugs/Fixit


		### 2. 2009 Filing bugs/Fixit

		Next attempt introducing static analysis tools for Java, documented in the paper was filing the results of analysis as bugs in the project bug-tracking system. Then, a companywide dedicated effort was made though a “Fixit” week for all engineers to have a time to clean up those issues.


		This approach has some advantages:

		* it is valid scientific approach as it allows to quantify the results very well: how many of reported issues were actually fixed by developers


		For organization it has a huge disadvantage though — it’s laborious and hard to scale. If conducted without a proper care, results will not only be ignored by developers, but can also contribute to overall issue-tracker value depreciation for the project.

		Despite that, one can see this approach been used by companies in this field i.e [American Software Safety Reliability Company](http://www.assrc.us), Atlanta-based enterprise that seems to have deep roots in software verifications and somehow [supported by DARPA](http://www.qbitlogic.com/darpa-bigcode/), to do archive the same — test some of their products like [https://www.mycode.ai](https://www.mycode.ai/) solution, that is planned to deploy across all of the U.S. Department of Defense software development divisions, i.e on [Git, popular OSS project](https://public-inbox.org/git/CAGm8dMApDdLEzeKU-h16G0NSpnuk9LMTWA29t4MxO1qcNpUvhA@mail.gmail.com/).


		## What worked & Lessons learned

		As opposed to integrating each particular analysis tool in a different way, an internal “platform” — easily extensible and with support for many different kinds of program-analysis tools, including static and dynamic analyses, was built, known as [Tricorder project](https://research.google.com/pubs/pub43322.html).


		As opposed to integrating each particular analysis tool in a different way, an internal “platform” — easily extensible and with support for many different kinds of program-analysis tools, including static and dynamic analyses, was built, known as [Tricorder project](https://research.google.com/pubs/pub43322.html).

		As it was taking into account all the lessons learned from the history above, it managed to re-gain the trust of users and proved to be a success inside Google.


		As it was taking into account all the lessons learned from the history above, it managed to re-gain the trust of users and proved to be a success inside Google.

		Paper contains few lessons like Developer happiness is key and Crowdsource analysis development that are nice, but I would rather to highlight instead a few key takeaways that seems to drive rest of the technical decisions, responsible for success of new analysis platform.

		Paper contains few lessons like Developer happiness is key and Crowdsource analysis development that are nice, but I would rather to highlight instead a few key takeaways that seems to drive rest of the technical decisions, responsible for success of new analysis platform.

		There are two main takeaways that drove the overall tooling design:


		This way of measuring success have few notable implications

		* If a tool that finds a bug, also suggests a fix - it will be much more successful using this metrics. This, by necessity, constraints the scope of possible analysis and a tooling required


		* If a tool that finds a bug, also suggests a fix - it will be much more successful using this metrics. This, by necessity, constraints the scope of possible analysis and a tooling required

		* It also means that repairing programs is important. For a discussion of tooling available for code transformation see Technical Details section below. Learning such modification from examples, instead of manual coding by engineers is also a bleeding edge research in academia [https://github.com/KTH/learning4repair](https://github.com/KTH/learning4repair)


		This immediately implies that reporting issues sooner is better.

		That leads to conclusion that the best bet is to integrate checks either *directly into compilers* - familiar tools on who’s feedback as errors and warnings developers are already relaying day to day. Or, if that is not possible, _code review is a good time_ for new changes — before they are commited to the version control system.


		That leads to conclusion that the best bet is to integrate checks either *directly into compilers* - familiar tools on who’s feedback as errors and warnings developers are already relaying day to day. Or, if that is not possible, _code review is a good time_ for new changes — before they are commited to the version control system.

		Criterias that must hold for a *compile time* checks:


		Although it is not being disclosed, but an attentive reader might have noticed that Compilation Index part of the pipeline is very similar to something called [Compilation Database](https://kythe.io/docs/kythe-compilation-database.html) in open source Kythe project.

		It might be interesting to take a close look at example of API for AST query and of transformation for C++.

Paper review: lessons learned from building static analysis tools at google #233

Paper review: lessons learned from building static analysis tools at google #233

Conversation

bzz commented Jul 27, 2018 • edited Loading

campoy left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bzz Aug 8, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bzz Aug 8, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

bzz commented Jul 27, 2018 •

edited

Loading

bzz Aug 8, 2018 •

edited

Loading

bzz Aug 8, 2018 •

edited

Loading


		That is why a higher-level API was built for Java first as [Refaster](https://research.google.com/pubs/pub41876.html) project, that was [integrated in Error-Prone](http://errorprone.info/docs/refaster) later.

		So a usual workflow would look like — after running all the checks and emitting a collection of suggested fixes, shard diffs to a smaller patches, run all the tests over the changes and if passed, submit for code review.


		And those are ClangMR and JavacFlume — projects that are only briefly mentioned in this insightful paper.

		That is it, thank you for reading. We will post more on papers in this field soon.


		{{% center %}} … {{% /center %}}

		Now I will take a liberty and cover a few technical details that were not in the scope of the original paper


		## History of integrating FindBugs at Google

		### 0. IDE/Editor