Are developers with high participation/degree more likely to have missed vulnerabilities? #245

kbaumzie · 2016-03-04T17:53:36Z

Used vuln_misses for this. Take a look at Spearman's rank correlation coefficient. Figure out what the coefficients mean, then report them here. Search for "spearman" in our code base to see how we use it.

kbaumzie · 2016-03-06T22:20:21Z

During a Google Tech Talk that I attended this past week, a Google developer described the procedure for committing, owning, and participating in code and code reviews. One interesting thing he noted was that when a developer leaves Google (quits, etc.), someone takes over ownership of their files. I am not sure how this could affect our data, or even how to measure who is eligible to take over ownership of a file. If this is the case, would ownership of a new file increase the new owner's degree?

This also makes me wonder who carries the blame for vulnerabilities missed on these files if they were once owned by a different developer.

kbaumzie · 2016-03-16T16:43:46Z

Take a look at Pearson as well (note: Spearman, being rank-based, is the one that is less sensitive to outliers)
High degree -> high betweenness

Do a rake run with R on the console
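
To see how the two coefficients react to an outlier, here is a minimal sketch on made-up data (the values are illustrative only and do not come from developer_snapshots). The relationship is perfectly monotonic, but one developer has an extreme degree:

```r
# Made-up data: vuln_misses rises steadily with degree,
# and the last developer has an extreme degree value.
degree      <- c(1, 2, 3, 4, 5, 6, 7, 8, 9, 100)
vuln_misses <- c(0, 1, 2, 3, 4, 5, 6, 7, 8, 9)

# Pearson works on the raw values, so the outlier pulls it down.
pearson <- cor(degree, vuln_misses, method = "pearson")

# Spearman works on ranks; the ranks agree perfectly, so it is 1.
spearman <- cor(degree, vuln_misses, method = "spearman")

print(c(pearson = pearson, spearman = spearman))
```

Reporting both side by side, as in this sketch, makes it easier to tell whether a few extreme developers are driving a result.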

kbaumzie · 2016-04-06T16:54:40Z

Correlations of vuln_misses with betweenness, degree, and closeness were found to be strong. This challenges what we have been researching: we have now found that being more central is associated with a higher count of vulnerability misses. My next step is to compute perc_vuln_misses (percentage of vulnerabilities missed), i.e. vuln_misses/participation, to see missed vulnerabilities per developer, per period.

After this, we should include vuln_misses in our code reviews table, both as a count and as a boolean. Be careful not to count the same vulnerability twice (use distinct). This allows us to look at other metrics in the given code review.
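
The ratio could be computed along these lines. This is only a sketch: the column names follow the thread, the values are made up, and recording NA for developers with zero participation is an assumption rather than a decision made in this issue.

```r
# Hypothetical per-developer, per-period rows (values are made up).
dev_snap <- data.frame(
  vuln_misses   = c(0, 2, 1, 0),
  participation = c(10, 8, 4, 0)
)

# A developer with zero participation has no defined rate, so record NA
# instead of letting 0/0 produce NaN.
dev_snap$perc_vuln_misses <- ifelse(
  dev_snap$participation > 0,
  dev_snap$vuln_misses / dev_snap$participation,
  NA_real_
)

print(dev_snap)
```

Making the zero-participation case explicit keeps later correlation code from silently inheriting NaN values.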
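
As an illustration of the distinct count, here is a small R sketch. The table and column names (reviews, misses, review_id, vuln_id) are hypothetical, and the project may well do this in SQL with DISTINCT instead.

```r
# Hypothetical code reviews table.
reviews <- data.frame(review_id = c(1, 2, 3))

# Hypothetical (review, vulnerability) link rows; the repeated (1, "A") row
# mimics a join that reports the same vulnerability twice.
misses <- data.frame(
  review_id = c(1, 1, 1, 2),
  vuln_id   = c("A", "A", "B", "A")
)

# Drop duplicate rows so each vulnerability counts once per review.
distinct_misses <- unique(misses)

# Count per review; the factor levels keep reviews with zero misses.
counts <- table(factor(distinct_misses$review_id, levels = reviews$review_id))

reviews$vuln_misses <- as.integer(counts)       # by count
reviews$missed_vuln <- reviews$vuln_misses > 0  # by boolean

print(reviews)
```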

kbaumzie · 2016-04-10T18:06:00Z

dev_analysis.rb currently references an incorrect variable name in our developer_snapshots table:
perc_missed_vuln
It should be changed to perc_vuln_misses after @sso7159 makes the same rename in devCollaboration.py.

Included changes to perc_missed_vuln:

spearman_percVM_deg <- cor(dev_snap$perc_missed_vuln, dev_snap$degree, method="spearman")
spearman_percVM_sher <- cor(dev_snap$perc_missed_vuln, dev_snap$sheriff_hrs, method="spearman")
spearman_percVM_close <- cor(dev_snap$perc_missed_vuln, dev_snap$closeness, method="spearman")
spearman_percVM_bet <- cor(dev_snap$perc_missed_vuln, dev_snap$betweenness, method="spearman")

Running the file via rake run:dev produces an error: R reports that the correlation is not a number (NaN). Any thoughts as to why this is happening? My understanding is that Spearman correlation is rank-based, so it should be able to correlate variables on different scales, such as a percentage and a float value.
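
One way to narrow this down is to check the inputs before calling cor. The sketch below uses a made-up stand-in for dev_snap; the NaN entry mimics a 0/0 division upstream, which is an assumption about the cause, not a confirmed diagnosis.

```r
# Made-up stand-in for dev_snap; the NaN mimics a 0/0 division upstream.
dev_snap <- data.frame(
  perc_missed_vuln = c(0.10, NaN, 0.25, 0.00, 0.50),
  degree           = c(3, 7, 12, 1, 20)
)

# How many values are NA, NaN, or infinite?
n_bad <- sum(!is.finite(dev_snap$perc_missed_vuln))

# A constant column also breaks correlation: its standard deviation is zero.
is_constant <- sd(dev_snap$perc_missed_vuln, na.rm = TRUE) == 0

# With the default settings, one missing value makes the result missing.
rho <- cor(dev_snap$perc_missed_vuln, dev_snap$degree, method = "spearman")

print(c(n_bad = n_bad, is_constant = is_constant, rho_missing = is.na(rho)))
```

If n_bad is nonzero, the missing values are the likely culprit; if is_constant is TRUE, the column carries no rank information to correlate.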

andymeneely · 2016-04-13T16:56:37Z

Use this: https://stat.ethz.ch/R-manual/R-devel/library/stats/html/cor.html

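cor() has no na.rm argument; missing values are handled through its use argument, for example:

use="complete.obs"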
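
A minimal sketch of how the use argument of cor changes the result, on made-up data (the values are illustrative, not taken from developer_snapshots):

```r
# Made-up stand-in for dev_snap with one undefined percentage.
dev_snap <- data.frame(
  perc_missed_vuln = c(0.10, NaN, 0.25, 0.00, 0.50),
  degree           = c(12, 7, 3, 1, 20)
)

# Default (use = "everything"): a single missing value makes the result missing.
rho_default <- cor(dev_snap$perc_missed_vuln, dev_snap$degree,
                   method = "spearman")

# use = "complete.obs" drops rows with a missing value in either column
# before ranking.
rho_complete <- cor(dev_snap$perc_missed_vuln, dev_snap$degree,
                    method = "spearman", use = "complete.obs")

print(c(default = rho_default, complete = rho_complete))
```

Dropping rows changes the sample, so it is worth reporting how many developers were excluded alongside the coefficient.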

kbaumzie self-assigned this Mar 6, 2016

kbaumzie mentioned this issue Apr 3, 2016

Do manual investigation of non-central sheriffs and central non-sheriffs #244

Closed

kbaumzie closed this as completed Sep 7, 2016
