Feature/LOC #1987

idodeclare · 2018-01-25T17:21:40Z

Hello,

Please consider for integration this patch to count per-file physical lines-of-code (LOC;
Wiki) and to store LOC and total number of lines.

DirectoryListing is updated to present the counts while browsing.

Thank you.

idodeclare · 2018-01-25T17:23:10Z

I should note that this resolved issue #591.

tulinkry · 2018-01-25T17:29:04Z

I would personally prefer if all the methods mentioning LOC would have it expanded to LinesOfCode.

tulinkry · 2018-01-25T17:33:38Z

src/org/opensolaris/opengrok/index/IndexDatabase.java

@@ -272,7 +274,7 @@ public void run() {

    @SuppressWarnings("PMD.CollapsibleIfStatements")
    private void initialize() throws IOException {
-        synchronized (this) {


does this make any difference?

Just that synchronizing on public objects is suspect as it breaks encapsulation — so I revised it.

tulinkry · 2018-01-25T17:52:41Z

src/org/opensolaris/opengrok/index/IndexDatabase.java

@@ -805,7 +805,7 @@ private boolean accept(File file) {
        return !RuntimeEnvironment.getInstance().isIndexVersionedFilesOnly();
    }

-    boolean accept(File parent, File file) {
+    private boolean accept(File parent, File file) {


is this relevant? or just a cleanup?

Just a cleanup

idodeclare · 2018-01-25T18:02:28Z

I would personally prefer if all the methods mentioning LOC would have it expanded to LinesOfCode.

I was motivated to have it quite terse for the methods that are used in analyzers, since I had to edit hundreds of lines in them. Moreover since LOC is almost orthogonal to the bulk of logic in the xrefers, I wanted a tiny function name — chkLOC() — rather than seeing checkLinesOfCode() over and over (and also having to reformat possibly hundreds of newly too-long lines).

I do think "LOC" is well-known enough acronym in source code analysis; if analyzers are ever updated to support logical lines-of-code, then "LLOC" also will be well-known.

ChristopheBordieu · 2018-01-26T11:21:17Z

Excellent feature!!!
Will all the total lines and slocs be aggregated at repository root level ?

vladak · 2018-01-26T14:25:56Z

@idodeclare pls use 'fixes #591' in one the commit comments then.

idodeclare · 2018-01-26T16:10:39Z

Will all the total lines and slocs be aggregated at repository root level ?

That is certainly feasible in the future. Right now, only files have document data in the Lucene index. To store aggregated num-lines and LOC, it will be necessary to store Lucene data for directories — but without conflicting with the existing queries that all assume only file-level data.

One straight-forward idea would be to add Lucene "documents" for directories with wholly-separate fields so that these new documents never appear in current queries. Of course it would also be necessary to write an aggregator that runs at the appropriate stages.

vladak · 2018-01-26T16:18:33Z

This also brings a question about how to use these counts for restricting searches. Dne 26. 1. 2018 5:10 odp. napsal uživatel "C Fraire" < notifications@github.com>:

…

Will all the total lines and slocs be aggregated at repository root level ? That is certainly feasible in the future. Right now, only files have document data in the Lucene index. To store aggregated num-lines and LOC, it will be necessary to store Lucene data for directories — but without conflicting with the existing queries that all assume only file-level data. One straight-forward idea would be to add Lucene "documents" for directories with wholly-separate fields so that these new documents never appear in current queries. Of course it would also be necessary to write an aggregator that runs at the appropriate stages. — You are receiving this because you commented. Reply to this email directly, view it on GitHub <#1987 (comment)>, or mute the thread <https://github.com/notifications/unsubscribe-auth/ACzGDNoySlziaA8OuXLnX7uKUe2F05q5ks5tOfkBgaJpZM4RtMrk> .

idodeclare · 2018-01-26T17:02:27Z

This also brings a question about how to use these counts for restricting
searches.

Yes, indeed.

Relatedly, I did not update to show the counts in search results, as I did not have any idea how best to present them.

tarzanek · 2018-02-12T14:36:38Z

I am merging this as is, thank you Chris, it looks good

idodeclare · 2018-02-12T15:32:05Z

Thanks, @tarzanek !

vladak · 2018-02-13T09:31:47Z

I wonder how this will cope with incremental indexing. Will pre-existing files have LOC set to 0 ? Or is full reindex required ?

idodeclare · 2018-02-13T14:02:55Z

Pre-existing documents will have blank LOC/#Lines — the same as for files which do not get an xref (e.g. Ignored/FileAnalyzer) or for files that do not have a counting xref (e.g. UuencodeAnalyzer).

Incrementally re-indexed files can get LOC/#Lines. Full re-index is required for completeness.

idodeclare added 6 commits January 25, 2018 10:47

Privatize IndexDatabase.accept(File, File), formerly package-private

a57216d

Use a private instance lock. Fix up some other syntax.

3a99504

Revise #dirlist's default sortInitialOrder: "desc", except for "Name"

eb9ed6a

Store numlines if Xref is used while analyzing

7c9872f

Store LOC if Xref is used while analyzing

3a8a81e

Use DirectoryExtraReader to supplement filesystem metadata

0a0e955

tulinkry reviewed Jan 25, 2018

View reviewed changes

Fixes oracle#591

cb82a36

tarzanek added this to the 1.1 milestone Feb 5, 2018

tarzanek merged commit a7351f5 into oracle:master Feb 12, 2018

tarzanek assigned vladak Feb 12, 2018

idodeclare deleted the feature/loc branch February 12, 2018 15:00

vladak mentioned this pull request Feb 13, 2018

allow to constrain search based on LOC #2010

Open

idodeclare mentioned this pull request Mar 7, 2018

Store TABSIZE in a supplementary document for each IndexDatabase #2035

Merged

TotoXe mentioned this pull request Jun 29, 2020

Aggregate lines and LOC numbers at directory level whatever depth it is #3168

Open

Feature/LOC #1987

Feature/LOC #1987

Uh oh!

Conversation

idodeclare commented Jan 25, 2018

Uh oh!

idodeclare commented Jan 25, 2018

Uh oh!

tulinkry commented Jan 25, 2018

Uh oh!

tulinkry Jan 25, 2018

Choose a reason for hiding this comment

Uh oh!

idodeclare Jan 25, 2018

Choose a reason for hiding this comment

Uh oh!

tulinkry Jan 25, 2018

Choose a reason for hiding this comment

Uh oh!

idodeclare Jan 25, 2018

Choose a reason for hiding this comment

Uh oh!

idodeclare commented Jan 25, 2018

Uh oh!

ChristopheBordieu commented Jan 26, 2018

Uh oh!

vladak commented Jan 26, 2018

Uh oh!

idodeclare commented Jan 26, 2018

Uh oh!

vladak commented Jan 26, 2018 via email

Uh oh!

idodeclare commented Jan 26, 2018

Uh oh!

tarzanek commented Feb 12, 2018

Uh oh!

idodeclare commented Feb 12, 2018

Uh oh!

vladak commented Feb 13, 2018

Uh oh!

idodeclare commented Feb 13, 2018

Uh oh!

Uh oh!