FE: Implement FE service #73

smacker · 2018-03-16T16:36:38Z

Part of #51

Signed-off-by: Maxim Sukharev <maxim@sourced.tech>

carlosms · 2018-03-20T17:57:19Z

Should we add the python tests to travis in this PR?

smacker · 2018-03-20T18:11:42Z

@carlosms what do you mean? they are added. Do you want to remove them? (I don't)

carlosms · 2018-03-20T18:16:49Z

My bad, I scanned the PR quickly thought your changes were testing the docker build only.

bzz · 2018-03-21T07:06:35Z

.travis.yml

@@ -54,6 +54,12 @@ matrix:
    env: STYLE_CHECK=true
    script: ./sbt scalastyle

+  - scala: 2.11.2
+    script:


It's minor, but for convenience and consistency, could you please add env section to this build with something descriptive and human-readable, like FE_PYTHON_TEST=true ?

It does not mean much yet, but is super-helpful with TravisCI, when a project has many profiles, to quickly identify the purpose of the failing one.

Like i.e here one would be able to tell that it's only Integration Tests failing for this job.

bzz · 2018-03-21T07:11:19Z

Looks awesome, a quick question - is there a reproducible way how one can generate/update the fixtures for the FE?

bzz · 2018-03-21T08:39:03Z

src/main/python/server.py

+        """Extract identifiers weighted set"""
+
+        extractor = IdentifiersBagExtractor(
+            docfreq_threshold=request.docfreqThreshold, split_stem=request.splitStem)


From what I can see, all BagsExtractors, of which IdentifiersBagExtractor is a particular kind, have a weight parameter - do you think it should be added on our side as well?

AFAIK it's something that most probably going to be used as part of parameter adjustment as soon as the values are inferred from a labeled data, provided by 🐈 .

oh. I'm not sure how I missed it in the task about creating proto files. My bad.

Do you mind if I add this parameter in a separate PR? It should be in all extractors and requires changes in proto & scala.

bzz · 2018-03-21T08:44:10Z

src/main/python/server.py

+        """Extract uast2seq weighted set"""
+
+        extractor = UastSeqBagExtractor(
+            docfreq_threshold=request.docfreqThreshold)


Same situation with arguments of the base algorithm class here , allthough these guys token2index and token_parser class name might be of less value right now and could be added later, as soon as they are really needed.

What do you think?

I found only 1 implementation for token parser and token index in ml lib. So I decided, for now, we don't need additional parameters there.

bzz · 2018-03-21T08:47:25Z

src/main/python/server.py

 if __name__ == '__main__':
    parser = argparse.ArgumentParser(description='Feature Extractor Service.')
    parser.add_argument("--port", type=int, default=9001,
                        help="server listen port")
    args = parser.parse_args()

+    # sourced-ml expects PYTHONHASHSEED != random or unset
+    if os.getenv("PYTHONHASHSEED", "random") == "random":


As far as I can see, these are src-d/ml implementation details, relevant to some extractors.

What do you think - on our side, would it be wise to be more careful and make put them as part of the API and pass them as explicit parameters of RPC request?

This could help to decouple a library API with it's runtime environment, which is a good practice anyway.

Nope. This parameter changes how python interpreter works. You can’t change it in runtime. That’s why the check is in __main__.

Got it. Reading https://docs.python.org/3.3/using/cmdline.html#envvar-PYTHONHASHSEED helped to understand it better, which would be nice to have somewhere (i.e in that comment above the code)

there is a comment already. 2 lines below.

Signed-off-by: Maxim Sukharev <maxim@sourced.tech>

smacker · 2018-03-21T11:47:25Z

You can generate a fixture using sdk. The command for it was added here bblfsh/sdk#249

all comments were addressed (by comments). @bzz could you please take another look? I see only 1 problem: weight param. But I strongly prefer to add it in separate PR to avoid conflicts.

bzz · 2018-03-21T12:04:45Z

You can generate a fixture using sdk. The command for it was added here bblfsh/sdk#249

Really, really nice, thank you for linking 👍 ! I only wish it could be documented somewhere, i.e same as code generation from .proto, so next person has to update the tests or something like that - it saves time for a him/her.

but I strongly prefer to add it in separate PR to avoid conflicts.

sounds good to me

bzz

LGTM, sans very minor documentation issue on test fixture generation.

smacker · 2018-03-21T12:26:13Z

I'm not sure about the rightest place to put info about fixtures for now.

I believe I would need fixtures for integration test scala-python (last task in umbrella issue), it might change the location of fixtures or I'll create some script. I'll document how to generate fixtures in that task when figure out the best way to do it.

Implement Identifiers extractor

cbbd99b

Signed-off-by: Maxim Sukharev <maxim@sourced.tech>

bzz mentioned this pull request Mar 16, 2018

FE: feature extractors #51

Closed

7 tasks

smacker mentioned this pull request Mar 20, 2018

Add YAPF lint and format to makefile #75

Merged

smacker added 3 commits March 20, 2018 14:27

Implement Literals extractor

3d1bdc7

Signed-off-by: Maxim Sukharev <maxim@sourced.tech>

Implement Uast2seq extractor

8d0c115

Signed-off-by: Maxim Sukharev <maxim@sourced.tech>

Add dockerfile and travis tests

d3ceaa5

Signed-off-by: Maxim Sukharev <maxim@sourced.tech>

smacker changed the title ~~[WIP] FE: Implement FE service~~ FE: Implement FE service Mar 20, 2018

carlosms approved these changes Mar 20, 2018

View reviewed changes

bzz reviewed Mar 21, 2018

View reviewed changes

add FE_PYTHON_TEST variable in travis

cc68f49

Signed-off-by: Maxim Sukharev <maxim@sourced.tech>

bzz approved these changes Mar 21, 2018

View reviewed changes

smacker merged commit 28277f3 into src-d:master Mar 21, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FE: Implement FE service #73

FE: Implement FE service #73

smacker commented Mar 16, 2018 •

edited

Loading

carlosms commented Mar 20, 2018

smacker commented Mar 20, 2018 •

edited

Loading

carlosms commented Mar 20, 2018

bzz Mar 21, 2018

bzz commented Mar 21, 2018 •

edited

Loading

bzz Mar 21, 2018

smacker Mar 21, 2018

bzz Mar 21, 2018

smacker Mar 21, 2018

bzz Mar 21, 2018 •

edited

Loading

smacker Mar 21, 2018 •

edited

Loading

bzz Mar 21, 2018 •

edited

Loading

smacker Mar 21, 2018

smacker commented Mar 21, 2018

bzz commented Mar 21, 2018

bzz left a comment

smacker commented Mar 21, 2018

FE: Implement FE service #73

FE: Implement FE service #73

Conversation

smacker commented Mar 16, 2018 • edited Loading

carlosms commented Mar 20, 2018

smacker commented Mar 20, 2018 • edited Loading

carlosms commented Mar 20, 2018

bzz Mar 21, 2018

Choose a reason for hiding this comment

bzz commented Mar 21, 2018 • edited Loading

bzz Mar 21, 2018

Choose a reason for hiding this comment

smacker Mar 21, 2018

Choose a reason for hiding this comment

bzz Mar 21, 2018

Choose a reason for hiding this comment

smacker Mar 21, 2018

Choose a reason for hiding this comment

bzz Mar 21, 2018 • edited Loading

Choose a reason for hiding this comment

smacker Mar 21, 2018 • edited Loading

Choose a reason for hiding this comment

bzz Mar 21, 2018 • edited Loading

Choose a reason for hiding this comment

smacker Mar 21, 2018

Choose a reason for hiding this comment

smacker commented Mar 21, 2018

bzz commented Mar 21, 2018

bzz left a comment

Choose a reason for hiding this comment

smacker commented Mar 21, 2018

smacker commented Mar 16, 2018 •

edited

Loading

smacker commented Mar 20, 2018 •

edited

Loading

bzz commented Mar 21, 2018 •

edited

Loading

bzz Mar 21, 2018 •

edited

Loading

smacker Mar 21, 2018 •

edited

Loading

bzz Mar 21, 2018 •

edited

Loading