Add synthetic frame for lxml schema init #17

PapaPedro · 2021-03-01T12:53:34Z

Creating schema in lxml can be a performance issue if we always recreate
a schema instead of reusing it.

Unfortunately this code is cython and does not appear in the flame
graph.

Tested with a local script.

Issue #, if available:

Description of changes:
This change searches for etree.XMLSchema in the source code
line to add a synthetic frame for it to appear in the graph.

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

gimki · 2021-03-01T16:47:46Z

codeguru_profiler_agent/sampling_utils.py

    line = linecache.getline(frame.f_code.co_filename, line_no).strip()
    if "sleep(" in line:
        result.append(TIME_SLEEP_FRAME)
+    elif "etree.XMLSchema" in line:


should it be etree.XMLSchema( as we include the open bracket in sleep(

we would miss this if user had:

from etree import XMLSchema

a = XMLSchema()

Should we cover that case too and simply use XMLSchema? How likely would be gives us false positive in matching line like ComplexXMLSchema (<-- if that exists)?

Good point about the parenthesis, will add it.

Yes you are right about missing out on from etree import XMLSchema but searching for XMLSchema could give many false positive and according to what I have seen in existing code I believe most people would do etree.XMLSchema as provided in the online documentation. I prefer to miss out on a few cases rather have false positives which could be very confusing.

Creating schema in lxml can be a performance issue if we always recreate a schema instead of reusing it. Unfortunately this code is cython and does not appear in the flame graph, this change searches for `etree.XMLSchema(` in the source code line to add a synthetic frame for it to appear in the graph. Tested with a local script.

gimki · 2021-03-01T18:59:47Z

codeguru_profiler_agent/sampling_utils.py

 TRUNCATED_FRAME = Frame(name="<Truncated>")

 TIME_SLEEP_FRAME = Frame(name="<Sleep>")
+LXML_SCHEMA_FRAME = Frame(name="lxml.etree:XMLSchema:__init__")


Question: Should we follow the format of <...> for synthetic frame? For example, lxml.etree:XMLSchema:__init__ here?

Contrary to the <Sleep>, I want this frame to remain in the flamegraph for customers to see it so I have put the frame as it would appear normally in a stack trace (plus the ":XMLSchema:" class name that we usually add). I considered creating a Frame object with all the different attributes but that means I have to build a fake file name that would give the appropriate result once serialized, I found it simpler and clearer to directly put the frame as I want the agent to report it.

Ok. We could always review this later :D.

gimki reviewed Mar 1, 2021

View reviewed changes

PapaPedro force-pushed the lxml_synthetic_frame branch from 87092e1 to 79bbafe Compare March 1, 2021 18:14

gimki reviewed Mar 1, 2021

View reviewed changes

gimki approved these changes Mar 2, 2021

View reviewed changes

gimki merged commit 6ed00bd into main Mar 2, 2021

PapaPedro deleted the lxml_synthetic_frame branch March 5, 2021 11:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add synthetic frame for lxml schema init #17

Add synthetic frame for lxml schema init #17

Uh oh!

PapaPedro commented Mar 1, 2021

Uh oh!

gimki Mar 1, 2021

Uh oh!

PapaPedro Mar 1, 2021

Uh oh!

gimki Mar 1, 2021

Uh oh!

PapaPedro Mar 1, 2021 •

edited

Loading

Uh oh!

gimki Mar 2, 2021

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add synthetic frame for lxml schema init #17

Add synthetic frame for lxml schema init #17

Uh oh!

Conversation

PapaPedro commented Mar 1, 2021

Uh oh!

gimki Mar 1, 2021

Choose a reason for hiding this comment

Uh oh!

PapaPedro Mar 1, 2021

Choose a reason for hiding this comment

Uh oh!

gimki Mar 1, 2021

Choose a reason for hiding this comment

Uh oh!

PapaPedro Mar 1, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

gimki Mar 2, 2021

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

PapaPedro Mar 1, 2021 •

edited

Loading