Permanent syntax tree representation in background #800

codemanyak · 2020-01-20T13:55:01Z

A consistent syntax tree representation of all element texts (or just the contained expressions) ought to be provided.
This allows to concentrate on a clean and consistent syntax analysis, also in order to improve the performance. Parts of it are ready but need more elaboration. And it must be tested with regard to memory and time complexity. The repeated tokenizations and concatenations and the frantic search for a point where certain matching and replacements should ideally take place without spoiling all what had been transformed before or will have to be transformed thereafter could be avoided this way.
If in the event there will be a central point in all generators where built-in functions are to be handled then this will be a big achievement allowing the requested clear documentation. (At least for a while...)
Some of the benefits would be:

Built-in functions and procedures as well as operators can unambiguously be identified, which would improve Executor performance and generator functionality (e.g. addressing demands like in BASH-Code-Generator - improvement #237).
As the syntax trees are to be derived (or updated) only when an element is changed or on first Analyser inspection or first execution or code generation (in a lazy initialization approach), it is expected dramatically to reduce syntax analysis time (similar to the already achieved diagram drawing speed by permanently caching the highlighting patterns).
Instead of the current guesswork labyrinth there would be a structurally consistent and transparent operation.
It might be way easier to add new generators.

There are of course several challenges, too:

Nassi-Shneiderman diagrams are by design meant to be syntax-free, so there will always be texts not (or only partially) being convertible into syntax diagrams of some kind. (What to do here? Handle them as competely untranslatable text or try a partial analysis? Maintain the old complex and unstructured analysis approaches in parallel for such cases?).
Structorizer in particular supports several syntactical flavours, especially regarding explicit and implicit declarations - shall we reflect them in the syntax diagrams or canonicalize them on this occation (the latter might cause irritating Analyser reports, and Executor or generator error messages)?
How to deal with context influence on the type inference and so on (the declaration or initialization of variable already used in some existing element text might latter be inserted, which may e.g. reinterpret the meaning of some operator symbol etc.)? How to identify such impacts and their scope?
Is it better to represent an entire text line (including commands, separating keywords, and declaration stuff or just the expressions strewn into a line?

Originally posted by @codemanyak in #462 (comment)

This also relates to several internal issues.

…ighting fixed (fesch#881)

…pressionList(), Declaration.parse...()

…rsing

…tegrated in Line, first steps to type retrieval in Root

…rted)

codemanyak · 2023-11-10T00:18:25Z

Remark: Possibly it was not the best idea permanently to hold parsed lines on the elements, in particular since not all element text can be parsed and even small modifications in other elements or diagrams can invalidate any syntax tree derived on former diagram status. A very reasonable compromise seems to be to store the element text as lexically split token lists where the whitespace isn't mixed among the tokens but managed separately. This makes superfluous a lot of to-and-fro conversions, preserves original spacing without affecting token indices and it accelerates parsing a lot. On this occasion, the user-configurable "key phrases" (parser preferences) that may consist of several lexical tokens (like jusqu'à) can be represented by fix internal tokens, which make refactoring obsolete, since the internal key will always be the same, only on display and editing the user-specified keywords are to be shown. A parser preference modification will only require a drawing refresh (like with controller routine aliases). The task of Executor, Analyser and code generators will be facilitated a lot. They can make use of (ephemeral) syntax trees where it convenes. They may concentrate on the expressions embedded in the element text lines.

codemanyak self-assigned this Jan 20, 2020

codemanyak added code revision enhancement labels Jan 20, 2020

codemanyak added this to the Release 3.31 milestone Jan 20, 2020

codemanyak mentioned this issue Feb 10, 2020

Export and Reimport of C-Program #806

Closed

codemanyak mentioned this issue Apr 11, 2020

Proposal: "Variable registry" and "Data type configurator" #259

Open

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Apr 14, 2020

Syntax tree classes for issue fesch#800 added

301f3a0

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Oct 20, 2020

Some code refactoring for issue fesch#800

236b438

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Oct 30, 2020

Further refactoring for fesch#800, Expression and Line parsing advanced

4f205e9

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Nov 1, 2020

Issue fesch#800 implementation further advanced.

243435a

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Nov 1, 2020

Further refactoring steps for fesch#800, Type hierarchy advanced

b8b5da6

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Nov 1, 2020

Issue fesch#800 Alternative transmutation changed, bit operator highl…

d9eadf5

…ighting fixed (fesch#881)

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Nov 3, 2020

Extensions for the Expression and Line stuff (fesch#800)

496ce60

codemanyak added a commit that referenced this issue Nov 8, 2020

Issue #800: functions toDegrees and toRadians also converted

84c549d

codemanyak mentioned this issue Nov 12, 2020

PowerShell Code-Building #883

Open

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Dec 10, 2020

Issue fesch#800: Simplification of Type class, methods Syntax.splitEx…

52bf108

…pressionList(), Declaration.parse...()

This was referenced Jan 26, 2021

C operator mode could consistently be applied to the text in the editor #916

Open

Executor fails correctly to handle a mixed component / index access #922

Closed

codemanyak modified the milestones: Release 3.31, Release 3.32 Mar 1, 2021

codemanyak mentioned this issue Jul 20, 2021

Could I help with Octave (Matlab) language support? #981

Open

codemanyak modified the milestones: Release 3.32, Release 3.33 Sep 19, 2021

codemanyak mentioned this issue Oct 25, 2021

Declaration semantics in Structorizer has to be reassessed #980

Open

codemanyak mentioned this issue Nov 5, 2021

Support for ARM code #967

Open

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Dec 5, 2021

Issue fesch#800: More efficient lexical splitting implemented

8e35d16

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Dec 7, 2021

Issues fesch#800, fesch#920: Several modifications of scanning and pa…

a55c64a

…rsing

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Dec 13, 2021

Issue fesch#800: Tentative line parsing no longer supported, error in…

635f21b

…tegrated in Line, first steps to type retrieval in Root

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Dec 13, 2021

Issue fesch#800: Partial reorganisation of Analyser begun.

99fb51a

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Dec 22, 2021

Intermediate fesch#800 redesign stage (Analyser stuff partially conve…

2e13738

…rted)

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Jan 4, 2022

Analyser redesign according to fesch#800 in progress

ebca960

codemanyak mentioned this issue Feb 16, 2022

C-like mere declarations are not recognized as such #1029

Closed

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue May 23, 2022

Minor adaptations for fesch#800

451fcf4

codemanyak mentioned this issue Aug 2, 2022

Wish: generate multiple filenames in batch mode #1047

Closed

codemanyak mentioned this issue Aug 18, 2022

Proposal: Name completion mechanism in element editor text field #1066

Closed

codemanyak mentioned this issue Jun 27, 2023

Imported C code doesn't recognize increment operator and issues false positive warning for endless loop #1081

Open

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Nov 8, 2023

More mending for fesch#800 code revision

7762175

codemanyak mentioned this issue Nov 9, 2023

Suggestion for improvement of HLL code import if feasible #1115

Closed

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Nov 14, 2023

Line parsing modification (fesch#800)

4781d86

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Dec 18, 2023

Partial revision of Element.getHighlightUnits() for fesch#800.

a2a0319

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Feb 27, 2024

Some Generator / Expression updates for fesch#800.

e809f35

codemanyak added a commit to codemanyak/Structorizer.Desktop that referenced this issue Apr 21, 2024

Issue fesch#800: Adaptations to new internal keyword mechanism

f579545

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Permanent syntax tree representation in background #800

Permanent syntax tree representation in background #800

codemanyak commented Jan 20, 2020 •

edited

codemanyak commented Nov 10, 2023 •

edited

Permanent syntax tree representation in background #800

Permanent syntax tree representation in background #800

Comments

codemanyak commented Jan 20, 2020 • edited

codemanyak commented Nov 10, 2023 • edited

codemanyak commented Jan 20, 2020 •

edited

codemanyak commented Nov 10, 2023 •

edited