Formula helpers #24

wojciechczerniak · 2019-11-17T16:36:12Z

Description

The point is to provide a public API for the methods we already have internally:

 public calculateFormula(formula: string, cellPosition: SimpleCellAddress): any;
 public validateFormula(formula: string): boolean; 
 public normalizeFormula(formula: string): string;

While normalize could be a static method, validate and calculate need the context of the workbook.

Use case

A full use case is described in #17, but let's link to the screenshot once again:

Events

Extracted to #135

Problem?

~~The formula is stored outside of the graph. We won't know if it should be updated. We would have to recalculate each time something changes in the workbook.~~

~~Is there any alternative? Should we register the formula in HyperFormula so it would become a part of the graph?~~

Solved by @bardek8! Formulas will be added to a sheet indexed at -1, so they will be part of the graph and the change lists.


wojciechczerniak · 2020-02-05T12:58:38Z

Once registered, the formula should return an address so we can watch for the #135 events that are related to it.
And we need an API to remove this formula from the graph when the external widget is destroyed.
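A minimal sketch of the register/remove lifecycle described above. `ExternalFormulaRegistry`, its method names and the address shape are hypothetical and only illustrate the idea; this is not HyperFormula code.

```typescript
// Hypothetical address shape; sheet -1 is the internal sheet mentioned in the description.
interface SimpleCellAddress { sheet: number; col: number; row: number }

const INTERNAL_SHEET = -1

class ExternalFormulaRegistry {
  private nextRow = 0
  private readonly formulas = new Map<number, string>()

  // Registers a formula and returns the address a caller could watch for changes.
  register(formula: string): SimpleCellAddress {
    const row = this.nextRow++
    this.formulas.set(row, formula)
    return { sheet: INTERNAL_SHEET, col: 0, row }
  }

  // Removes the formula, e.g. when the external widget is destroyed.
  // Returns false if nothing is registered at that address.
  unregister(address: SimpleCellAddress): boolean {
    if (address.sheet !== INTERNAL_SHEET || address.col !== 0) return false
    return this.formulas.delete(address.row)
  }

  get size(): number { return this.formulas.size }
}

const registry = new ExternalFormulaRegistry()
const statusBarSum = registry.register('=SUM(Sheet1!A1:A10)')
registry.unregister(statusBarSum)
```

The returned address doubles as the handle for removal, so the caller does not need a separate identifier.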

swistak35 · 2020-02-07T08:01:34Z

I assume that normalizing a formula means returning it unparsed from the AST (so, for example, with all function names capitalized). I also assume that validation means validating the syntax of the entered formula.

I think that using multiple sheets (one sheet per formula) would be better, because it will work with matrices. If we added one temporary formula in A1 and then another in A2, but A1 were a matrix, things would probably start breaking :) So in the end, maybe only a single identifier would be needed (a negative number representing a sheet) instead of a whole address (because the address would always be A1). But I guess it does not matter much to you whether it's one sheet or multiple ones.

swistak35 · 2020-02-07T10:09:05Z

We've discussed it, and maybe one worksheet would indeed be better, provided we assume that matrix formulas are not allowed in these temporary formulas. Is that OK for you?


* Add initial version of calculateFormula()
  Issue: #24
* Add test which checks whether external formula is recomputed
* Add possibility to change contents of external formula
* Add possibility to normalize formula
* Refactoring: Introduce ChangeList type
* Add test ensuring that it works for more formulas
* Refactoring: Split test suite into two
* Add basic validation of external formulas
* Add documentation
* External formula is not valid if it has lexical errors
* Literal error is ok (if that's what user want)
* Extract NamedReferences module
* Refactoring: Rename HyperFormula#calculateFormula -> HyperFormula#addNamedExpression
* Add ability to retrieve value of named expression by it's name
  Temporarily changing named expression formula is forbidden, need to do new API for that which does not use -1 sheet internal.
* Allow to add only one expression for given name
* Add assertion ensuring that NamedExpression won't contain duplicates
* Add basic validation of named expressions
* Refactoring: Rename NamedExpressionAlreadyTaken -> NamedExpressionNameIsAlreadyTaken
* Refactoring: Extract error classes out to separate file
  They still belongs to the responsibility of the HyperFormula unit, but majority of this code is just formatting the error. So I think it's cleaner when they are in separate file.
* Refactoring: Remove unused return value
* Refactoring: Rename ivar NamedExpressions#nextExternalFormulaId -> nextNamedExpressionRow
* Make it possible to retrieve value of non existing named expression
* Add API for removing named expressions
* Refactoring: extract NamedExpressions#buildAddress method
* Refactoring: Remove magic constant
* SparseStrategy will not leak memory in the long run for named expressions
* Allow again to change formulas in named expressions
* Refactoring
* Does not allow to change not existing named expression
* Add possibility to list existing named expressions
* Prerefactoring: Extract class NamedExpression
* Remove unnecessary bang
* Add assertion in NamedExpressions#changeNamedExpressionFormula
* Make named expressions names to be case insensitive
* Refactoring: Extract a NamedExpressionsStore
* Refactoring: Extract method NamedExpressions#storeFormulaInCell
* No longer workarounds for keeping sheet = -1
* Refactoring: Use ChangeList instead of CellValueChange[]
  Because in a moment I want to add different kind of element to ChangeList -- the one which will tell that named expression has changed.
* lint-fix
* Add documentation
* Fixes after merge conflict
* Resolving conflicts is so. much. fun.

Co-authored-by: Jakub Kowalski <kowalski_jakub@hotmail.com>

wojciechczerniak · 2020-02-19T11:39:04Z

@swistak35, I know we said those are basically the same as named expressions, but I would keep the calculateFormula method in a much simpler form: without registering the formula, tracking changes, or being able to reference it. Calculate the result of =A1+1 and forget. It will be useful for a status bar SUM or live calculations in a formula editor.

Also, I didn't have a chance to do a review, so here are my comments:

Current addNamedExpression implementation does not support additional arguments

hyperformula/test/named-expressions.spec.ts

Line 12 in be57f95

engine.addNamedExpression('myName', '=Sheet1!A1+10')

We know that we will need more details about the expression: { value, scope, comment, visibility } so we should be ready for the object and avoid breaking changes in the near future. Or we can add config after the value: addNamedExpression(name, value, { scope, comment, visibility })
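A minimal sketch of the second variant (config after the value). The field names come from the comment above; their types, the `NamedExpressionsSketch` class and the stored shape are assumptions for illustration, not the actual implementation.

```typescript
// Hypothetical metadata; the types of these fields are assumed.
interface NamedExpressionOptions {
  scope?: number        // assumed: a sheet id; undefined means workbook-wide
  comment?: string
  visibility?: boolean
}

interface StoredNamedExpression extends NamedExpressionOptions {
  name: string
  value: string | number | boolean
}

class NamedExpressionsSketch {
  private readonly items: StoredNamedExpression[] = []

  // The third parameter is optional, so existing two-argument calls keep working
  // when new metadata fields are introduced later.
  addNamedExpression(
    name: string,
    value: string | number | boolean,
    options: NamedExpressionOptions = {},
  ): StoredNamedExpression {
    const item: StoredNamedExpression = { name, value, ...options }
    this.items.push(item)
    return item
  }
}

const sketch = new NamedExpressionsSketch()
const plain = sketch.addNamedExpression('myName', '=Sheet1!A1+10')
const withMeta = sketch.addNamedExpression('rate', 0.23, { scope: 0, comment: 'VAT rate' })
```

With this shape, adding another metadata field later only extends `NamedExpressionOptions` and does not change the call signature.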

hyperformula/test/named-expressions.spec.ts

Lines 44 to 46 in be57f95

    
    expect(() => {
      engine.addNamedExpression('myName', '=Sheet1!A1+10')
    }).toThrowError(`Name of Named Expression 'myName' is already present in the workbook`)

This depends on the scope. We can have one global name but each sheet can have its own.
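A minimal sketch of scope-aware uniqueness, assuming a hypothetical `ScopedNames` helper in which a scope is either the workbook or a sheet id. Names are compared case-insensitively, following the commit list above; everything else is an assumption.

```typescript
type Scope = 'workbook' | number // assumed: a number stands for a sheet id

class ScopedNames {
  private readonly taken = new Set<string>()

  // The key combines scope and lowercased name, so the same name
  // may exist once per scope.
  private key(name: string, scope: Scope): string {
    return `${scope}:${name.toLowerCase()}`
  }

  add(name: string, scope: Scope = 'workbook'): void {
    const key = this.key(name, scope)
    if (this.taken.has(key)) {
      throw new Error(`Name of Named Expression '${name}' is already present in this scope`)
    }
    this.taken.add(key)
  }
}

const names = new ScopedNames()
names.add('myName')    // workbook scope
names.add('myName', 0) // same name on sheet 0: allowed
```

Adding 'myName' to the workbook scope a second time would throw, while each sheet can still define its own 'myName'.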

hyperformula/test/named-expressions.spec.ts

Line 81 in be57f95

engine.changeNamedExpressionFormula('myName', '=Sheet1!A1+11')

We can edit much more than the formula. IMO the name changeNamedExpressionFormula is misleading: the value can be a number, boolean or string. At least I hope so; I don't see a test for that.

Once we have scope and other metadata, an updateNamedExpression(name, { scope, ... }) method will be much more useful.

What happens when we remove a named expression? From what I see its address is lost forever. We always increment it and will never reuse a row. Can it create a problem for us when multiple named expressions are created and removed over time?

Just a random thought: I would move

hyperformula/src/HyperFormula.ts

Lines 123 to 124 in be57f95

    
    this.namedExpressions = new NamedExpressions(this.cellContentParser, this.dependencyGraph, this.parser)
    this.addressMapping.addSheet(-1, new SparseStrategy(0, 0))

To a private helper method initializeNamedExpressions, so someone in the future would know straight away that the -1 sheet is created for them and that the two have to go together when refactoring.
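A minimal sketch of that refactoring, using stub classes in place of the real ones. `EngineSketch`, `AddressMappingStub` and `NamedExpressionsStub` are hypothetical stand-ins; only the method name initializeNamedExpressions and the -1 sheet come from the comment.

```typescript
// Stand-in for the real address mapping; it only records which sheets exist.
class AddressMappingStub {
  readonly sheets = new Set<number>()
  addSheet(sheetId: number): void { this.sheets.add(sheetId) }
}

// Stand-in for the real named expressions store.
class NamedExpressionsStub {
  constructor(readonly sheetId: number) {}
}

class EngineSketch {
  private static readonly NAMED_EXPRESSIONS_SHEET = -1

  readonly addressMapping = new AddressMappingStub()
  readonly namedExpressions: NamedExpressionsStub

  constructor() {
    this.namedExpressions = this.initializeNamedExpressions()
  }

  // Creates the store and its backing sheet in one place, so a later
  // refactoring cannot move one without the other.
  private initializeNamedExpressions(): NamedExpressionsStub {
    this.addressMapping.addSheet(EngineSketch.NAMED_EXPRESSIONS_SHEET)
    return new NamedExpressionsStub(EngineSketch.NAMED_EXPRESSIONS_SHEET)
  }
}

const engineSketch = new EngineSketch()
```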

Do we have the name of the named expression in the changes list, or is it only { sheet: -1, ... }? I think it would be helpful to have it as a property there.

wojciechczerniak · 2020-02-19T11:40:18Z

Note to self (named expressions are not yet available within the syntax, so this has to wait)

This is awesome 🏆 as it solves #27 and #5 at the same time. We should add a test with

    const engine = HyperFormula.buildFromArray([
      ['=TRUE()', '=TRUE', '=A1=B1'],
      ['=FALSE()', '=FALSE', '=A2=B2'],
      ['=FALSE', '=TRUE', '=A3=B3'],
    ])

    engine.addNamedExpression('TRUE', '=True()')
    engine.addNamedExpression('FALSE', '=False()')

    expect(engine.getCellValue('C1')).toEqual(true)
    expect(engine.getCellValue('C2')).toEqual(true)
    expect(engine.getCellValue('C3')).toEqual(false)

So we can confirm it and close both issues.

swistak35 · 2020-02-21T08:30:44Z

PRs: #167 #192

As discussed, calculateFormula is now a "fire and forget" calculation, without notifying about changes or storing the formula anywhere.
Questions:

calculateFormula now takes only a formula string. Shouldn't we also pass a sheet? I imagine we may want to calculate a formula in the context of a specific worksheet: =A1 is unambiguous within a worksheet but unclear without one. Currently, the behaviour is undefined if someone uses a reference without a sheet. We definitely want to do something about it, but I don't know what:

either always pass some sheet (I'd like that one)
or go through the whole AST, check whether any references lack a sheet, and if so declare the formula invalid
I'd prefer the first option, because the second is more work and less flexible at the same time :)

validateFormula / normalizeFormula -- it wasn't exactly specified what these should do, so I've implemented what I described in a comment above ( Formula helpers #24 (comment) )

wojciechczerniak · 2020-02-21T09:42:10Z

calculateFormula now takes only a formula string. Shouldn't we also pass a sheet? I imagine we may want to calculate a formula in the context of a specific worksheet: =A1 is unambiguous within a worksheet but unclear without one. Currently, the behaviour is undefined if someone uses a reference without a sheet. We definitely want to do something about it, but I don't know what:

For that reason I've proposed an additional argument to the API:

 public calculateFormula(formula: string, cellPosition: SimpleCellAddress): any;

where cellPosition is the "context" in which the formula is theoretically located, so that all relative references can be calculated in this context. We had a short discussion on Slack https://handsoncode.slack.com/archives/CP3FX2DBM/p1579187598009700 but later the subject shifted a little bit.
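To illustrate the context idea, here is a minimal sketch of how a reference without a sheet name could fall back to the context's sheet. `columnIndex`, `resolveReference` and the name-to-id map are hypothetical helpers written for this example; they are not HyperFormula APIs.

```typescript
interface SimpleCellAddress { sheet: number; col: number; row: number }

// Converts column letters to a zero-based index: A -> 0, Z -> 25, AA -> 26.
function columnIndex(letters: string): number {
  const upper = letters.toUpperCase()
  let n = 0
  for (let i = 0; i < upper.length; i++) {
    n = n * 26 + (upper.charCodeAt(i) - 64)
  }
  return n - 1
}

// Resolves 'A1' or 'Sheet2!B3'. A reference without a sheet name
// uses the sheet of the context address.
function resolveReference(
  ref: string,
  context: SimpleCellAddress,
  sheetIds: Map<string, number>,
): SimpleCellAddress {
  const m = /^(?:([^!]+)!)?([A-Za-z]+)([1-9][0-9]*)$/.exec(ref)
  if (m === null) throw new Error(`Not a cell reference: ${ref}`)
  const sheetName: string | undefined = m[1]
  let sheet = context.sheet
  if (sheetName !== undefined) {
    const id = sheetIds.get(sheetName)
    if (id === undefined) throw new Error(`Unknown sheet: ${sheetName}`)
    sheet = id
  }
  return { sheet, col: columnIndex(m[2]), row: Number(m[3]) - 1 }
}

const sheetIds = new Map<string, number>([['Sheet1', 0], ['Sheet2', 1]])
const context: SimpleCellAddress = { sheet: 1, col: 0, row: 0 }
const implicit = resolveReference('A1', context, sheetIds)        // resolved on sheet 1
const explicit = resolveReference('Sheet1!B3', context, sheetIds) // resolved on sheet 0
```

Passing only a sheet instead of a full address would correspond to a context whose column and row are always 0.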

I'd prefer the first option, because the second is more work and less flexible at the same time :)

Yup, I agree. We should give flexibility but avoid any magic assumptions.

validateFormula / normalizeFormula -- it wasn't exactly specified what these should do, so I've implemented what I described in a comment above ( #24 (comment) )

I think we understood those the same way, but here's my idea:
validateFormula - returns true if the string starting with = is a correct, complete, parsable formula, so it won't return the #ERR! error (the evaluation result is unimportant). Returns false otherwise.
normalizeFormula - returns the formula string with function names uppercased, sheet names in the case they were registered with (same for named expressions), and some whitespace removed (e.g. from ranges) while other whitespace is preserved. The result should be exactly as if we stored the formula in a worksheet and requested it with getCellFormula.
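A toy sketch of the two behaviours described above, to make the intent concrete. Both functions are hypothetical and deliberately simplistic: the real helpers would go through the parser, whereas this version only checks the leading =, parentheses and string quotes, and only uppercases function names.

```typescript
// Toy validation: leading '=', balanced parentheses, closed string literals.
// A real implementation would parse the formula instead.
function validateFormulaToy(formula: string): boolean {
  if (formula.length < 2 || formula.charAt(0) !== '=') return false
  let depth = 0
  let inString = false
  for (let i = 1; i < formula.length; i++) {
    const ch = formula.charAt(i)
    if (ch === '"') { inString = !inString; continue }
    if (inString) continue
    if (ch === '(') depth++
    if (ch === ')' && --depth < 0) return false
  }
  return depth === 0 && !inString
}

// Toy normalization: uppercases identifiers directly followed by '(' and
// leaves double-quoted string literals untouched. Sheet names, named
// expressions and whitespace are not handled here.
function normalizeFormulaToy(formula: string): string {
  const pattern = /"(?:[^"]|"")*"|([A-Za-z_][A-Za-z0-9_.]*)(?=\()/g
  return formula.replace(pattern, (match: string, fn?: string) =>
    fn !== undefined ? fn.toUpperCase() : match)
}

const normalized = normalizeFormulaToy('=sum(A1:B2)+If(A1>0,"sum(x)",1)')
// normalized === '=SUM(A1:B2)+IF(A1>0,"sum(x)",1)'
const complete = validateFormulaToy('=SUM(A1:A3)') // true
const incomplete = validateFormulaToy('=SUM(A1')   // false
```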

swistak35 · 2020-02-25T21:20:21Z

This depends on the scope. We can have one global name but each sheet can have its own.

yes, but for now only workbook level is supported (because my understanding is that we want to do #126 in March and I don't want to constantly increase scope in #24 :) )

What happens when we remove a named expression? From what I see its address is lost forever. We always increment it and will never reuse a row. Can it create a problem for us when multiple named expressions are created and removed over time?

No, note that we specifically ask for Sparse strategy for address mapping, so it's performant with sheet which has a lot of gaps, and doesn't lose memory.
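A minimal sketch of why never reusing rows does not have to cost memory, assuming a Map-backed store stands in for the sparse strategy. `SparseColumn` is hypothetical; the real SparseStrategy may be organised differently.

```typescript
class SparseColumn<T> {
  private readonly cells = new Map<number, T>()
  private nextRow = 0

  // Rows are handed out monotonically and never reused.
  append(value: T): number {
    const row = this.nextRow++
    this.cells.set(row, value)
    return row
  }

  // Deleting the entry releases its storage; the row number stays retired.
  remove(row: number): boolean {
    return this.cells.delete(row)
  }

  // Number of entries actually held, independent of the highest row used.
  get storedCount(): number { return this.cells.size }
}

const column = new SparseColumn<string>()
for (let i = 0; i < 1000; i++) {
  column.remove(column.append(`=Sheet1!A${i + 1}`))
}
const survivingRow = column.append('=Sheet1!B1')
```

After the loop, `survivingRow` is 1000 while `column.storedCount` is 1: the gaps cost nothing because only existing entries are stored.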

To a private helper method initializeNamedExpressions, so someone in the future would know straight away that the -1 sheet is created for them and that the two have to go together when refactoring.

Good catch!

Do we have the name of the named expression in the changes list, or is it only { sheet: -1, ... }? I think it would be helpful to have it as a property there.

Actually it's the other way around. We have a name (which is unique in given scope), we (will) have a scope (and sheet, if scope is worksheet). I didn't want -1 hack to leak outside the engine, so from the user point of view it's transparent where we store these named expressions.
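A minimal sketch of how changes on the internal sheet could be exported by name so that -1 never reaches the user. All type names, the `kind` discriminator and `exportChanges` are hypothetical and only illustrate the idea.

```typescript
interface SimpleCellAddress { sheet: number; col: number; row: number }
type CellValue = number | string | boolean | null

interface InternalChange { address: SimpleCellAddress; value: CellValue }

// Hypothetical exported shapes: a regular cell change or a named expression change.
type ExportedChange =
  | { kind: 'cell'; address: SimpleCellAddress; value: CellValue }
  | { kind: 'namedExpression'; name: string; value: CellValue }

const NAMED_EXPRESSIONS_SHEET = -1

function exportChanges(changes: InternalChange[], nameByRow: Map<number, string>): ExportedChange[] {
  const out: ExportedChange[] = []
  for (const change of changes) {
    if (change.address.sheet !== NAMED_EXPRESSIONS_SHEET) {
      out.push({ kind: 'cell', address: change.address, value: change.value })
      continue
    }
    const name = nameByRow.get(change.address.row)
    // Rows without a registered name are skipped, so the internal sheet never leaks out.
    if (name !== undefined) out.push({ kind: 'namedExpression', name, value: change.value })
  }
  return out
}

const exported = exportChanges(
  [
    { address: { sheet: 0, col: 0, row: 0 }, value: 1 },
    { address: { sheet: -1, col: 0, row: 0 }, value: 11 },
  ],
  new Map<number, string>([[0, 'myName']]),
)
```

A scope field could be added to the named expression variant later without exposing where the expression is stored.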

wojciechczerniak · 2020-02-25T22:05:19Z

yes, but for now only workbook level is supported (because my understanding is that we want to do #126 in March and I don't want to constantly increase scope in #24 :) )

Fair point. I just want to keep other features in mind so we don't have to break the API next month.

No, note that we specifically ask for Sparse strategy for address mapping, so it's performant with sheet which has a lot of gaps, and doesn't lose memory.

Awesome 👍

Actually it's the other way around. We have a name (which is unique in given scope), we (will) have a scope (and sheet, if scope is worksheet). I didn't want -1 hack to leak outside the engine, so from the user point of view it's transparent where we store these named expressions.

True, we should keep -1 private. Either add a sheet number to indicate its scope, or name the property scope. The lack of a scope/sheet indicates that it's a global expression.

swistak35 · 2020-02-26T23:40:08Z

I think that #192 exhausts this issue. There are still things to do in terms of named expressions, but this one is done.

For that reason I've proposed an additional argument to the API:

I've added sheetName as an additional argument for now. Is there any use case where it would be suitable to pass the whole "mental position" of the named expression instead of only the sheet? Changing to a position is easy; I just don't want to add a broader API that will later need to be supported.

wojciechczerniak · 2020-02-27T07:02:45Z

I've added sheetName as an additional argument for now. Is there any use case where it would be suitable to pass the whole "mental position" of the named expression instead of only the sheet? Changing to a position is easy; I just don't want to add a broader API that will later need to be supported.

I would prefer sheetName as well. But what about the OFFSET function? Can you think of any other function that would be impossible to calculate without the position?

EDIT: Is it optional?

EDIT2: The name changeNamedExpressionExpression is confusing

hyperformula/src/HyperFormula.ts

Line 859 in e1d91aa

    
           public changeNamedExpressionExpression(expressionName: string, newExpression: RawCellContent): ExportedChange[] {

swistak35 · 2020-02-27T08:52:19Z

changeExpressionOfNamedExpression?

wojciechczerniak · 2020-02-27T09:49:14Z

Then we would break the naming pattern. Wouldn't changeNamedExpression be sufficient? Or updateNamedExpression?


swistak35 · 2020-03-05T21:34:25Z

I would prefer sheetName as well. But what about the OFFSET function? Can you think of any other function that would be impossible to calculate without the position?

I think that assuming we always calculate the formula from the A1 position is reasonable, at least for now. We can change that to a full "cell address" later, if someone really needs it.

EDIT: Is it optional?

No. There's always some context, I could make it default to "first sheet" or something like that, but isn't that more confusing?

Anything else to do here?


wojciechczerniak · 2020-03-06T07:52:24Z

Nope. All done in #167 #192 and #238

We will check with @aninde when the reference is needed

This was referenced Nov 17, 2019

MVP #1

Closed

Public API proposal #7

Closed

wojciechczerniak added the API (Public methods and properties), Feature (Something we can add later on without introducing a breaking change), and Integration labels Dec 6, 2019

wojciechczerniak added this to the January 2020 milestone Dec 6, 2019

This was referenced Jan 28, 2020

New formula plugin handsontable/handsontable#6466

Closed

Add events to notify about value and formula updates #135

Closed

wojciechczerniak modified the milestones: January 2020, Februrary 2020 Feb 3, 2020

swistak35 self-assigned this Feb 7, 2020

swistak35 added a commit that referenced this issue Feb 7, 2020

Add initial version of calculateFormula()

bf75789

Issue: #24

swistak35 added a commit that referenced this issue Feb 12, 2020

Add initial version of calculateFormula()

4771cca

Issue: #24

swistak35 mentioned this issue Feb 12, 2020

Formula helpers and named expressions with formulas #167

Merged

7 tasks

swistak35 added a commit that referenced this issue Feb 12, 2020

Add initial version of calculateFormula()

e260fe2

Issue: #24

swistak35 added a commit that referenced this issue Feb 14, 2020

Add initial version of calculateFormula()

3b971f5

Issue: #24

swistak35 mentioned this issue Feb 20, 2020

Formula helpers and named expressions (part 2) #192

Merged

3 tasks

This was referenced Feb 24, 2020

First Release #54

Closed

[HyperFormula] Testing handsontable/quality#3

Closed

swistak35 added a commit that referenced this issue Mar 5, 2020

Rename API method

e13a941

Issue: #24

swistak35 added a commit that referenced this issue Mar 5, 2020

Rename API method

4583c54

Issue: #24

swistak35 mentioned this issue Mar 5, 2020

Rename API method #238

Merged

7 tasks

swistak35 added a commit that referenced this issue Mar 5, 2020

Rename API method (#238)

91c74ee

Issue: #24

wojciechczerniak closed this as completed Mar 6, 2020

wojciechczerniak mentioned this issue Mar 6, 2020

Named Expressions: API and structure (aka Stage 1) #239

Closed

24 tasks

aninde mentioned this issue Mar 17, 2020

[HyperFormula] Testing API - stage 1 handsontable/quality#35

Closed

95 tasks

wojciechczerniak unassigned swistak35 Jun 24, 2020
